Gene Cagg_2212 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_2212 
Symbol 
ID7266785 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp2710628 
End bp2711947 
Gene Length1320 bp 
Protein Length439 aa 
Translation table11 
GC content60% 
IMG OID643567043 
Productglycosyl transferase group 1 
Protein accessionYP_002463531 
Protein GI219849098 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCATCC TCATTCCGTC AGATGTCTTC CCACCCGATG GGCGAGGCGG AGCAGCGTGG 
AGTGCGTATG CCTTAGCTAT CGGCTTACAA ACACGTGGCC ATCAGGTACG GGTAATCGTA
CCGTGCCGCA CACCGTGCCG TAAGCCTCTT ATTGACCATA ATGGTCATCT GCCGGTGCAG
CGTGTCCCAT ATGCTGCACC CTCCATTCCC TTCATTCAAA ACTATTTTCG CTACGAACGC
TTCTGGCCGC GCCTGGCCCA AACGCTGATG ACCACCGTGC GCGACCTCGG TGGGGTGGAT
ATCATTCACG CCCAACACAC CCAAACCGCT GCTGCTGCCG TCATAGCCGG TGCTTCCCTC
GGCGTACCGG TCGTCGTCAC AGTACGTGAC CATTGGCCGT GGGACTATTT TGCCACCGGG
TTACACGGCA ACCGGATTCC ACACCCGGGT CGATCGCTCC CGGCGATTGC CACCGATCTC
ATCGGTCGTC TCGGCCCGCT ACGCGGGGTA TTGGCGTGGC CGGCCATTCC CTACATCGTC
GCCCATCTGC GCAAGCGTGC AACCCTGCTC GCTCAAGCTG ATGCGGTGAT CGCACCCAGC
AACTACATCG CCCGCCGGCT GACCGGGATC GTCGATCCGG CGCGCATTCA CGTGCTACCC
AACATGGTTG ATATTGCCGC GAGTGACGCA ATCGCTGCAA CGCCACCTCA GACCACGTGG
GAAGGTACCC TTGTGCTGTT TGCCGGCAAA CTCGAAGCAA ACAAAGGGGC GGAACTCCTG
ATTGACGTGA TCAACGACCT TGCCACCCGC CAGAACGAGC TTCCTCCCTT CACCCTCCTC
ATTGCCGGTG ATGGTGCCTT GCGCCCTGCG ATTGATCGTG CGTTAGCGAC CAGTGGCGTA
TCTGGAAGGG TACTGGCCTG GGTCGAACAC GACGAACTAC TCCGTCTGAC GGCTCGTTGC
GATGTCCTAC TCTTCCCATC AAATTGGGGT GAACCGCTAG CCCGTGCGCT CATCGAAGCA
GCCGCGCTCG GCGCACCGAT CATTGCTATG CCGACCGGCG GTACCCCCGA CATCATCAGG
CATGGTGAAA CCGGCATCCT CGCCCCAACC GTCGCGACGA TGGTAGAGTG GGTTATCCGA
CTACTCAACG ACCCGGCTCT GCGTCAACGC CTTGGCGCTG CCGCGCGCGC CACCGCCGCT
GAACGTTTCG ACGCCAACCG GTTACTACCC CGCTACGAGG CGCTCTATAC GGCACTAGCT
CACCGGGATA GGGGCGGTAA CATGCCCACA CGCCTCACGG CAAGATACCA AATTCATTAA
 
Protein sequence
MRILIPSDVF PPDGRGGAAW SAYALAIGLQ TRGHQVRVIV PCRTPCRKPL IDHNGHLPVQ 
RVPYAAPSIP FIQNYFRYER FWPRLAQTLM TTVRDLGGVD IIHAQHTQTA AAAVIAGASL
GVPVVVTVRD HWPWDYFATG LHGNRIPHPG RSLPAIATDL IGRLGPLRGV LAWPAIPYIV
AHLRKRATLL AQADAVIAPS NYIARRLTGI VDPARIHVLP NMVDIAASDA IAATPPQTTW
EGTLVLFAGK LEANKGAELL IDVINDLATR QNELPPFTLL IAGDGALRPA IDRALATSGV
SGRVLAWVEH DELLRLTARC DVLLFPSNWG EPLARALIEA AALGAPIIAM PTGGTPDIIR
HGETGILAPT VATMVEWVIR LLNDPALRQR LGAAARATAA ERFDANRLLP RYEALYTALA
HRDRGGNMPT RLTARYQIH