Gene Cagg_0321 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_0321 
Symbol 
ID7267502 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp402244 
End bp403338 
Gene Length1095 bp 
Protein Length364 aa 
Translation table11 
GC content54% 
IMG OID643565189 
Productcobalamin synthesis protein P47K 
Protein accessionYP_002461703 
Protein GI219847270 
COG category[R] General function prediction only 
COG ID[COG0523] Putative GTPases (G3E family) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.276547 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAGTGC CTCAGCAACC AATTCCGGTA ACTATTCTGA CCGGCTTTCT CGGTGCCGGG 
AAGACGACGT TGCTCAATCG CATCTTGAAA GCCGATCATG GCTTGCGGGT TGCCGTTCTC
GTCAACGATT TTGGTTCGAT CAATATCGAC GCGCAGCTCG TAATTGATGT CGAGAGTGAA
ACGATCTCCT TGGCAAACGG GTGCGTCTGC TGTACGATCC GCGATGATCT ACTCCAGACA
GCATTTACCT TGTTAGAAAG ACCACAGCCG CCAGAGTATC TGATTATTGA GGCAAGTGGC
GTAAGTGATC CGTGGGCAAT TGCCGATACC TTCCTGCTAC CGGAGCTACG CACCTATTTC
CGCCTCGATA GCGTGATCAC GGTTATTGAT GCCGAGTATG TACACCGGCA ACCCTCGTAT
GAGTCGTTGA TTGTCGAACA AATCAGCGCT GCTGATATTG TCGTGCTTAA CAAAATTGAT
CTCGTGCCGT CCGATCAGTT AACCGAGCTT GAAGCATGGG TGCGGCGGAT TGTCCCACAG
GCGCGTATTC TGCCGGCGAT GTATGCCGAT GTACCGCTGC GTTTGCTCTT AGATGTGGGC
CGGTTGCAAC ATCGTATGCC ACTCCCGCAT CTCGTTGTCG CTGCGGAATC CGACCATCAC
GACCATGATC ATCACGACCA CGACCACGAC CACGGCTCTG CATTTGCAAC GTGGAGCTAT
GTTGCCGATC AACCGTTTAC ACTCCATGCG TTTCGGCGCG TGATTCTTGA CTTACCGCCG
GCGATCTTTC GTGCGAAGGG ATTAGTCTAT CTTGCCGAAG TACCGCAGCG TCGAGCGGTA
TTGCAATTGG TCGGATCGCG TGTGCAGGTC ACGGTTGGTG AGCCATGGGG CAATCAGCCA
CCGCAGACAC AGATGGTCTT TATCGGCTTG CCGGGCCAGC TCGATGAGAC AACACTGCGT
TCGGCATTTG ACCGTTGCTT GATCGAGCAC GAAGCAGGAA CGACCGAGTC GGTTGCTCTC
CAACAAACCT GGCAACGCCC CCAGGTGTCG TCAGCTAACA GCAACGCCGA CGGTGATCAA
TCAATCGTGA GTTGA
 
Protein sequence
MEVPQQPIPV TILTGFLGAG KTTLLNRILK ADHGLRVAVL VNDFGSINID AQLVIDVESE 
TISLANGCVC CTIRDDLLQT AFTLLERPQP PEYLIIEASG VSDPWAIADT FLLPELRTYF
RLDSVITVID AEYVHRQPSY ESLIVEQISA ADIVVLNKID LVPSDQLTEL EAWVRRIVPQ
ARILPAMYAD VPLRLLLDVG RLQHRMPLPH LVVAAESDHH DHDHHDHDHD HGSAFATWSY
VADQPFTLHA FRRVILDLPP AIFRAKGLVY LAEVPQRRAV LQLVGSRVQV TVGEPWGNQP
PQTQMVFIGL PGQLDETTLR SAFDRCLIEH EAGTTESVAL QQTWQRPQVS SANSNADGDQ
SIVS