Gene Cagg_3026 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_3026 
Symbol 
ID7266557 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp3681585 
End bp3682790 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content62% 
IMG OID643567848 
Productelongation factor Tu 
Protein accessionYP_002464322 
Protein GI219849889 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0050] GTPases - translation elongation factors 
TIGRFAM ID[TIGR00231] small GTP-binding protein domain
[TIGR00485] translation elongation factor TU 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00137885 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCCAAAC AGAAATTTGA GCGGACGAAA CCGCACGTCA ACGTCGGCAC CATCGGCCAC 
GTCGACCACG GGAAGACGAC GCTGACCGCA GCCATTACCA AGGTGCTCTC CCTCAAGGGT
GCCGCCCAGT TTATGGCGTA TGACCAGATC GACAACGCGC CCGAAGAGCG CGCCCGTGGT
ATTACCATCG CTATTCGCCA CGTCGAGTAT CAGACCGACA AGCGCCACTA TGCCCACGTC
GACTGCCCCG GTCACGCCGA CTATATCAAG AATATGATTA CCGGCGCAGC CCAAATGGAC
GGGGCCATCC TCGTGGTGAG CGCGCCCGAC GGCCCGATGC CGCAGACTCG TGAGCACATC
CTGCTGGCGC GCCAGGTGCA GGTGCCGGCC ATCGTTGTCT TCCTCAACAA GGTAGATATG
ATGGACGACC CGGAGTTGCT CGAGCTGGTC GAGTTGGAGC TGCGCGAGCT GCTGAGCAAG
TACGGCTTCC CCGGCGATGA GATTCCGATT GTGCGAGGCA GCGCCCGCAA TGCGCTGGAA
AGCCCGAGCA AGGATATCAA CGCGCCGGAG TATGCCTGCA TCCTTGAGCT GATGAACGCG
GTCGATGAGT ACATCCCGAC GCCGCAGCGG GCGGTTGACC AGCCGTTCCT GATGCCGATT
GAAGACGTGT TCGGGATTAA GGGCCGCGGT ACGGTGGTGA CGGGCCGCAT CGAGCGCGGC
AAGGTGAAGG TCGGAGACAC GGTCGAGATT GTGGGAATGA CCGACGAAGC GCCGCGGCGA
ACGGTGGTGA CCGGCGTGGA GATGTTCCAG AAGACGCTCG ATGAAGGGAT TGCCGGCGAC
AACGTGGGCT GTCTGCTGCG TGGCATTGAG CGGAACGAGG TAGAGCGCGG GCAAGTGTTG
TGTGCGCCGG GGAGCATCAA GCCGCACAAG AAGTTTGAGG CGCAGGTCTA CGTGTTGAAG
AAGGAAGAGG GTGGGCGCCA CACGCCGTTC TTCTCCGGCT ACCGACCACA GTTCTACATC
CGCACCACCG ATGTGACGGG GGCGATTAGC TTGCCGGCGG GGATGGAGAT GGTGATGCCG
GGCGACAACG TGGTGATGAC GATTGAGCTG ATTGTGCCGG TGGCGATTGA AGAGGGCCTC
CGCTTCGCCA TCCGTGAGGG TGGGCGCACC GTCGGTGCAG GTGTCGTCAC CAAGATCCTT
GATTAG
 
Protein sequence
MAKQKFERTK PHVNVGTIGH VDHGKTTLTA AITKVLSLKG AAQFMAYDQI DNAPEERARG 
ITIAIRHVEY QTDKRHYAHV DCPGHADYIK NMITGAAQMD GAILVVSAPD GPMPQTREHI
LLARQVQVPA IVVFLNKVDM MDDPELLELV ELELRELLSK YGFPGDEIPI VRGSARNALE
SPSKDINAPE YACILELMNA VDEYIPTPQR AVDQPFLMPI EDVFGIKGRG TVVTGRIERG
KVKVGDTVEI VGMTDEAPRR TVVTGVEMFQ KTLDEGIAGD NVGCLLRGIE RNEVERGQVL
CAPGSIKPHK KFEAQVYVLK KEEGGRHTPF FSGYRPQFYI RTTDVTGAIS LPAGMEMVMP
GDNVVMTIEL IVPVAIEEGL RFAIREGGRT VGAGVVTKIL D