Gene Cthe_2081 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2081 
Symbol 
ID4810679 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2473651 
End bp2475399 
Gene Length1749 bp 
Protein Length582 aa 
Translation table11 
GC content47% 
IMG OID640107488 
Productsmall GTP-binding protein 
Protein accessionYP_001038481 
Protein GI125974571 
COG category[R] General function prediction only 
COG ID[COG2262] GTPases 
TIGRFAM ID[TIGR00231] small GTP-binding protein domain
[TIGR03156] GTP-binding protein HflX 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00812611 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGGGCG CGAAAGATGA ATTTCTGCCG GCGGGCCTTG CGGTCAAAAT GGCGGAACTT 
ACAGGGAAAA TCAACCGCGA AATAGCGGTG TATATAAACA GAAAAGGGAA TATAATTGAC
GTAAGTGTGG GAGACAGCAG CACCGTTTCA CTTCCGGAAG TGGAAGGAAG AAAGGATTTG
GCACGCCTTG TCGGGGTAAG ATGCATCCAT ACTCATCCCA ACGGTGAGGG AATGGTTTCA
CTGGTGGATT TAAATTCCCT GGTTAAGATG AGACTGGATG CCATGGTGGC AGTCGGAGTG
AAAGACGGGC GGATAACGGA AATATACGCG GCTTTGCCTG TGAGGGATGA AAACGGGAAT
TTGGGCAAAA CCACCGTGTA TGGACCCTTT GGCAAGGACG ACAAAAGAAT GAATAGGCTT
TGGGACATAA TACTTGAGAC GGACAAGCTT AAAAGTACGG TGGTGCACTT AAATGAGAGC
GATGAAGAAA GAGCCGTGCT GGTTGGGCTT GAGACTTCGT CAAAGGTCAT TGTGGGAGGA
AAAAGCGAAG GAGAAAGATC TTTGGACGAA CTGGAAGAGC TGGCCCGCAC TGCCGGAGCG
GTTGTTCTGG AAAAAATAAT ACAGAGAAGA CCTGCAAAAG ACCCGGCATT TTTTATCGGA
AGGGGAAAAG TTGAGGAACT TTCTCTTATA TGCCAGGCTC TTGACGCCAA TCTGATAATT
TTTGACGACG AGCTTTCGGG AGTCCAGATG AGGAATATTG AAGAGATGAC AGGAGTAAAG
GTTGTGGACA GGACCACTTT GATTTTGGAC ATATTTGCCA AAAGGGCGCG TTCCCGGGAG
GGAAAACTTC AAGTGGAACT GGCCCAGCTA AAATACAGGG TATCGAGGCT TGTGGGTCTT
GGGACCCAGC TTTCAAGGCT CGGAGGCGGT ATAGGAACAA GAGGTCCGGG TGAGAAAAAA
CTGGAGGTTG ACAGAAGGCA TATAAAGAGA AGAATAAGCT TCCTTGAAGC ACAACTTAAG
GATGTGGAAA AGAGAAGAAA TTCTTTCAGG GAAAGCCGGA CAAGGAACGC CATACCCACC
ATTGCGCTGG TGGGATATAC CAACGCGGGA AAATCCACTC TTATGAACAG GTTGTGCGAA
AGCAACGTCC TGGCAGAAGA CAAACTCTTT GCAACTCTTG ACCCCACGAC GAGAAGTTTT
AGACTTTCGG ACGGAAGGGA AGCGCTTCTC ATTGACACGG TGGGATTTAT AAGAAAGCTC
CCTCATGAGT TGGTGGAGGC GTTCAAGTCA ACTCTTGAAG AGGCAGTGTA TGCGGACATG
CTGATTCATG TGGTGGATGC TTCCAATGAG GAGGCGGAAG AACAAGTAAA GGTTGTGAAC
GATATCCTTG AAAGTCTCGG TGCGGCAAAC AAACCTGTTA TCATGGCACT CAACAAGATG
GATATGGTAA AGGGCGGCCT GAGGCTTGCA ATATCCAATC CGAACGGCAG GATATTTGAA
ATATCTGCCG TTACAGGACA GGGAATAGAT GCCATGCTCG AAGGCATCAG GGAAATGCTG
CCCGAGGATG AAAAGGAGGT AAGACTTTTT ATACCTTACA GTGACGGATG GGTCATATCC
TATATTTATC AAAACGGAAG AATACTTGAG CAAGTTCACG GCGAGTCGGG GACCGAAGTA
AAAGCTTTGA TAAAAAAACA CAGACTGAAA CCTGTCAGGG CATATATTTG TGGGAAATAC
CCTGTCTGA
 
Protein sequence
MQGAKDEFLP AGLAVKMAEL TGKINREIAV YINRKGNIID VSVGDSSTVS LPEVEGRKDL 
ARLVGVRCIH THPNGEGMVS LVDLNSLVKM RLDAMVAVGV KDGRITEIYA ALPVRDENGN
LGKTTVYGPF GKDDKRMNRL WDIILETDKL KSTVVHLNES DEERAVLVGL ETSSKVIVGG
KSEGERSLDE LEELARTAGA VVLEKIIQRR PAKDPAFFIG RGKVEELSLI CQALDANLII
FDDELSGVQM RNIEEMTGVK VVDRTTLILD IFAKRARSRE GKLQVELAQL KYRVSRLVGL
GTQLSRLGGG IGTRGPGEKK LEVDRRHIKR RISFLEAQLK DVEKRRNSFR ESRTRNAIPT
IALVGYTNAG KSTLMNRLCE SNVLAEDKLF ATLDPTTRSF RLSDGREALL IDTVGFIRKL
PHELVEAFKS TLEEAVYADM LIHVVDASNE EAEEQVKVVN DILESLGAAN KPVIMALNKM
DMVKGGLRLA ISNPNGRIFE ISAVTGQGID AMLEGIREML PEDEKEVRLF IPYSDGWVIS
YIYQNGRILE QVHGESGTEV KALIKKHRLK PVRAYICGKY PV