Gene Information |
Locus tag | Cthe_2081 |
Symbol | |
ID | 4810679 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2473651 |
End bp | 2475399 |
Gene Length | 1749 bp |
Protein Length | 582 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 640107488 |
Product | small GTP-binding protein |
Protein accession | YP_001038481 |
Protein GI | 125974571 |
COG category | [R] General function prediction only |
COG ID | [COG2262] GTPases |
TIGRFAM ID | [TIGR00231] small GTP-binding protein domain [TIGR03156] GTP-binding protein HflX |
|
|
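The coordinates and lengths in the table above can be cross-checked arithmetically. The following is a minimal sketch, assuming 1-based inclusive coordinates and a protein length that excludes the stop codon; the function name `check_cds_lengths` is hypothetical and not part of any database API.

```python
def check_cds_lengths(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Cross-check CDS coordinates against the reported gene and protein lengths.

    Assumes 1-based inclusive coordinates and that the final codon is a
    stop codon that is not counted in the protein length.
    """
    span = end_bp - start_bp + 1          # inclusive span on the replicon
    codons, remainder = divmod(span, 3)   # a complete CDS is a whole number of codons
    return {
        "span_bp": span,
        "span_matches_gene_length": span == gene_length_bp,
        "whole_codons": remainder == 0,
        "protein_matches": codons - 1 == protein_length_aa,
    }


# Values taken from the record for Cthe_2081.
result = check_cds_lengths(2473651, 2475399, 1749, 582)
print(result)
```

For this record the span is 2475399 - 2473651 + 1 = 1749 bp, which is 583 codons, or 582 amino acids plus one stop codon.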
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00812611 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGGGCG CGAAAGATGA ATTTCTGCCG GCGGGCCTTG CGGTCAAAAT GGCGGAACTT ACAGGGAAAA TCAACCGCGA AATAGCGGTG TATATAAACA GAAAAGGGAA TATAATTGAC GTAAGTGTGG GAGACAGCAG CACCGTTTCA CTTCCGGAAG TGGAAGGAAG AAAGGATTTG GCACGCCTTG TCGGGGTAAG ATGCATCCAT ACTCATCCCA ACGGTGAGGG AATGGTTTCA CTGGTGGATT TAAATTCCCT GGTTAAGATG AGACTGGATG CCATGGTGGC AGTCGGAGTG AAAGACGGGC GGATAACGGA AATATACGCG GCTTTGCCTG TGAGGGATGA AAACGGGAAT TTGGGCAAAA CCACCGTGTA TGGACCCTTT GGCAAGGACG ACAAAAGAAT GAATAGGCTT TGGGACATAA TACTTGAGAC GGACAAGCTT AAAAGTACGG TGGTGCACTT AAATGAGAGC GATGAAGAAA GAGCCGTGCT GGTTGGGCTT GAGACTTCGT CAAAGGTCAT TGTGGGAGGA AAAAGCGAAG GAGAAAGATC TTTGGACGAA CTGGAAGAGC TGGCCCGCAC TGCCGGAGCG GTTGTTCTGG AAAAAATAAT ACAGAGAAGA CCTGCAAAAG ACCCGGCATT TTTTATCGGA AGGGGAAAAG TTGAGGAACT TTCTCTTATA TGCCAGGCTC TTGACGCCAA TCTGATAATT TTTGACGACG AGCTTTCGGG AGTCCAGATG AGGAATATTG AAGAGATGAC AGGAGTAAAG GTTGTGGACA GGACCACTTT GATTTTGGAC ATATTTGCCA AAAGGGCGCG TTCCCGGGAG GGAAAACTTC AAGTGGAACT GGCCCAGCTA AAATACAGGG TATCGAGGCT TGTGGGTCTT GGGACCCAGC TTTCAAGGCT CGGAGGCGGT ATAGGAACAA GAGGTCCGGG TGAGAAAAAA CTGGAGGTTG ACAGAAGGCA TATAAAGAGA AGAATAAGCT TCCTTGAAGC ACAACTTAAG GATGTGGAAA AGAGAAGAAA TTCTTTCAGG GAAAGCCGGA CAAGGAACGC CATACCCACC ATTGCGCTGG TGGGATATAC CAACGCGGGA AAATCCACTC TTATGAACAG GTTGTGCGAA AGCAACGTCC TGGCAGAAGA CAAACTCTTT GCAACTCTTG ACCCCACGAC GAGAAGTTTT AGACTTTCGG ACGGAAGGGA AGCGCTTCTC ATTGACACGG TGGGATTTAT AAGAAAGCTC CCTCATGAGT TGGTGGAGGC GTTCAAGTCA ACTCTTGAAG AGGCAGTGTA TGCGGACATG CTGATTCATG TGGTGGATGC TTCCAATGAG GAGGCGGAAG AACAAGTAAA GGTTGTGAAC GATATCCTTG AAAGTCTCGG TGCGGCAAAC AAACCTGTTA TCATGGCACT CAACAAGATG GATATGGTAA AGGGCGGCCT GAGGCTTGCA ATATCCAATC CGAACGGCAG GATATTTGAA ATATCTGCCG TTACAGGACA GGGAATAGAT GCCATGCTCG AAGGCATCAG GGAAATGCTG CCCGAGGATG AAAAGGAGGT AAGACTTTTT ATACCTTACA GTGACGGATG GGTCATATCC TATATTTATC AAAACGGAAG AATACTTGAG CAAGTTCACG GCGAGTCGGG GACCGAAGTA AAAGCTTTGA TAAAAAAACA CAGACTGAAA CCTGTCAGGG CATATATTTG TGGGAAATAC CCTGTCTGA
|
Protein sequence | MQGAKDEFLP AGLAVKMAEL TGKINREIAV YINRKGNIID VSVGDSSTVS LPEVEGRKDL ARLVGVRCIH THPNGEGMVS LVDLNSLVKM RLDAMVAVGV KDGRITEIYA ALPVRDENGN LGKTTVYGPF GKDDKRMNRL WDIILETDKL KSTVVHLNES DEERAVLVGL ETSSKVIVGG KSEGERSLDE LEELARTAGA VVLEKIIQRR PAKDPAFFIG RGKVEELSLI CQALDANLII FDDELSGVQM RNIEEMTGVK VVDRTTLILD IFAKRARSRE GKLQVELAQL KYRVSRLVGL GTQLSRLGGG IGTRGPGEKK LEVDRRHIKR RISFLEAQLK DVEKRRNSFR ESRTRNAIPT IALVGYTNAG KSTLMNRLCE SNVLAEDKLF ATLDPTTRSF RLSDGREALL IDTVGFIRKL PHELVEAFKS TLEEAVYADM LIHVVDASNE EAEEQVKVVN DILESLGAAN KPVIMALNKM DMVKGGLRLA ISNPNGRIFE ISAVTGQGID AMLEGIREML PEDEKEVRLF IPYSDGWVIS YIYQNGRILE QVHGESGTEV KALIKKHRLK PVRAYICGKY PV
|
| |
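The gene and protein sequences above can be compared by translating the nucleotide sequence. The sketch below is illustrative only: it assumes the gene sequence is shown 5' to 3' on the coding strand (it begins with ATG), strips the spaces that separate the 10-base groups, and uses the standard codon assignments, which translation table 11 shares for elongation codons. The helper name `translate` is hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(grouped_sequence):
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    dna = "".join(grouped_sequence.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base groups of the gene sequence from the record.
first_30_bases = "ATGCAGGGCG CGAAAGATGA ATTTCTGCCG"
print(translate(first_30_bases))  # MQGAKDEFLP
```

The output matches the first ten residues of the listed protein sequence; passing the full gene sequence to the same function would allow the complete comparison.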