Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2711 |
Symbol | |
ID | 4810705 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3198798 |
End bp | 3199979 |
Gene Length | 1182 bp |
Protein Length | 393 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 640108130 |
Product | hypothetical protein |
Protein accession | YP_001039103 |
Protein GI | 125975193 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0000000024275 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAACAGCC CTGAAAACAT ACCAAAGATG CGGCAACCTT CGAGCAGAAT GAGCATACTT CTATTTTTCG TACTTGTGAT CATAGTTGTG TGTACGGCTG CTGTTGCCTA TCTCAACAGC AAGGGTATAG ACATTAAAAG TGTAAGCATA AGGGATATAA TAGCCAATGG GTTTTTCGTC GGAGATAAAG ATGTGTATGA AGTAACCGGC ACCTTGTTAA GATATGAGGA CAGTTTGAAC TCTGAGTTTG GTGCTTACAA GGGCTATATT GTAAGATGCA CGAAAAATGG TTTGGAATAT CTCAACAGAA ATGGTGAAGA GGAGTGGATA TATACTGTTT CTTTGAATTT GCCGTATTTT AAGTCCGCGG GCGAGTATCT TTTGGTTGCA GGGCTCGAAT CAAAAGACAT TTATGTATTT GCCGGCAAGG AGAAAAAGTG GGAAAAAAAG CTGGATTACA GTATAATAAA TGCAAATATA AACAGCGCGG GTTATGTTAC CGTCCTTCAC AAAGCAGAGA GGGATAAGTC TGCTGTATCA GTGTACAACA AACAGGGAGT TTTTCTTTTT ACCAAAATAT TGGGAGAAAC TTACGCCATC TCGTCGGAGG TGTCTCCCTC CGGCAGGGAG GTTCTGATAA ACTGTGTTGA TATTTCGGGT GTAAGTATAA ATACAGGACT TCATTTGTAT ACGATTTCAG GTGGGAATGT AGCTGGCAAA ACTTTCGAGA ATGTTATTTT TCCTTCAATC CGCTATTTGA ATGACGATAC CATTGTGGCA GTGTCCGACT CAGCGATTTA TCTGTTTGGA CAAAATTTTG AAGAAAAGTG GAGCAGGACA ATAAGCGGCA AAGTATACAG CATGGACGTT ATGAAAGGCA GATACGCCGT GTTTGCATTC AGCGAAAAAG GAATGATGGG AGAAGCGGGA CATGTTCTTA TAGTTGATTC CAAAGGCAGG GAAGTCGCAG ATTACAGAAT TGACCAGGAT GTTGTGAACA TAGCTGCAAA TGATGATTTT ATAGCCATAA ACACATCCAA GTGTGTTTAT TTTATAGATG TTGCCGGGAG ACTCAGGGGA AGCTATGCTT CGGATTATTT GATAAATGAA GTGAAGTTTC TTGGCACAAA TGAGGCTTTG GCAATTACAA AAGAAGGCGT TATCCTTTTA AAAATGAATT AA
|
Protein sequence | MNSPENIPKM RQPSSRMSIL LFFVLVIIVV CTAAVAYLNS KGIDIKSVSI RDIIANGFFV GDKDVYEVTG TLLRYEDSLN SEFGAYKGYI VRCTKNGLEY LNRNGEEEWI YTVSLNLPYF KSAGEYLLVA GLESKDIYVF AGKEKKWEKK LDYSIINANI NSAGYVTVLH KAERDKSAVS VYNKQGVFLF TKILGETYAI SSEVSPSGRE VLINCVDISG VSINTGLHLY TISGGNVAGK TFENVIFPSI RYLNDDTIVA VSDSAIYLFG QNFEEKWSRT ISGKVYSMDV MKGRYAVFAF SEKGMMGEAG HVLIVDSKGR EVADYRIDQD VVNIAANDDF IAINTSKCVY FIDVAGRLRG SYASDYLINE VKFLGTNEAL AITKEGVILL KMN
|
| |