Gene Information |
Locus tag | Cthe_1928 |
Symbol | |
ID | 4810786 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2300254 |
End bp | 2301255 |
Gene Length | 1002 bp |
Protein Length | 333 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640107344 |
Product | hypothetical protein |
Protein accession | YP_001038339 |
Protein GI | 125974429 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01443] intein C-terminal splicing region; [TIGR01445] intein N-terminal splicing region |
| |
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0000122505 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGACAGGAG CGGCATGTGC CGGGCTTGGT GCTTTGGGAG CTCTAGCAGG GAAAAGCATC CAATGTATGA GCACAGTGGG AAAAGCGATA AATGTTACAT CAAAGGTTAT GGCAGCACTT TCTTTTGGTA TGGATGGATT TGACATGCTG GCAATGGGAG TATCATTGTT TGATCCATCC AACGCATTGG TTGAATTTAA TCGGAAGCTG CATTCCAGTG CACTTTACAA CGGATTCCAG ATTGCTGTAA ACGCGCTGGC TGTTTTCAGT GCCGGGGCGG CATCTACAAT GAAGTGCTTT GTTGCAGGCA CGCTGATATT GACTGTGGCA GGCTTGGTTG CAATAGAGAA TATCAAGGCA GGAGACAAGG TAATTGCGAC GAATCTGGAG ACTTTTGAAG TAGCCGAGAA GACAGTGCTT GAGACATATG TGAGAGAGAC AACGGAGCTT TTGCATTTGA CAATCAATGG AGAGGTAATC AAGACAACCT TTGAGCATCC GTTTTATGTT AAAGATGTGG GTTTTGTTGA AGCTAAAGAA TTGCAAGTAG GAGATAAGCT GCTAGATTCA AAAGGCAATG TTTTGGTGGT GGAAGAGAAA AAGCTTGAAA TTACAGATGA ACCTGCCAAG GTTTATAACT TCAAGGTTGA TGATTTTCAT ACTTATCATG TCGGCAATAA TGGAATATTG GTACATAATG CAAATTATAG TAAGGGAATG AGTAGTAATA TCCCCGACTA TATAAAAGAT AATCGTGTAC CTTTAGATAA GGAGACAGTA TTGAACAGTA AGGAGTACCA AAAAACTAAT ATTAAAGTTA AAGGTGCTCA AGTTTACAAA AAAGGGGATA AATATTATTA CCGTGATACT TTCCATACAG GAGAAGCGGC TCATTTAGAG GTGTTTGATA AGAGAGGAAA CCATATTGGT GAAGCTAATC CACTAACTGG AGAATTGATA CCGGGAACAG CAGATCCGAT GAAGAAAATT AAAATAAAGT AG
|
Protein sequence | MTGAACAGLG ALGALAGKSI QCMSTVGKAI NVTSKVMAAL SFGMDGFDML AMGVSLFDPS NALVEFNRKL HSSALYNGFQ IAVNALAVFS AGAASTMKCF VAGTLILTVA GLVAIENIKA GDKVIATNLE TFEVAEKTVL ETYVRETTEL LHLTINGEVI KTTFEHPFYV KDVGFVEAKE LQVGDKLLDS KGNVLVVEEK KLEITDEPAK VYNFKVDDFH TYHVGNNGIL VHNANYSKGM SSNIPDYIKD NRVPLDKETV LNSKEYQKTN IKVKGAQVYK KGDKYYYRDT FHTGEAAHLE VFDKRGNHIG EANPLTGELI PGTADPMKKI KIK
|
| |
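The length fields in this record can be cross-checked against each other: with inclusive coordinates, End bp - Start bp + 1 = 2301255 - 2300254 + 1 = 1002 bp, and 1002 / 3 = 334 codons, which is 333 amino acids plus one stop codon. The sketch below shows one way to run such checks in Python. The helper names (`gene_length`, `gc_percent`, `translate`) are hypothetical and not part of any database tooling. It assumes the Sequence fields are pasted in as strings with their 10-character spacing, that the first codon is a valid translation table 11 start codon (here GTG) and is therefore read as M, and that the amino acid assignments of table 11 match the standard genetic code.

```python
from itertools import product

# Amino acid assignments in TCAG codon order. Translation table 11 uses the
# same assignments as the standard code and differs only in which codons may
# act as start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Drop the spacing used in the record's sequence rows and upper-case."""
    return "".join(seq.split()).upper()


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Assumption: the first codon is a valid start codon and is read as M.
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append("M" if i == 0 else aa)
    return "".join(protein)


if __name__ == "__main__":
    length = gene_length(2300254, 2301255)
    print(length, length // 3 - 1)
    # First 30 bases of the Gene sequence field above.
    print(translate("GTGACAGGAG CGGCATGTGC CGGGCTTGGT"))
```

Passing the full Gene sequence field to `translate` and `gc_percent` allows a comparison with the Protein sequence and GC content fields. The gene is annotated on the minus strand; the check assumes the Gene sequence field is already given in coding orientation, which its GTG start and TAG stop suggest.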