Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2397 |
Symbol | |
ID | 4811049 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 2862320 |
End bp | 2863540 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 640107810 |
Product | hypothetical protein |
Protein accession | YP_001038792 |
Protein GI | 125974882 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTGCGAA AAAACAGGCA TAATCTTATC CTGATATTAA TTATTATTGC ATCCCTGATT GTAAAAATAG TTTTGATAGT AAAGTATAAA AACAACTTAA CCTTGTCCAG TGATGACTTA AACTATGTAA AAAGTGCGGT GGTGCTTGTA AAAAGAGGCA TTTACATTTT TCACAATTTA AATGAGCCAA CGGTTTTTGT AACTCCGGTA TATCCTTTTT TTCTTGCTCT GGTCTTTAAG TTGTTTGGGA CGGGATTTGC CGGCCTTCAG GCTGCAAGAA TTATACAAGC GGTAATAAGT TCTGTCACGA TACTCCTTGT TTATTTAATT GCCGGAAAGC TTTTTAATAA AAATGTGGCT TTGCTTGCTG CATTTTTGGT GGCGTTCTAT GTTCCAAACA TTGTAACAGT AGGATATATG CTGACGGAAA CGCTTTTTAC GATGCTTCTT TGCCTCCTTT TGTATTTTAG CCTGTTGTTC GCAAAAAAGC CAAAAAAAAC CGGGTTTGTG TTTTTAGGAG TGCTTTGGGC TGTTGCCACA TTGTGCAGAC CAACCATAGC CCTGTATCCG ATTTTTTTAT TTGTCTACCT GTTTTGGTCG GAGCGTATAA AAATTGTTGA GATGATAAAG CTGGGAACTG TTATGTTTTT GGCATTTGTG GCAGTAGTAT CTCCATGGTG GATAAGAAAC TATCGCGAAT ACGGGGAGTT TATACCCCTT GCGGCCTCCA GTGGAAATCC GCTCCTTCAG GGAACGTATG TAAATTATGA GCAAACACCG GAAAATGTAG TGTACTATAA ATTGGGGAAG AATGCTTTTG AAACCAACAA AACGGAAGTT GCTGTTGCTA AAATGAGGAT AAAGGAGGAA TTTAAGAAGG ACTTTTGGGG ATATCTTAAA TGGTATACCA TAGGGAAGAC AAATCTTTTT TGGAGGACGG TGTTTTACTG GAAGGGCTTT TTCAACATCC CACACTCCAT TGTGTTGTAC ATTCACCTGT TTATAGTATA TACAGGTTTT GCCGGCATAG TTATGTTATT ATTCAAAGGA ATCGGAAAAT ACAGCCTGCC TGTGCTAGTA ATGCTTTATT TTAATGCAAC TCACTGTGTT TATATGGCCT TTGACAGGTA TGCCTTTCCC ATGATACCGC TTTTGTCGAT ATTTTCGTCT TTTTTGATTT TAAAGGTTTT GAGCTTAATG AGGGAGAAAA TCCGGTTCTA A
|
Protein sequence | MVRKNRHNLI LILIIIASLI VKIVLIVKYK NNLTLSSDDL NYVKSAVVLV KRGIYIFHNL NEPTVFVTPV YPFFLALVFK LFGTGFAGLQ AARIIQAVIS SVTILLVYLI AGKLFNKNVA LLAAFLVAFY VPNIVTVGYM LTETLFTMLL CLLLYFSLLF AKKPKKTGFV FLGVLWAVAT LCRPTIALYP IFLFVYLFWS ERIKIVEMIK LGTVMFLAFV AVVSPWWIRN YREYGEFIPL AASSGNPLLQ GTYVNYEQTP ENVVYYKLGK NAFETNKTEV AVAKMRIKEE FKKDFWGYLK WYTIGKTNLF WRTVFYWKGF FNIPHSIVLY IHLFIVYTGF AGIVMLLFKG IGKYSLPVLV MLYFNATHCV YMAFDRYAFP MIPLLSIFSS FLILKVLSLM REKIRF
|
| |