Gene Information |
Locus tag | Cthe_1926 |
Symbol | |
ID | 4810784 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2297702 |
End bp | 2298928 |
Gene Length | 1227 bp |
Protein Length | 408 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 640107342 |
Product | hypothetical protein |
Protein accession | YP_001038337 |
Protein GI | 125974427 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01445] intein N-terminal splicing region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGCTGCCG CAGTAATTAT TACCGGGCTG GGGATAGCGG CAGCGTTGAC AGGCGGTATA TTGGGAGTCG TACTGGCAGG AGCATTCTGG GGAGCATTGG CCGGAGGATT GATAGGAGGA GCGGTCGGAG GAATAGCTGC GGCGGTAAAT GGAGGTTCAT TTTTAGAAGG ATTTGCGGAT GGCGCTTTAA GCGGAGCGGT TTCCGGAGCG GTGACAGGAG CCGCATGTGC CGGGCTGGGT GCTTTGGGAG CTCTAGCAGG GAAAGGCATC CAATGCTTGA GCACACTGGG GAAAGCGATA AAAATTACGT CAAAAGTAAC TGCAGCACTA TCGTTTGGTA TGGATGGATT TGACATGCTG GCAATGGGAA TATCGTTGTT TGATCCATCC AACGCATTGG TTGAATTTAA CCGGAAGCTG CATTCCAACG CGCTTTACAA CAGATTCCAG ATTGCTATAA ACGCGCTTGC TGTTTTCACT GCGGGAGCGG CATCGACAAT GAAGTGCTTT GTTGCAGGTA CGCTGATATT GACTGCGACA GGCTTCGTTG CGATAGAGAA TATCAAGGCA GGAGACAAGG TAATTGCGAC GAATCCGGAG ACTTTTGAAG TAGCGGAAAA GACAGTGCTT GAGACATATG TGAGAGAGAC GACGGAGCTT TTGCATTTGA CAATCAATGG AGAGGTAATC AAGACAACCT TTGAGCATCC GTTTTATGTA AAAGATGTGG GTTTTGTTGA AGCGGGAAAA CTGCAGATAG GAGACAGGTT GGTTGATTCA AGAGGTAATG TTTTAGTATT GGAAGGTAAA AAGCTTGAAA TAACAGATAA GCCTGTAAAG GTTTACAACT TCAAAGTGGA TGATTTTTAT ACTTATCATG TTGCACATAT TGGTGTATTG GTGCATAATG CAAACAACTA TAAACCTGAA AATAGTATAT ATATTTATGA AAATGGAATA TACGAGGATG CAGATTATCA TGGAAGAGTT GATAATAAAG TAAAAAGTAG AAGACCAATA GACGGTCAAT TTGCGTTGGA TAATTCTGTT GAGGTTAAAA TTGGTAGAAG TCCTAGAAGG GTTGGGATTG ATGTAAATGG AGATTTTGTT GTATTAGATC GTACGGATGG CGAAAAATAC CATGGTCATG TTAGACCTTG GAAGAAAGAT GCTTCAGGTA TTGAACCTTT AAGTGATAAA ATGAAAAGTG CTCTGATAAA GGCATGA
|
Protein sequence | MAAAVIITGL GIAAALTGGI LGVVLAGAFW GALAGGLIGG AVGGIAAAVN GGSFLEGFAD GALSGAVSGA VTGAACAGLG ALGALAGKGI QCLSTLGKAI KITSKVTAAL SFGMDGFDML AMGISLFDPS NALVEFNRKL HSNALYNRFQ IAINALAVFT AGAASTMKCF VAGTLILTAT GFVAIENIKA GDKVIATNPE TFEVAEKTVL ETYVRETTEL LHLTINGEVI KTTFEHPFYV KDVGFVEAGK LQIGDRLVDS RGNVLVLEGK KLEITDKPVK VYNFKVDDFY TYHVAHIGVL VHNANNYKPE NSIYIYENGI YEDADYHGRV DNKVKSRRPI DGQFALDNSV EVKIGRSPRR VGIDVNGDFV VLDRTDGEKY HGHVRPWKKD ASGIEPLSDK MKSALIKA
|
| |
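The Start bp, End bp, Gene Length, and Protein Length fields in this record are consistent with one another, and the arithmetic can be sketched as below. This is a minimal sketch, not the database's own method: it assumes the coordinates are 1-based and inclusive, and that the unspliced CDS ends in one stop codon that is not counted in the protein length. The function names `gene_length_bp` and `protein_length_aa` are hypothetical and introduced only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(2297702, 2298928)
print(length)                     # 1227
print(protein_length_aa(length))  # 408
```

Both results match the Gene Length (1227 bp) and Protein Length (408 aa) reported above. The strand does not affect the calculation, since length depends only on the two coordinates.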