Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2150 |
Symbol | |
ID | 4811198 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2555782 |
End bp | 2557128 |
Gene Length | 1347 bp |
Protein Length | 448 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 640107554 |
Product | integral membrane protein-like protein |
Protein accession | YP_001038546 |
Protein GI | 125974636 |
COG category | [S] Function unknown |
COG ID | [COG5542] Predicted integral membrane protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0000529517 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATAATG TTAATACCAT GAGTACAAAT AAAAAAAGTT GGTTTTTCGA GGATGGAAAA CCCGTATCTC GAAATATGTT TATAATTACC GGGACGGTCG TAATATTAAT AAGATTGCTT CTGACCACGA TTCCAAGTTA TCAGGTGGAC ATGGGAGGAT ACAGGGCCTG GAGCCTGTAT CTTGCCGAAA ATGGTCCGGT AGGTTTTTAT GAGCGTTACC ATGTTGTGTA TGCACCGGCA TATATGTATT TGCTGTGGAT TACGGGAATA ATAGCAAAGG CTTTTTCAGT CAATGCGTCA ACCCATGCAT TTTTGATAAA GCTGTGGGCT GTTGCTTCAG AACTGGTAGG TGCTTATCTT ATTTATAAAA TTGGCAAAAA GTACAAAAAA GAAAGGCTTG GATTTATTCT GGGAGTGGTT TATGCACTTA ATCCGGGAGT TTTCTTCAAT TCATCGATTT GGGGACAATT TGATTCGATA CCGGCAACGT TGCTTGTAGG TATGATATAT GCTTTTAGTG TAAACCGGAA AATGACTGCG GTGGTATTGT ATGCCATTGC TGTTCTGACC AAGCCTCAAA GTGCGCTTCT TACACCGCTG GGCATACTTT TTTACAAAGA ACTGTTTGAC TTTTCCAACA TCACAAAAGA AAAGATTGTT AAAAGTATCA AGGAAACATT GGTGGCTATT TGTGTAGGAT TGTCATGTTA TTTTATCGTT ATTTATCCTT TCTATTATCA TACCGATCTT TATGAACGAA TGAAGAGTAC TTCCGTTGTT AAAGATTTTA TAGCTGAGAG CATTGACTAC TTTTGGTGGA TGCCCAACTT GTATCTGACG AGTGTTGAAG ATTATCCGTA TGCCACTGCC AATGCCTTTA ACTTGTGGAC ACTTTTGGGA GGACAACCTG TAAAGGATTC AAATATATTC TTCATATTGT CCTATAAAAC GTGGGGAACT ATACTGTTTT TAATTTGCAT AGGCATAGCC TTTGCATATC TGCTGAAAAA AAGGAAAAGC GATTTTGCAA TGTACTTTGC ATCTTTCTTC ATCCTTTCAA GCGCTTTTAC CTTTATAACA AGAATGCATG AAAGATATTT GCTTCCCGCC ATAATATTCC TTACAATTTG CGTCCTGTGG GAAAAGTGGA TGGCAATACC TTTGACGGTT TTGAGTGTAT GTGTTACTGC CAACCACTGG TACATATATG ATTTGTCGTG GAAGGATGTT TTTTGGCTGA GAAATTACGA TCCTGTGGCC ATGCCCTTTG CTTTCCTGAC TGTGCTGGTG GTTTTATTTG GTGCGGGGTT TATTATAAAA CAGATTTTGC CGGCCAAAAA AAATTGA
|
Protein sequence | MDNVNTMSTN KKSWFFEDGK PVSRNMFIIT GTVVILIRLL LTTIPSYQVD MGGYRAWSLY LAENGPVGFY ERYHVVYAPA YMYLLWITGI IAKAFSVNAS THAFLIKLWA VASELVGAYL IYKIGKKYKK ERLGFILGVV YALNPGVFFN SSIWGQFDSI PATLLVGMIY AFSVNRKMTA VVLYAIAVLT KPQSALLTPL GILFYKELFD FSNITKEKIV KSIKETLVAI CVGLSCYFIV IYPFYYHTDL YERMKSTSVV KDFIAESIDY FWWMPNLYLT SVEDYPYATA NAFNLWTLLG GQPVKDSNIF FILSYKTWGT ILFLICIGIA FAYLLKKRKS DFAMYFASFF ILSSAFTFIT RMHERYLLPA IIFLTICVLW EKWMAIPLTV LSVCVTANHW YIYDLSWKDV FWLRNYDPVA MPFAFLTVLV VLFGAGFIIK QILPAKKN
|
| |