Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1048 |
Symbol | |
ID | 4811346 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1251329 |
End bp | 1252864 |
Gene Length | 1536 bp |
Protein Length | 511 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 640106470 |
Product | rhomboid family protein |
Protein accession | YP_001037473 |
Protein GI | 125973563 |
COG category | [R] General function prediction only |
COG ID | [COG0705] Uncharacterized membrane protein (homolog of Drosophila rhomboid) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGATTATAA AGGCACTGAG CAGATTACTG ATTGAGAGGG GAAATTATTT TCCTGTCATC GACGAAAACG GAAACGTTGT GATAGATGAC AACCGGGCAA TGCTTCACAG GTACGGTCCT GAAGGTTACG TTGTTGCAGA AATTGTAAGC GGTGATTTGT TGACACCGGA ACAAATAAAA ACAGGACTTG AAATGGTAAG GTCAAGGCAG GAGCAGATAA AAGAAAGATT CAATACCTAC TACTTCAAAG TTTTTGTCTT TTCATCGGAT TTGCCTGAGG ATAAACTGGA GATTTTAGCG TCGGAAAATT ATAACGAAGG AATTAACGGA AATTACCTGT GCTGCATAAG TGTAAATGTT GAAAAAAAAG AGTTTGCCAA GTATTATGAA TATCCCAGAA TATCCTACGG TATTCATAAT CAAATTCAAT ATTTTTTCAG CAATGATTTT GACAGTGACT ACGGCAATGT TGATTTTAAA GAATTGGCGA AGAAAAATGA AAAGGGATAC AAAATTGAGA TAAAAGACAA AAAACCCTGG GTAACCTATG TTCTCATAGC TGTCAACATT CTGGTCTGGC TTTTGATAGA AATCTATGCG CGGTCCAAGG GTGTTGATTC GTCAAGTCTG TTGGTGGATT TTGGAGCAAA GGAAAATACT CATATAATGA TGGGAGAATA TTGGAGGTTT GTGACTCCCA TGTTTTTGCA CAACGGAATT ACTCATCTTG TTGTAAATTC CTATTCCCTT TATGTCCTCG GAACTACTGT TGAAATGATA ATGGGAAAAG GCAGGTTTTT GTTTATATAT CTCATGGCCG GCCTTATGGG CAGTATAGTA AGCTTCATTT TTTCAATAGC GCCGTCAGTA GGGGCGTCAG GAGCAATATT TGGCTTGCTG GGTGCTCTTA TTTATTATGG GACGGAGCAC CGGGAACTGT TTAAGAAAGG TTTTGGCAGA GGCATATTAA CGACTCTTCT GATCAATATC GTATATGGTT TGTCAGTTCC CAGAATTGAT AATTTCGGAC ATTTTGGAGG CCTGTTGGGT GGCTTTTTGG CTTCGGGGGT TGTCGGACTG CCTTCTTTCC GCAGGTCTTT GAAGAAAAAA ACGGTATTTA TGGTTGCAGC GGCTGTGATT TTGTCAACGT CGCTTTATTA CGGATTTACC AATACTCAGA ACCAATCCTT AAAGAAGTTG GAATTAATGA GCGGTTTTCT CAATGAACGA AACTGGGTTG AGAGCGAAAA GCTGGGTGAG GAAATAATTA AAATGCACCC AAAGAATGAA GCCATATTAT TCAATGCTTT GTGGAATTTA GCTATGTCTG AAGCCAAGCA GGGAAAATAT GACGAAGCGG TAGAACATGC ACAAATGCTG ACAGAGGTCG ACCCTGCAAA CGGGCATTTT CTTCTCGGAA TTATATATAA TGATGCAGGA AATGTTGAAA TGTCAAAAAA AGAGTTGGAG GAAGCCGTAA AAATTGATCC TCGTTTGAGG AATAAGGTTG AAAGTATTTT GAATACAATT AAATAG
|
Protein sequence | MIIKALSRLL IERGNYFPVI DENGNVVIDD NRAMLHRYGP EGYVVAEIVS GDLLTPEQIK TGLEMVRSRQ EQIKERFNTY YFKVFVFSSD LPEDKLEILA SENYNEGING NYLCCISVNV EKKEFAKYYE YPRISYGIHN QIQYFFSNDF DSDYGNVDFK ELAKKNEKGY KIEIKDKKPW VTYVLIAVNI LVWLLIEIYA RSKGVDSSSL LVDFGAKENT HIMMGEYWRF VTPMFLHNGI THLVVNSYSL YVLGTTVEMI MGKGRFLFIY LMAGLMGSIV SFIFSIAPSV GASGAIFGLL GALIYYGTEH RELFKKGFGR GILTTLLINI VYGLSVPRID NFGHFGGLLG GFLASGVVGL PSFRRSLKKK TVFMVAAAVI LSTSLYYGFT NTQNQSLKKL ELMSGFLNER NWVESEKLGE EIIKMHPKNE AILFNALWNL AMSEAKQGKY DEAVEHAQML TEVDPANGHF LLGIIYNDAG NVEMSKKELE EAVKIDPRLR NKVESILNTI K
|
| |