Gene Cthe_1048 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1048 
Symbol 
ID4811346 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1251329 
End bp1252864 
Gene Length1536 bp 
Protein Length511 aa 
Translation table11 
GC content39% 
IMG OID640106470 
Productrhomboid family protein 
Protein accessionYP_001037473 
Protein GI125973563 
COG category[R] General function prediction only 
COG ID[COG0705] Uncharacterized membrane protein (homolog of Drosophila rhomboid) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGATTATAA AGGCACTGAG CAGATTACTG ATTGAGAGGG GAAATTATTT TCCTGTCATC 
GACGAAAACG GAAACGTTGT GATAGATGAC AACCGGGCAA TGCTTCACAG GTACGGTCCT
GAAGGTTACG TTGTTGCAGA AATTGTAAGC GGTGATTTGT TGACACCGGA ACAAATAAAA
ACAGGACTTG AAATGGTAAG GTCAAGGCAG GAGCAGATAA AAGAAAGATT CAATACCTAC
TACTTCAAAG TTTTTGTCTT TTCATCGGAT TTGCCTGAGG ATAAACTGGA GATTTTAGCG
TCGGAAAATT ATAACGAAGG AATTAACGGA AATTACCTGT GCTGCATAAG TGTAAATGTT
GAAAAAAAAG AGTTTGCCAA GTATTATGAA TATCCCAGAA TATCCTACGG TATTCATAAT
CAAATTCAAT ATTTTTTCAG CAATGATTTT GACAGTGACT ACGGCAATGT TGATTTTAAA
GAATTGGCGA AGAAAAATGA AAAGGGATAC AAAATTGAGA TAAAAGACAA AAAACCCTGG
GTAACCTATG TTCTCATAGC TGTCAACATT CTGGTCTGGC TTTTGATAGA AATCTATGCG
CGGTCCAAGG GTGTTGATTC GTCAAGTCTG TTGGTGGATT TTGGAGCAAA GGAAAATACT
CATATAATGA TGGGAGAATA TTGGAGGTTT GTGACTCCCA TGTTTTTGCA CAACGGAATT
ACTCATCTTG TTGTAAATTC CTATTCCCTT TATGTCCTCG GAACTACTGT TGAAATGATA
ATGGGAAAAG GCAGGTTTTT GTTTATATAT CTCATGGCCG GCCTTATGGG CAGTATAGTA
AGCTTCATTT TTTCAATAGC GCCGTCAGTA GGGGCGTCAG GAGCAATATT TGGCTTGCTG
GGTGCTCTTA TTTATTATGG GACGGAGCAC CGGGAACTGT TTAAGAAAGG TTTTGGCAGA
GGCATATTAA CGACTCTTCT GATCAATATC GTATATGGTT TGTCAGTTCC CAGAATTGAT
AATTTCGGAC ATTTTGGAGG CCTGTTGGGT GGCTTTTTGG CTTCGGGGGT TGTCGGACTG
CCTTCTTTCC GCAGGTCTTT GAAGAAAAAA ACGGTATTTA TGGTTGCAGC GGCTGTGATT
TTGTCAACGT CGCTTTATTA CGGATTTACC AATACTCAGA ACCAATCCTT AAAGAAGTTG
GAATTAATGA GCGGTTTTCT CAATGAACGA AACTGGGTTG AGAGCGAAAA GCTGGGTGAG
GAAATAATTA AAATGCACCC AAAGAATGAA GCCATATTAT TCAATGCTTT GTGGAATTTA
GCTATGTCTG AAGCCAAGCA GGGAAAATAT GACGAAGCGG TAGAACATGC ACAAATGCTG
ACAGAGGTCG ACCCTGCAAA CGGGCATTTT CTTCTCGGAA TTATATATAA TGATGCAGGA
AATGTTGAAA TGTCAAAAAA AGAGTTGGAG GAAGCCGTAA AAATTGATCC TCGTTTGAGG
AATAAGGTTG AAAGTATTTT GAATACAATT AAATAG
 
Protein sequence
MIIKALSRLL IERGNYFPVI DENGNVVIDD NRAMLHRYGP EGYVVAEIVS GDLLTPEQIK 
TGLEMVRSRQ EQIKERFNTY YFKVFVFSSD LPEDKLEILA SENYNEGING NYLCCISVNV
EKKEFAKYYE YPRISYGIHN QIQYFFSNDF DSDYGNVDFK ELAKKNEKGY KIEIKDKKPW
VTYVLIAVNI LVWLLIEIYA RSKGVDSSSL LVDFGAKENT HIMMGEYWRF VTPMFLHNGI
THLVVNSYSL YVLGTTVEMI MGKGRFLFIY LMAGLMGSIV SFIFSIAPSV GASGAIFGLL
GALIYYGTEH RELFKKGFGR GILTTLLINI VYGLSVPRID NFGHFGGLLG GFLASGVVGL
PSFRRSLKKK TVFMVAAAVI LSTSLYYGFT NTQNQSLKKL ELMSGFLNER NWVESEKLGE
EIIKMHPKNE AILFNALWNL AMSEAKQGKY DEAVEHAQML TEVDPANGHF LLGIIYNDAG
NVEMSKKELE EAVKIDPRLR NKVESILNTI K