Gene Cthe_2150 details


Gene Information

Locus tag: Cthe_2150
Symbol:
ID: 4811198
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 2555782
End bp: 2557128
Gene Length: 1347 bp
Protein Length: 448 aa
Translation table: 11
GC content: 39%
IMG OID: 640107554
Product: integral membrane protein-like protein
Protein accession: YP_001038546
Protein GI: 125974636
COG category: [S] Function unknown
COG ID: [COG5542] Predicted integral membrane protein
TIGRFAM ID:
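
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is only an illustration, not part of the database record. It assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted as a residue. The variable names are hypothetical.

```python
# Values copied from the record.
start_bp = 2555782
end_bp = 2557128
gene_length_bp = 1347
protein_length_aa = 448

# Assumption: coordinates are 1-based and inclusive on both ends.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
codon_count = span_bp // 3
expected_residues = codon_count - 1

print(span_bp == gene_length_bp)               # span matches Gene Length
print(span_bp % 3 == 0)                        # length is a whole number of codons
print(expected_residues == protein_length_aa)  # matches Protein Length
```

Under these assumptions all three checks print True for the values listed in this record.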


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000529517
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATAATG TTAATACCAT GAGTACAAAT AAAAAAAGTT GGTTTTTCGA GGATGGAAAA 
CCCGTATCTC GAAATATGTT TATAATTACC GGGACGGTCG TAATATTAAT AAGATTGCTT
CTGACCACGA TTCCAAGTTA TCAGGTGGAC ATGGGAGGAT ACAGGGCCTG GAGCCTGTAT
CTTGCCGAAA ATGGTCCGGT AGGTTTTTAT GAGCGTTACC ATGTTGTGTA TGCACCGGCA
TATATGTATT TGCTGTGGAT TACGGGAATA ATAGCAAAGG CTTTTTCAGT CAATGCGTCA
ACCCATGCAT TTTTGATAAA GCTGTGGGCT GTTGCTTCAG AACTGGTAGG TGCTTATCTT
ATTTATAAAA TTGGCAAAAA GTACAAAAAA GAAAGGCTTG GATTTATTCT GGGAGTGGTT
TATGCACTTA ATCCGGGAGT TTTCTTCAAT TCATCGATTT GGGGACAATT TGATTCGATA
CCGGCAACGT TGCTTGTAGG TATGATATAT GCTTTTAGTG TAAACCGGAA AATGACTGCG
GTGGTATTGT ATGCCATTGC TGTTCTGACC AAGCCTCAAA GTGCGCTTCT TACACCGCTG
GGCATACTTT TTTACAAAGA ACTGTTTGAC TTTTCCAACA TCACAAAAGA AAAGATTGTT
AAAAGTATCA AGGAAACATT GGTGGCTATT TGTGTAGGAT TGTCATGTTA TTTTATCGTT
ATTTATCCTT TCTATTATCA TACCGATCTT TATGAACGAA TGAAGAGTAC TTCCGTTGTT
AAAGATTTTA TAGCTGAGAG CATTGACTAC TTTTGGTGGA TGCCCAACTT GTATCTGACG
AGTGTTGAAG ATTATCCGTA TGCCACTGCC AATGCCTTTA ACTTGTGGAC ACTTTTGGGA
GGACAACCTG TAAAGGATTC AAATATATTC TTCATATTGT CCTATAAAAC GTGGGGAACT
ATACTGTTTT TAATTTGCAT AGGCATAGCC TTTGCATATC TGCTGAAAAA AAGGAAAAGC
GATTTTGCAA TGTACTTTGC ATCTTTCTTC ATCCTTTCAA GCGCTTTTAC CTTTATAACA
AGAATGCATG AAAGATATTT GCTTCCCGCC ATAATATTCC TTACAATTTG CGTCCTGTGG
GAAAAGTGGA TGGCAATACC TTTGACGGTT TTGAGTGTAT GTGTTACTGC CAACCACTGG
TACATATATG ATTTGTCGTG GAAGGATGTT TTTTGGCTGA GAAATTACGA TCCTGTGGCC
ATGCCCTTTG CTTTCCTGAC TGTGCTGGTG GTTTTATTTG GTGCGGGGTT TATTATAAAA
CAGATTTTGC CGGCCAAAAA AAATTGA
 
Protein sequence
MDNVNTMSTN KKSWFFEDGK PVSRNMFIIT GTVVILIRLL LTTIPSYQVD MGGYRAWSLY 
LAENGPVGFY ERYHVVYAPA YMYLLWITGI IAKAFSVNAS THAFLIKLWA VASELVGAYL
IYKIGKKYKK ERLGFILGVV YALNPGVFFN SSIWGQFDSI PATLLVGMIY AFSVNRKMTA
VVLYAIAVLT KPQSALLTPL GILFYKELFD FSNITKEKIV KSIKETLVAI CVGLSCYFIV
IYPFYYHTDL YERMKSTSVV KDFIAESIDY FWWMPNLYLT SVEDYPYATA NAFNLWTLLG
GQPVKDSNIF FILSYKTWGT ILFLICIGIA FAYLLKKRKS DFAMYFASFF ILSSAFTFIT
RMHERYLLPA IIFLTICVLW EKWMAIPLTV LSVCVTANHW YIYDLSWKDV FWLRNYDPVA
MPFAFLTVLV VLFGAGFIIK QILPAKKN
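
The gene and protein sequences above are printed in blocks of ten characters. The sketch below shows one way to strip that spacing, translate the nucleotide sequence, and compute a GC fraction. It is an illustration under stated assumptions, not the method the database uses. It assumes that translation table 11 assigns amino acids to codons in the same way as the standard genetic code (the two differ in which codons may act as start codons), and the helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
# Codon table built in the conventional T, C, A, G order.
# Assumption: table 11 uses the same codon-to-amino-acid assignments
# as the standard code; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon (not included)."""
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks and the last line of the gene sequence, as printed above.
first_blocks = "ATGGATAATG TTAATACCAT GAGTACAAAT"
last_line = "CAGATTTTGC CGGCCAAAAA AAATTGA"

print(translate(clean(first_blocks)))  # MDNVNTMSTN
print(translate(clean(last_line)))     # QILPAKKN
print(gc_fraction(clean(first_blocks)))
```

The two translated fragments correspond to the first ten and last eight residues of the protein sequence listed above. The last line can be translated on its own because the 22 preceding lines hold 60 bases each, so it starts on a codon boundary.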