Gene Cthe_2771 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2771 
Symbol 
ID4810088 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp3273730 
End bp3274866 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content40% 
IMG OID640108191 
Productbeta-lactamase-like protein 
Protein accessionYP_001039163 
Protein GI125975253 
COG category[C] Energy production and conversion 
COG ID[COG0426] Uncharacterized flavoproteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGTCTGTCT TAATTTTTGG TATCATTATA GAAAAGCCGA CCGTAATTGA TACTGTGGAT 
ATCGAATTCG GAAAAGAATA TGTTGACAAC TTAAGTAAAA TTATTGATCC GGAAAATATC
GAATATATCG TAATCAACCA CGTAGAGCCC GACCATGCTG GGGCACTGCC TGCTTTGGCG
GCTAAAGCAA AAAATGCAAA AATAGTGACT ACCAAATTAG CCTCAGAACT GCTTAAGGAC
ATGTTTAAGC TTCATAATAG AGAATTTGCC ATAATCAAAG ACGGGGATAC ATTGGATATA
GGCGGCAAGA CATTAAGCTT TTTCGAAACA CCTTATCTGC ACACCGAAGA GACAATGATA
ACCTACGTGA ACGAGAATAA AATCCTTTTC CCCTGCGATA TCTTCAGCAC CCACATAGCA
AACTACGAAC TGTTTAATGA TTTGGCAAAA GGCGAATATA TCGAAGACTT CAAAGTGTAC
TACAGGCTTA TCATGGCACC CCACAGACCT TATGTAAGAG ACATGCTGGA AAAGATTAAA
AAGCTCAATA TAGAGGTAAT TGCTCCTTCT CACGGGTATA TCCTGCGTGA AAATACTGCA
AAGTTTATTC AAATGTATGA TGAAATGAGC AGTCTTGCGG CTTTGAAACA GCCGAAAAAA
GTTACAATTG TATATTCCAC CATGACGGGA AATACCGCAA AGATAGCGCA AAAACTCGTC
CAAGGGCTTG AAAGTGCAGG AGTGGAAACA TCAGTATTCA ACCTTAAAAA TTCTGATTTG
GCTGAAGTCA GGAAAAAAAT TACGGAAAGT GACGGCATTT TGGTGGGAAG CTCCACAAGG
TATGCCGATA TGGTTGGAAA TGTTGAGGAA CTCCTTAAAT TGTTAGAAGG AGAAGAAGTG
AAGAATAAAT TTGCAGCGGC TTTCGGTTCC TACGGTTGGA GCGGGGAAGC AATCATGCAC
ATTGAAAATT ATCTTGATAA AATCGGCTTT AACGTAATTA ACCAAAAATA TCTCATCGGC
AGTGCAGGTA TTGATATACC GTTATTCCCT CTGCGCATTA AGTTTGCCCG CCAGGAAGGC
CTGGAACTTG CTGAGGAGGC AGGCAGAGTT TTTGGAGAAC AAGTATTAAC CCATTGA
 
Protein sequence
MSVLIFGIII EKPTVIDTVD IEFGKEYVDN LSKIIDPENI EYIVINHVEP DHAGALPALA 
AKAKNAKIVT TKLASELLKD MFKLHNREFA IIKDGDTLDI GGKTLSFFET PYLHTEETMI
TYVNENKILF PCDIFSTHIA NYELFNDLAK GEYIEDFKVY YRLIMAPHRP YVRDMLEKIK
KLNIEVIAPS HGYILRENTA KFIQMYDEMS SLAALKQPKK VTIVYSTMTG NTAKIAQKLV
QGLESAGVET SVFNLKNSDL AEVRKKITES DGILVGSSTR YADMVGNVEE LLKLLEGEEV
KNKFAAAFGS YGWSGEAIMH IENYLDKIGF NVINQKYLIG SAGIDIPLFP LRIKFARQEG
LELAEEAGRV FGEQVLTH