Gene Cthe_2789 details


Gene Information

Locus tag: Cthe_2789
Symbol:
ID: 4810106
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 3289547
End bp: 3290629
Gene Length: 1083 bp
Protein Length: 360 aa
Translation table: 11
GC content: 41%
IMG OID: 640108209
Product: NLPA lipoprotein
Protein accession: YP_001039181
Protein GI: 125975271
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components
TIGRFAM ID:
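
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the helper names `gene_length` and `protein_length` are hypothetical, and it assumes that the start and end positions are 1-based and inclusive and that the gene length counts the terminal stop codon (the listed gene sequence ends in TAA).

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length(3289547, 3290629)
print(length)                  # 1083
print(protein_length(length))  # 360
```

Under these assumptions the computed values agree with the Gene Length (1083 bp) and Protein Length (360 aa) fields.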


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAAAG GATTGAGAAT GAAAAAGAAA ATTCTTACGG GAATTATATC ATTTCTGATT 
ACAAGCATAT TGCTAAGCGG ATGCGGAGGT CAAAAAGTTC AAAAGACCAA TTTGGAAGAC
TCTGGTACCG GGCGGGAATT GAAGAAATTT AAAGTGGGTT ATCTGGCGTC GCCGGGGCAC
GTGCTTTACT TTGTAGCGAA AGAAAAAGGA TTTTTTGAAG AAGAAGGGCT GGACGTGGAA
CTGTTTTTGT TTACAAACTC CGGGGAAGGC TTAAATGCAA TCAGTTCCGG CAAAGTGGAC
GTTGGTTCCT TTGGTACGGC GGCACCACTT ACTTTCATTT CGAAAGGGAC TGATTTTGTC
ATCTTTGGCG GGCAGCAAAC CGAGGGACAC GGAATTGTGG CTCTTCCTCA GAAGGCAAAG
GAGCTTACGG ATTTAAACAG CTTTAAAGGC AAAACGATTG CCACTGTAAG GCTTGCCACG
GGAGACGTTA TTTTCAGGGC TGCATTGTCG GAAGCAGGCA TTGACTGGAA AAATGAACTG
ACAATTAATG AGTTGGATTC ACCGGCAGCG GTACTTGAGG CTGTGAAGAA AGGAAGTGTT
GATGCAGGTA TTGTGTGGAC TCCTTTTATT AAACTTGGAG AAAAGCAGGG ACTTGAAATA
GTAAAATACT CCGGAGAACT GGTTAAAATG CATACTTGCT GCCGTCAGGT TGCCCTCTCA
TCTAATGTAA AGGAAAACAA AGAAGACTTT GTAAGGTTTA TGGCAGCTTT GATAAAGGCA
TACAAATTCT ATATGGAAAA TCAGGATGAA ACAGTGGATA TTATTGCGAA ATATGCAAAA
GTTGACAAAG ACATTATAAA GGAAGAAACA TACGGAGGCC ATATATACAG CATACCTGAC
CCTGACAAAG AGGGAGTTAT AAGGTTCTGG GACCTTATAA ACAAGTCCGG ATATATCAGT
TCTGATCTCA ATATTGAAAA TTTTATAGAC ACATCAATCT ATAAGCTTGC TCTTGAAGAT
GTTCTTAAGG AGTATCCAAA TGATGAGGTT TATAAGAAGC TAAAAGCAGA CTTTAAGGAG
TAA
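
A GC content figure like the one in the gene information fields can be recomputed from a sequence listed in this 10-base block layout. The following is a minimal sketch: the function name `gc_percent` is hypothetical, and the sample input is only the first 60-base row of the gene sequence, so its result is not the whole-gene value.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First row of the gene sequence only (illustrative input).
first_row = "ATGAAAAAAG GATTGAGAAT GAAAAAGAAA ATTCTTACGG GAATTATATC ATTTCTGATT"
print(gc_percent(first_row))  # 25.0
```

To check the record's figure, pass the full gene sequence instead of the single row.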
 
Protein sequence
MKKGLRMKKK ILTGIISFLI TSILLSGCGG QKVQKTNLED SGTGRELKKF KVGYLASPGH 
VLYFVAKEKG FFEEEGLDVE LFLFTNSGEG LNAISSGKVD VGSFGTAAPL TFISKGTDFV
IFGGQQTEGH GIVALPQKAK ELTDLNSFKG KTIATVRLAT GDVIFRAALS EAGIDWKNEL
TINELDSPAA VLEAVKKGSV DAGIVWTPFI KLGEKQGLEI VKYSGELVKM HTCCRQVALS
SNVKENKEDF VRFMAALIKA YKFYMENQDE TVDIIAKYAK VDKDIIKEET YGGHIYSIPD
PDKEGVIRFW DLINKSGYIS SDLNIENFID TSIYKLALED VLKEYPNDEV YKKLKADFKE
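
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below shows that relationship for the first 60 bases. It is an illustration, not the database's own procedure: the `translate` helper is hypothetical, and it relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code. Table 11 also permits alternative start codons, which this sketch does not handle; that does not matter here because the gene begins with ATG.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); these amino acid
# assignments are shared by the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence (60 bases, 20 codons).
first_row = "ATGAAAAAAG GATTGAGAAT GAAAAAGAAA ATTCTTACGG GAATTATATC ATTTCTGATT"
print(translate(first_row))  # MKKGLRMKKKILTGIISFLI
```

The output matches the first 20 residues of the protein sequence listed above.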