Gene Cthe_2943 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2943 
Symbol 
ID4810226 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp3457697 
End bp3458914 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content41% 
IMG OID640108366 
ProductABC-2 type transporter 
Protein accessionYP_001039334 
Protein GI125975424 
COG category[C] Energy production and conversion
[P] Inorganic ion transport and metabolism 
COG ID[COG1668] ABC-type Na+ efflux pump, permease component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAAGCTG GCAGGCACGT ATGGATTGTA TTTAAAAAAG AGGTAAAGGA TATTGTAAGG 
GACAAAAAGA CCCTTCTTAC CAGTATATTT GTTCCAATGC TGTTAATTCC CGTTTTAAGC
ATGCTTGTCG GAGGAAGTAT AGAAAAGCTC AACAGGGATA TAAGTGAAAA CGTTACCATA
GCATTGACCA AAGAATCCGA TACCGATGAG ATTAGTAATA TAGTGGAAAA TCAAATAATC
AGGGACTATC CCAACATAAA ACTGATTGAA GTGGATGACC CCATAAAAGC TATTAATGAA
TCAAAAGTGA GGTTGGTATT GGATTTTGAG AAGGATTATG CGTCCAAATT GAAAGAGGGC
AAACCTTTTG TGATAAAGCT TATATATGAC AAGTCCCAGA CCAAATCGGG AGGAAGTCTT
GGCATATTAT GGGATGCAAT TGAAAGTTTT AATGAGAGAA TCGTGAAAGA AAGACTTAAT
TCTTTGGGAA TAAGCCCGGA AGTATTGACA CCCGTTGTGA TTGAAGAAAC CAATATTGCC
GATGAGGAAA AAACCAGTGC AAGTATCCTT GCCATGTCAC TTCCCATGAT GCTTGTGATT
TTGATAGCAT CCGGGGGTGT TGCTGCGGCT ACGGACCTTG TGGCAGGGGA AAAAGAAAGA
AATACTTTTG AACCTTTGCT TACCACCAAA CCCAGCAGAT TTTCCCTTCT TTTCGGCAAA
TATCTGGCAG TGACTTTGTT TTCTTTTGTG TCGGTTGTTG CAACAATGAT AGGAGCAGCT
GCGGGATTTA TGATAGACCC GTCCACCATG GTTATGGGAG TCGGTACGGA TATTACAGGT
TTTAGCATAC CGCCGTTGGC AGTTTTTTTG GCCGTCATAA TTTCAATAAC CTTTGGAATG
ACTTTTTCAG GCCTAGAGAT AGCCCTCAGC ACCTATGCAA AATCCTTTAA AGAGGCCCAG
ACATACATGT CTTTTCTGTT GATAATAGTT ATGATTCCCG CTTTTTCCAC AATGCTTATG
CAGCCCAATG ACATTCCGGC ATATATGTTT CTTGTACCTG TTATGAATAC GCTGGCAGCT
TTCAAGATAG TGCTGGGTGG CAGTATCAAC TATTTTTATT TACTTATGGC TCTGGGGTCG
TCGTTGGTGT ATGTCGGCAT TACCCTGTGG CTTGCGGCCA CACTGTTTAA AAAAGAAAAA
GTGCTGTTTA GAAGCTAA
 
Protein sequence
MKAGRHVWIV FKKEVKDIVR DKKTLLTSIF VPMLLIPVLS MLVGGSIEKL NRDISENVTI 
ALTKESDTDE ISNIVENQII RDYPNIKLIE VDDPIKAINE SKVRLVLDFE KDYASKLKEG
KPFVIKLIYD KSQTKSGGSL GILWDAIESF NERIVKERLN SLGISPEVLT PVVIEETNIA
DEEKTSASIL AMSLPMMLVI LIASGGVAAA TDLVAGEKER NTFEPLLTTK PSRFSLLFGK
YLAVTLFSFV SVVATMIGAA AGFMIDPSTM VMGVGTDITG FSIPPLAVFL AVIISITFGM
TFSGLEIALS TYAKSFKEAQ TYMSFLLIIV MIPAFSTMLM QPNDIPAYMF LVPVMNTLAA
FKIVLGGSIN YFYLLMALGS SLVYVGITLW LAATLFKKEK VLFRS