Gene Cthe_1939 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1939 
Symbol 
ID4810722 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2314949 
End bp2316283 
Gene Length1335 bp 
Protein Length444 aa 
Translation table11 
GC content39% 
IMG OID640107355 
Productmagnesium transporter 
Protein accessionYP_001038350 
Protein GI125974440 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG2239] Mg/Co/Ni transporter MgtE (contains CBS domain) 
TIGRFAM ID[TIGR00400] Mg2+ transporter (mgtE) 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0382273 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGAAA TGATTCTGGA GCTCATTGAA CAAGGCAAAT ATGCCGAAGT GAGAAACAAA 
ATTACTGAAA TGAATGAAGT TGACATTGCC CAACTTTTGG AAGAGACGGA CAAGCATAAG
CTGCTGGTGA TATTCAGGAT ATTGCCAAAG GATGTTGCTG CCGGGGTCTT TTCGTATATA
TCATATGAAT TACAGAGATA TATTGTTGAA TCCATTACCG ACAGCGAAAT AAAGAACATT
TTGGATGAAT TGTTCCTGGA CGATACAATT GACTTTTTGG AGGAAATGCC TTCAAACATT
GTAAAAAGGG TTCTTAAGAA CGCGGATGAA ACAACGAGAA AGCTTATAAA CCAGTTTTTG
AACTATCCTG AAAACTCGGC CGGGAGCATA ATGACCATAG AGTATGTCGA CTTGAAAAAG
GAAATGACGG TAAAACAGGC TTTGCAGCAT ATCAAGGAGA CAGGGATAGA CAAAGAGACG
ATTGATACTT GCTATATTTT GGATGACTCC AGAAAACTTG AAGGTGTAAT ATCAATCAGA
AAGCTGATAT TAAGTGACGA GTCCGTGGTA ATTAAAGACA TCATGGATGC AGATGTAATA
TATGTAAACA CACATGACAA GCAGGAAGAA ATTGCAGCAT TGTTTAAAAA ATATGATTTT
CTTTCCATGC CTGTGGTTGA TAATGAACGA AGACTGGTCG GTATAGTGAC AATAGATGAT
ATTGTGGATG TTATTGAGCA GGAAAATACT GAAGATTTCC AGAAAATGGC GGCCATTCAG
CCTTCCGAAA AAGAGTATTT GAAGACAAAC GCGTTGGTAT TGGCCAAGCA CAGAATCACA
TGGCTTTTGG TATTGATGCT TTCTGCAACT TTTACGGGCA ATATTATAAA AAAATTTGAT
GAAGTATTGC AATCAATTGT TATACTGGCT TCTTTCATCC CGATGCTCAT GAATACCGGT
GGAAATGCCG GTTCCCAGTC ATCGGCACTT ATAATCAGGG GCCTGTCCTT GGGAGAAATA
AGAGCGAGGG ATTTTTTAAA GGTTTTATGG AAAGAAATTC AGGTAAGCTG CATTGTAGGA
GTAGTTTTAG CTGCTGTGAA TTTTGTAAGA ATATATTATT TTGAAAAAGC AGGTTTTCTA
GTGTCCGCAA CCGTATGTCT AACCTTGTTT TTTACGATTA TGTTGGCGAA AGTCATCGGA
GGGCTGCTTC CCATCATGGC AAAGAAACTT AAACTTGACC CTGCGATTAT GGCAGGTCCG
CTGATAACAA CCGTGGTTGA TGCGGTAACT CTTACCATAT ATTTTACCAT AGCAACGTGG
TTGTTGGACA TATAA
 
Protein sequence
MKEMILELIE QGKYAEVRNK ITEMNEVDIA QLLEETDKHK LLVIFRILPK DVAAGVFSYI 
SYELQRYIVE SITDSEIKNI LDELFLDDTI DFLEEMPSNI VKRVLKNADE TTRKLINQFL
NYPENSAGSI MTIEYVDLKK EMTVKQALQH IKETGIDKET IDTCYILDDS RKLEGVISIR
KLILSDESVV IKDIMDADVI YVNTHDKQEE IAALFKKYDF LSMPVVDNER RLVGIVTIDD
IVDVIEQENT EDFQKMAAIQ PSEKEYLKTN ALVLAKHRIT WLLVLMLSAT FTGNIIKKFD
EVLQSIVILA SFIPMLMNTG GNAGSQSSAL IIRGLSLGEI RARDFLKVLW KEIQVSCIVG
VVLAAVNFVR IYYFEKAGFL VSATVCLTLF FTIMLAKVIG GLLPIMAKKL KLDPAIMAGP
LITTVVDAVT LTIYFTIATW LLDI