Gene Cthe_1410 details


Gene Information

Locus tag: Cthe_1410
Symbol:
ID: 4809071
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 1728803
End bp: 1729804
Gene Length: 1002 bp
Protein Length: 333 aa
Translation table: 11
GC content: 39%
IMG OID: 640106833
Product: cation diffusion facilitator family transporter
Protein accession: YP_001037834
Protein GI: 125973924
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0053] Predicted Co/Zn/Cd cation transporters
TIGRFAM ID: [TIGR01297] cation diffusion facilitator family transporter
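
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon. The function names are illustrative only and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1728803
END_BP = 1729804
GENE_LENGTH_BP = 1002
PROTEIN_LENGTH_AA = 333


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions, 1729804 - 1728803 + 1 gives 1002 bp, and 1002 / 3 - 1 gives 333 aa, which agrees with the listed values.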


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGGAAACA ATTTTGATGA ATACGGTAGT GTTTGGAATG GAGGCCTTGC GTTGATTAAA 
CTTTTAATCA GGTGGTTTAT CAAGGATTAT CAAAATGTGG ATAACAAAAA GGTAAGGGAA
GCTTACGGAG TATTGTCGGG AGTAACAGGC ATTATTTGCA ACGTATTTTT GTTTATTGTA
AAAATAACTG TGGGACTGGT CATGAACAGT ATTGCAGTAA TTTCTGACGC TTTCAACAAT
TTAAGCGATT TAGGTTCGTC ATTGGTTGGA ATACTCGGTG TCAAGCTTAG CAACAGGCCT
CCGGACGAGG AACATCCTCA TGGCCATGGA AGGTATGAGT ATATATCATC TCTTGTGGTG
TCGTTTATTA TATTTGGTGT TGGCCTTGAA CTTTTGAGAA ATTCTTTTTG GAAAATAATC
AAACCTGAAG AAGTGACATT GAGTACAATA TCGATATTAT TGCTCGTTAT CTCAGTGGCT
GTAAAATTGT GGATGTTTTC ATACAATAGA TATATAGGAA AAATAATCAA TTCGGGAATT
AACAAAGCGA CAGCCCAGGA TAGTCTGAAC GATGCCATTG CCACAACCGC AGTGCTTGCA
GGAACTCTAA TTGGAAGGTT TGTTTCTTTT CCCCTGGATG GAATTATGGG TTTAATCATA
TCCGCACTGA TTATGTATAC AGGATTTGGT ATTGCGAAAG ATTCGGTGGA CCTGCTTCTG
GGCCTGTGTC CTAACTCTGA GCTCATCGAG AGCATAAATT CATATTTTTT GGTCGGAGAA
AAAATAAAGG GCACTCATGA CTTGAAAGTT CATGATTACG GTCCCGGCAG AATAAGCGCG
TCTATTCATG CCGAAGTGCC TGAAGGGGCA GACATAGTTG AAATACATTC AATAATTGAT
GAAATCGAGC AAAGAATAAA AAATGAGCTC GGAATTGACA TAGTCGTTCA TATGGATCCT
GTTGAAGAGA AAAAAGAGGA TTGTTGTAAC GACGATAAAT AA
 
Protein sequence
MGNNFDEYGS VWNGGLALIK LLIRWFIKDY QNVDNKKVRE AYGVLSGVTG IICNVFLFIV 
KITVGLVMNS IAVISDAFNN LSDLGSSLVG ILGVKLSNRP PDEEHPHGHG RYEYISSLVV
SFIIFGVGLE LLRNSFWKII KPEEVTLSTI SILLLVISVA VKLWMFSYNR YIGKIINSGI
NKATAQDSLN DAIATTAVLA GTLIGRFVSF PLDGIMGLII SALIMYTGFG IAKDSVDLLL
GLCPNSELIE SINSYFLVGE KIKGTHDLKV HDYGPGRISA SIHAEVPEGA DIVEIHSIID
EIEQRIKNEL GIDIVVHMDP VEEKKEDCCN DDK
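
The gene sequence begins with GTG, while the protein sequence begins with M. This is consistent with translation table 11, where alternative start codons such as GTG are read as methionine at the initiation position. The sketch below shows one way to strip the 10-character block formatting, compute GC content, and translate a sequence. The helper names are hypothetical, the start-codon set is a deliberately small subset (ATG, GTG, TTG), and the code is not necessarily how the database derived its values.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Table 11 uses these same assignments for elongation codons.
_BASES = "TCAG"
_AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}

# Assumption: a small subset of the start codons allowed by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate a CDS, reading a start codon in first position as M
    and stopping at the first stop codon."""
    seq = clean_sequence(seq)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is read as methionine
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First three blocks of the gene sequence in this record.
    print(translate("GTGGGAAACA ATTTTGATGA ATACGGTAGT"))  # MGNNFDEYGS
```

Pasting the full gene sequence into `translate` and `gc_percent` lets the result be compared against the listed protein sequence and GC content.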