Gene Moth_1683 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1683 
Symbol 
ID3833283 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1721972 
End bp1722898 
Gene Length927 bp 
Protein Length308 aa 
Translation table11 
GC content55% 
IMG OID637829608 
Productcation diffusion facilitator family transporter 
Protein accessionYP_430528 
Protein GI83590519 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0053] Predicted Co/Zn/Cd cation transporters 
TIGRFAM ID[TIGR01297] cation diffusion facilitator family transporter 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000676071 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.070124 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATGTGC GGACCAGAGC TGCCAGGGTA TCTATCTTTT CCAACATAAT CCTGGTGCTG 
GGTAAACTGG GGATCGGTTA CTGGATGCAC TCGGTCAGTG TCATGTCGGA AGCCATCCAC
TCCGGCCTGG ACCTGGTGGC GGCGGCGATA GCCTATTTTT CCGTCCGGGA AGCCAGCAAG
CCGGCTGATG CCGAGCACCG CTACGGCCAT GGTAAAATTG AAAATATTTC GGGTACCATT
GAAGCCCTGC TGATTTTCCT GGCAGCCCTC TGGATTATCT ATGAAGCAAT CAAAAGGTTT
ATCAGCGGCA GCCATGCCAT TAGCGAACCC CTGACCGGCG TGGCTGTTAT GGGCGGGGCC
GGCGTAGTCA ACTACCTGGT TTCCCGTTAT CTCTTCCGGG TTGCGAAAGA TACGGACTCC
ATCGCCCTGG AGGCCGACGC CTGGCACCTG CGTACCGATG TTTATACTTC CGCCGGGGTA
ATGCTGGGCC TGGCAGCCCT TTATTTTACC GGTTTCCAAT GGCTGGATCC CCTGGTGGCC
CTGGTGGTAG CCGCCATGAT CATCAAGGCG GCCTACCATT TAACCCGGGA GGCCATGCTG
CCCCTGATGG ATGTCAGCCT GCCGGCTGAA GAAGAAGAGG TAATTAAAGA AATTATCGCC
CGCCATGCCC ATGAGTATGT TGAATTCCAT AAATTACGCA CCCGCAAGGC CGGCCGGGAC
CGCCAGGTAG ACCTGCACCT GGTGGTACCG CGTTACAAGC ATATCGATTA TGTCCATAAC
CTCTGTGAGC ATATTGGCGA TGAGATAAGA GCAGCTCTAC CTTACACCGA TGTTTTAATC
CATGCCGAAC CCTGCTCTTC AGCGGTGGAT TGCCAGGTGT GTACCACCTG CCCGGAGAAG
GAAAATCGTT CCTCGAAGGC GAATTGA
 
Protein sequence
MDVRTRAARV SIFSNIILVL GKLGIGYWMH SVSVMSEAIH SGLDLVAAAI AYFSVREASK 
PADAEHRYGH GKIENISGTI EALLIFLAAL WIIYEAIKRF ISGSHAISEP LTGVAVMGGA
GVVNYLVSRY LFRVAKDTDS IALEADAWHL RTDVYTSAGV MLGLAALYFT GFQWLDPLVA
LVVAAMIIKA AYHLTREAML PLMDVSLPAE EEEVIKEIIA RHAHEYVEFH KLRTRKAGRD
RQVDLHLVVP RYKHIDYVHN LCEHIGDEIR AALPYTDVLI HAEPCSSAVD CQVCTTCPEK
ENRSSKAN