Gene Tmz1t_1035 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_1035 
Symbol 
ID7084019 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp1132704 
End bp1134044 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content68% 
IMG OID643698053 
Productsodium:neurotransmitter symporter 
Protein accessionYP_002354693 
Protein GI217969459 
COG category[R] General function prediction only 
COG ID[COG0733] Na+-dependent transporters of the SNF family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCAAGC ACGCACACAG CCAATGGTCG TCGCGGATGG GCTTCGTGCT CGCCGCCACC 
GGCTCCGCCG TCGGCCTCGG CAACATCTGG AAGTTTCCCT ACATGGTCGG CCAGAGCGGC
GGCGCCGCCT TCGTGCTGGT CTACCTGGCC TGCATCGCCT TCATCGGCGT GCCCATTCTG
GTCGCGGAGT GGATGATCGG CCGGCGCGGG CAGAAGAACC CGATCAACAC CATGGCCCAG
GTCGCGCGCG ACAACGGCCA CAGCACCAAC TGGGCCGTGG TGGGCGCGAT CGGCGTGCTC
GCCGCCTTCC TGATCCTGTC CTTCTACTCG GTGATCGGCG GCTGGGCGCT GGCCTACATG
CGCGACGCCG CCACCGGCGC CTTCATCGGC CTGGACAAGG CGGCGATCGG CGGCGCCTTC
GAGGGCTTCC TCGCCCGCCC GGCCGAGCTG CTGACCTGGC ACTCGATCTT CATGCTGCTC
ACCGTCGTCG TCGTGGCGCT CGGCGTGTCC GCCGGCCTGG AGCGCGGCAC CAAGCTGATG
ATGCCGGCGC TCGGGGTGAT CCTGCTGGTG CTGGTCGGCT ACGCGATGAC CACCGGCAGC
TTCGGCCAGG GCCTCGCCTA CCTCTTCAAC CCGGACTGGA GCAAGCTCGA CGGCAAGGTG
CTGCTCGCCG CGCTCGGCCA CGCCTTCTTC ACCCTGTCGC TGGGCATGGG CATCATGATG
GCCTACGGCT CCTACCTCGG GCAGGAGGTG AACCTGCTGC GCGCCGCGCG CACCGTGGTG
ATCATGGACA CGGTGTTCGC GCTGTGCGCC GGCATGGCGA TCTTCCCGAT CGTGTTCGCC
AACGGCCTGG ACCCCGCGGC CGGCCCCGGC CTGGTGTTCG TGACCCTGCC GCTGGCCTTC
GGCCACATGG GCGGCGGCCT GGTCATCGGC GCACTGTTCT TCCTGCTGCT GACCTTCGCC
GCGCTGACCT CGTCGATCTC GCTGCTCGAG CCGGTGGTGG AGCTGATCGA GGAGCGCACC
CCGCTCGGCC GCGTCGCCGC CACGCTGATC GCCGGCATCA CCATCTGGGC GCTGGGCATC
GCCGCGCTGC TGTCCTTCAA CGTGTGGAGC GACGTCAAGC TGCTCGGCAT GAACATCTTC
GACCTGCTCG ACTATGCGAC CAGCAAGTTC ATGCTGCCGC TCGCCGGCCT GGGTGCAATC
GTGTTCGCGG CGTGGAAGCT GGACCAGCAG GGCGTGAAGG CGGAACTGGG CCTTGGCGAT
GCCACATTCG GCTTGTGGAC CCTGCTGTCG CGCTACGTCG CGCCGGTGGG CGTGCTGTTC
GTGTTCTGGA GCAACCTGTA G
 
Protein sequence
MAKHAHSQWS SRMGFVLAAT GSAVGLGNIW KFPYMVGQSG GAAFVLVYLA CIAFIGVPIL 
VAEWMIGRRG QKNPINTMAQ VARDNGHSTN WAVVGAIGVL AAFLILSFYS VIGGWALAYM
RDAATGAFIG LDKAAIGGAF EGFLARPAEL LTWHSIFMLL TVVVVALGVS AGLERGTKLM
MPALGVILLV LVGYAMTTGS FGQGLAYLFN PDWSKLDGKV LLAALGHAFF TLSLGMGIMM
AYGSYLGQEV NLLRAARTVV IMDTVFALCA GMAIFPIVFA NGLDPAAGPG LVFVTLPLAF
GHMGGGLVIG ALFFLLLTFA ALTSSISLLE PVVELIEERT PLGRVAATLI AGITIWALGI
AALLSFNVWS DVKLLGMNIF DLLDYATSKF MLPLAGLGAI VFAAWKLDQQ GVKAELGLGD
ATFGLWTLLS RYVAPVGVLF VFWSNL