Gene Tmz1t_1650 details

Gene Information

Locus tag: Tmz1t_1650
Symbol:
ID: 7084069
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thauera sp. MZ1T
Kingdom: Bacteria
Replicon accession: NC_011662
Strand:
Start bp: 1850733
End bp: 1851995
Gene Length: 1263 bp
Protein Length: 420 aa
Translation table: 11
GC content: 69%
IMG OID: 643698670
Product: extracellular solute-binding protein family 1
Protein accession: YP_002355301
Protein GI: 217970067
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
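
The coordinate and length fields above are related by simple arithmetic. A minimal sketch of that check follows, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1850733
end_bp = 1851995

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and adds no residue.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")     # compare with the Gene Length field
print(protein_length, "aa")  # compare with the Protein Length field
```

Under these assumptions the computed values agree with the reported Gene Length (1263 bp) and Protein Length (420 aa).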


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0169608
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACAAGA TGCGTTTCAA GGCCTTGGCC GCAGCGGTCG CTGCCGGCGG TCTCTTCTTC 
GCGCCCGGCT CGGTGGTCCA GGCCAGCGCC CAGCAGCAGG TCCAGCTCTT CCATCGCCTG
CCCGATGCCA AGGCGGGGGC GCTGAAGGAC TTGGTCGAGC GCTTCAACGC GCAGTCCAAG
GACGTGCAGG TCGTGATGTC CGCGGCGGAC TGGCGCTCGG GCGCGCCCCA CCTGATGATC
CTCGAGGGCG ACGACGAGGA GGAGTTCGTC GCCGGCAAGC CGCGCTTCAA GCCGCTCTTC
CAGCTCATGA AGGAAAGTGG CGTGCCGCTG CAGACCCTGC GTCCGCCCGC GATGATGACG
CGCACCCCGG TCGATGCCAA GGGACAGCTG CTGGCGCTGC CGGTCGGCCT GTCGACCCCG
GTGCTGTTCC TCAACCGCGA CGCGCTGCGC CAGGCCGGCC TCAACCCGGA GACCACCCGG
ATCAACACCT GGTTCGATCT GCAGGAAACC CTCGGGCGCC TCGCCGATAC CGGTCACACC
TGCCCCTATA CCGTGGCCGA GCCCGGCCGC GTGATGGTGG AGAACCTCTC GGCCTGGCAT
AACGAGCCGG TGGCCGCGCA GAGCGGCAAG ACCACCGTAC CGAGCTTCAA CGGCATGTTT
CAGGTCAAGC ATGTGGCGAT GATGGCGAGC TGGACGCGTG CGCGCTACCT GCACGTGTTC
GACCAGCAGG CCGAGGCCGA GCAGCGCTTC GCGCGCGGCG AGTGCGCGGT GATCGCCGCG
CCCTCGGCGA GCTGGACCGA CTTCCGCCGT GCCGGCAAGG TCGATGTGGC GGTATCCAAG
CTGCCCTACT ACGACGACTT CCCCGGTGCG CCGCAGAACA CCATCGCCGA CGGTCCCGCA
CTGTGGGCCT CGGCAGGCAA GAAACCGGCC GAATACAAGG CCGTGGCGCG CTTCGTGAGT
TTCTGGCTGC AGCCCGACAA CCAGGTCGCG TGGCAGCGCG AGACCGGCTA CCTGCCGCTC
AACCGCGCCG GCCTGCTGGC CTCGCGCAGC GAGCTGCTCG GCAACGACCT CGAGAACATC
CAGGTCGCGG TCGACCAGCT CGGCGGCAAG CCCGCCACGC CGCAGTCGTC GGCGCAGCCC
GTGGTCGAGC GCCAGAAGGT GCGCCGCATC CTCGATGAGG AACTCGCCGG CGTATGGGCC
GACGAGAAGG CCGCGAAGGA AGCGCTCGAC AACGCGGTGA CGCGCGCCCG CAACGGCAAC
TGA
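
The gene sequence is displayed in blocks of ten bases, so it has to be joined before any statistic such as GC content is computed. The sketch below shows one way to do that. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the record does not say how its GC content field was calculated; the plain (G + C) / length definition is assumed here.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above, pasted exactly as displayed.
first_line = "ATGAACAAGA TGCGTTTCAA GGCCTTGGCC GCAGCGGTCG CTGCCGGCGG TCTCTTCTTC"
seq = clean_sequence(first_line)
print(len(seq), "bases,", round(100 * gc_fraction(seq)), "% GC")
```

Pasting the full listing into `clean_sequence` instead of a single line gives a value that can be compared with the GC content field.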
 
Protein sequence
MNKMRFKALA AAVAAGGLFF APGSVVQASA QQQVQLFHRL PDAKAGALKD LVERFNAQSK 
DVQVVMSAAD WRSGAPHLMI LEGDDEEEFV AGKPRFKPLF QLMKESGVPL QTLRPPAMMT
RTPVDAKGQL LALPVGLSTP VLFLNRDALR QAGLNPETTR INTWFDLQET LGRLADTGHT
CPYTVAEPGR VMVENLSAWH NEPVAAQSGK TTVPSFNGMF QVKHVAMMAS WTRARYLHVF
DQQAEAEQRF ARGECAVIAA PSASWTDFRR AGKVDVAVSK LPYYDDFPGA PQNTIADGPA
LWASAGKKPA EYKAVARFVS FWLQPDNQVA WQRETGYLPL NRAGLLASRS ELLGNDLENI
QVAVDQLGGK PATPQSSAQP VVERQKVRRI LDEELAGVWA DEKAAKEALD NAVTRARNGN
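
The protein sequence can be cross-checked against the gene sequence by translating it. The following is a minimal sketch, not the database's own pipeline. It relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons), and it assumes the sequence is given in the coding orientation. The `translate` helper is hypothetical.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G at each position) paired with the
# amino acids of the standard code, which table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the ten-base block spacing
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons of the gene sequence above.
print(translate("ATGAACAAGA TGCGTTTCAA G"))
```

The seven codons used here translate to MNKMRFK, the opening residues of the protein sequence listed above.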