Gene Tmz1t_1330 details

Gene Information

Locus tag: Tmz1t_1330
Symbol:
ID: 7084451
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thauera sp. MZ1T
Kingdom: Bacteria
Replicon accession: NC_011662
Strand:
Start bp: 1469795
End bp: 1470844
Gene Length: 1050 bp
Protein Length: 349 aa
Translation table: 11
GC content: 70%
IMG OID: 643698347
Product: extracellular solute-binding protein family 1
Protein accession: YP_002354985
Protein GI: 217969751
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0687] Spermidine/putrescine-binding periplasmic protein
TIGRFAM ID:
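
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes one terminal stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by a CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default,
    that the last codon is a stop codon that is not translated.
    """
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = length_bp // 3
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(1469795, 1470844)
print(length)                     # 1050
print(protein_length_aa(length))  # 349
```

Under these assumptions the result agrees with the listed gene length of 1050 bp and protein length of 349 aa.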


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTTCGTC GCTTGAGCCT GCATCGCCTC CCCTGGCTTG TGCTCGCGCT CGCACTCTCC 
GGCACGGCCT GCGCGACCGA GGTGTTGCGC GTGCTGAGCT GGCCGGGCTA TGCCGACGCC
GACGTGGTGC AGGCCTTCGA GGCCCGCACC GGCGCCCGGG TCGAAGTCAC CCAGGTCGAT
TCCGACGAAA CGCTGTGGCA GAAGCTCAGC ACGAACGACG CGACCGACTA CGACGTGTTC
GCGGTCAATA CCGCCGAGCT GCAGCGCTAC ATCGACCGCG GCGTGGCCGT GGCGATCGAT
CCGGCCGCGC TGCCCAACCT CGGTGCGCAA TTGCCGCGCT TCCGCAACCC GGCAACGCTG
CCCGGCACGA CTCGCGATGG CAAGCTGTAC GCCATCCCCT ATGCCTGGGC CGAAATGGGC
CTGATCTACG ATCGGCGCCA GTTCGACGCG CCACCCCAGT CCATCGCCGC GCTGTGGGAC
GCCCGCTACC GCGGCAAGGT GCTGGTGTAC AACAGCGGCT CGCACAACTT CTCGCTCGCC
GCACAGATGC TGGGCAAGGC ATCGCCCTTC CGCCTCGACG CCGCCGACTG GGCGCCGGCG
GTCGAGCGTC TGGTCGAGTT GCGCCGCAAC CTGCTGACCT TCTACGCCCA GCCGGAAGAA
TCCGCGCATC TGTTCGTCAG CCGCGGCGCG GCGCTGATGT ACGCCAATTA CGGCACCCAG
CAGCTGCAGC TCCTGCGCGC AGCGGGGGCG GACGTGGGCT ATGCGATCCC GCGCGAAGGC
GCGCTCGCCT GGCTCGACTG CTGGGTGGTG ACGCGCGGCG CACGTAACCA GGCGCTCGCG
CTGGCGTGGA TCGACCACCT GCTCGGCACC GGCCCGGCGC ACGTGCTGAG CGCGCGCCAC
GGCCTCGACA ACACCCGCGA CCCGGCGCCG CACCAGGCCG AAACCGACCG CCTGGTCTGG
CTCGAACCGG TCGAGGACGT CGAACGCCGC AACCTGCTGT GGGAGCGCAT CCTCTCCGGC
GACCGCGGCG CACGGGTGCT CGCGCCATGA
 
Protein sequence
MLRRLSLHRL PWLVLALALS GTACATEVLR VLSWPGYADA DVVQAFEART GARVEVTQVD 
SDETLWQKLS TNDATDYDVF AVNTAELQRY IDRGVAVAID PAALPNLGAQ LPRFRNPATL
PGTTRDGKLY AIPYAWAEMG LIYDRRQFDA PPQSIAALWD ARYRGKVLVY NSGSHNFSLA
AQMLGKASPF RLDAADWAPA VERLVELRRN LLTFYAQPEE SAHLFVSRGA ALMYANYGTQ
QLQLLRAAGA DVGYAIPREG ALAWLDCWVV TRGARNQALA LAWIDHLLGT GPAHVLSARH
GLDNTRDPAP HQAETDRLVW LEPVEDVERR NLLWERILSG DRGARVLAP
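
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch, not the database's own procedure: it builds the standard codon-to-amino-acid assignments (which translation table 11 shares with the standard code; table 11 differs only in its permitted start codons) and translates the first line of the gene sequence shown above. The names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids in the conventional TCAG codon order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used above) is ignored,
    and any trailing partial codon is dropped.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


first_line = "ATGCTTCGTC GCTTGAGCCT GCATCGCCTC CCCTGGCTTG TGCTCGCGCT CGCACTCTCC"
print(translate(first_line))  # MLRRLSLHRLPWLVLALALS
```

The output matches the first 20 residues of the protein sequence; passing the full gene sequence to the same function should yield the full 349-residue product, with the terminal TGA read as a stop.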