Gene Tmz1t_0628 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_0628 
Symbol 
ID7084566 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp706549 
End bp707556 
Gene Length1008 bp 
Protein Length335 aa 
Translation table11 
GC content67% 
IMG OID643697655 
Productsulfate ABC transporter, periplasmic sulfate-binding protein 
Protein accessionYP_002354297 
Protein GI217969063 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1613] ABC-type sulfate transport system, periplasmic component 
TIGRFAM ID[TIGR00971] sulfate/thiosulfate-binding protein 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGCTCG GCCCCCTTCG CAGCACCCTG CTCACCCTGG CGCTCGCCAC CGGCCTCGCC 
GGCGTCGCCC ACGCGCAGAC CACCCTGCTC AACGTCTCCT ACGACCCCAC ACGCGAGCTC
TACCAGGACT TCAACCCCGA GTTCGCCAAG CACTGGAAAG CCCGCACCGG CGAGACCGTC
ACCATCAAGC AGTCCCACGG CGGCGCCGGG AAGCAGGCGC GCGCGGTGAT CGACGGCCTC
GAAGCCGACG TTGTCACGCT GGCCCTGGCC TACGACATCG ACGCCATCGC CGAGCAGACG
GGCAAGATCC CCGCCGACTG GCAGAAACGC CTGGCCAACA ACAGCTCGCC CTACACATCG
ACCATCGTCT TCCTGGTACG CAAGGGCAAC CCCAAGGGCA TCAAGGACTG GGGCGACCTG
GTCAAGCCGG GCGTCGAGGT CGTCACCCCC AACCCCAAGA CCTCCGGCGG CGCGCGCTGG
AACTACCTCG CCGCCTGGGC CTACGCGCTC AAGCAGCCGG GCGGCAACGA ACAGACCGCG
CAGGTCTTCG TCACCGAACT GATCAAGCAC GTGCCGGTGC TGGATTCGGG CGCCCGCGGC
GCCACCAACA CCTTCGTCCA GCGCGGCATC GGCGACGTGC TGCTGGCGTG GGAGAACGAG
GCCTTCCTGT CGATCAACGA ACTCGGTCCG GACAAGTTCG AGATCGTCGT GCCCTCGATC
TCGATCCGCG CCGAACCGCC GGTCACCGTC GTCGATGGCG TGGCGAAAAA GCGCGGCACC
GAGAAGCTCG CCCAGGCCTA CCTCGAGTAC CTGTACTCGC CGGTCGGTCA GAAGATCGCA
GCCAAGCACT ACTACCGACC GGTCAGGCCC GAACACGCCG ACCCGGCCGA CGTCGCACGC
TTCCCGAAAG TCGAGCTGAT CACCATCGAG GACCTCGGCG GCTGGCAGGC GGCGCAGAAG
AAGCACTTTG CCGACGGCGG GGTGTTCGAC CAGATCTACG CCAGGTAG
 
Protein sequence
MKLGPLRSTL LTLALATGLA GVAHAQTTLL NVSYDPTREL YQDFNPEFAK HWKARTGETV 
TIKQSHGGAG KQARAVIDGL EADVVTLALA YDIDAIAEQT GKIPADWQKR LANNSSPYTS
TIVFLVRKGN PKGIKDWGDL VKPGVEVVTP NPKTSGGARW NYLAAWAYAL KQPGGNEQTA
QVFVTELIKH VPVLDSGARG ATNTFVQRGI GDVLLAWENE AFLSINELGP DKFEIVVPSI
SIRAEPPVTV VDGVAKKRGT EKLAQAYLEY LYSPVGQKIA AKHYYRPVRP EHADPADVAR
FPKVELITIE DLGGWQAAQK KHFADGGVFD QIYAR