Gene Tmz1t_2852 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_2852 
Symbol 
ID7873260 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp3087019 
End bp3088008 
Gene Length990 bp 
Protein Length329 aa 
Translation table11 
GC content74% 
IMG OID643699773 
Productputative sulfonate ABC transporter, periplasmic sulfonate-binding protein 
Protein accessionYP_002889828 
Protein GI237653514 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.174204 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTGCAGC TGTTCGCCCG CCTGATCGGC ATCGTCCTGC TGGCGCTCGC CGCGGGCGGC 
GCCGCATTCG CCGCAGACCG GCCGGTGGTC CGTGTCGGCG TGCTGCAGTT CGGCACGGTG
AGCTGGGAGC TCGAGACCAT GCAGCGACAC GGCCTGCTCG AGCGCGAGGG CGTGGACATC
CGCGTCGTGC CGCTCGCGCT CAAGGACGCC GCCAACGTCG CCCTCCAGGG CGGCGAGGTC
GATGTGATCG TCAACGACTG GCTGTGGGTG ACGCGCATGC GCTCGGAGGG GGCGGATTTC
GTCTTCGTGC CCTTCTCGCA GGCGGTCGGC GGCATCCATG CGCGTCCGGA CGCCGGCATC
GCCAGCCTCG CCGACCTGCG CGGCAAGCGC CTGGGCGTGG CTGGCGGCGC GCTCGACAAG
AGCTGGCTGC TGCTGCGCGC GTATGCCCGC AAGACCGTGG GCGAGGACGC CGCGAGCTTC
CTGCGCCCGC AGTTCGCCGC GCCGCCGCTG CTCAACGAGC TGGTGACGCG CGGCGAGCTG
CCGGCGGCGA TGAACTTCTG GCATTACGGT GCCCGCCTCG CCGCCGCCGG CATGCCCGAA
GTGCTGGGCA TGAAGGAGAT CCTCGCCACG CTCGGCATCG GCGACGAGAT GCCGCTGGTC
GGCTGGGTGT TCGGCGAGCG CTGGGCGCGC GCCAACCCGG CGGCGATCGC GGGCTTCCTG
CGCGCCTCCG CGGCGGCCAA GGCGTTGCTG CGCGAGTCCG ACGCGGCCTG GGAGGCGCTG
CGTCCGTCGA TGCGCGCCGA GGACGAGGCC AGCTTCGTCG CGCTGCGCGA GGGTTTCCGC
GCGGGCATCC CGCACGCATC GGGCGAAGAG GGCGAGCGCG CCGCCGCGCG CGCCTTCGCG
ATCCTCGCTG CGGAAGGGGG CGAGGCCCTG GTCGGGCGCG CCCGCGAGAT CGCGCCGGGC
ACCTTCTGGC ATGGAGGCGG AGGGCGGTGA
 
Protein sequence
MLQLFARLIG IVLLALAAGG AAFAADRPVV RVGVLQFGTV SWELETMQRH GLLEREGVDI 
RVVPLALKDA ANVALQGGEV DVIVNDWLWV TRMRSEGADF VFVPFSQAVG GIHARPDAGI
ASLADLRGKR LGVAGGALDK SWLLLRAYAR KTVGEDAASF LRPQFAAPPL LNELVTRGEL
PAAMNFWHYG ARLAAAGMPE VLGMKEILAT LGIGDEMPLV GWVFGERWAR ANPAAIAGFL
RASAAAKALL RESDAAWEAL RPSMRAEDEA SFVALREGFR AGIPHASGEE GERAAARAFA
ILAAEGGEAL VGRAREIAPG TFWHGGGGR