Gene Tmz1t_3551 details


Gene Information

Locus tag: Tmz1t_3551
Symbol:
ID: 7873057
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thauera sp. MZ1T
Kingdom: Bacteria
Replicon accession: NC_011662
Strand:
Start bp: 3891671
End bp: 3893266
Gene Length: 1596 bp
Protein Length: 531 aa
Translation table: 11
GC content: 67%
IMG OID: 643700492
Product: extracellular solute-binding protein family 5
Protein accession: YP_002890522
Protein GI: 237654208
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
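
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. Both are assumptions about how the record was generated, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(3891671, 3893266)
print(length, protein_length_aa(length))
```

Under these assumptions the values agree with the record: 1596 bp and 531 aa.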


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.186732
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACAAGA CCCGACTTGC CCTCGCCCTG CCTTTCGCCC TCGCGACGCT CGCCAGCGTG 
GCGGCCGCCC CCGCGGCTGC ACAGACCGTG CGCTGGGCGG CGGCCGGGGA CGCGCTGACC
ATGGACCCGC ATTCTCAGAA CGAGGGCCCG ACCCACGTCA TGAACCACCA GGTCTACGAC
TCGCTGGTGT TCCGCGACCA GGCCATGAAG CTGGCGCCGC GGCTGGCGAC CGCGTGGAAG
ATCACCGAGG ACCCCAACGT GTGGGAGTTC AAGCTGCGCG AGGGCGTGAA GTACCACAAC
GGCAACCCCT TCACGGCCGA CGACGTGGTG TTCTCCATCC AGCGTGCCAA GCACGAGAAC
TCGGACATGA AGGGCCTGCT CACCAGCGTG GTCGAGGTGG TGAAGGTCGA CGAGCACACC
GTGCGCATGC GCACCGACGG CCCCAACCCG CTGCTGCCCA ACAACCTGAC CAACCTCTTC
ATCATGGACC GCGAGTGGTC CGAGGCGAAC AAGGTCATGC TGCCGCAGAA CTACAAGGCC
GGCGACGAGA CCTTCGCGGT GCGCAATGCC AACGGCACCG GTCCCTTCCG GCTCGTGAAG
CGCGAGCCCG ACGTGCGCAC CGAACTCGAA CGCAACGAGG ACTACTGGGG CAAGGGGACC
TATCCCATGG AGGTGGCCAA GGTGGTGTTC ACCCCGGTGC GCTCGGCCGC CACCCGGGTC
GCGGCGCTGC TCTCGGGCGA GGTCGATTTC CTCCTCGACC CGCCGGTGCA GGACCTCGAG
CGCCTGTCCG CGGCCAAGGG TATCGTGGTG CGCTCCGGGC CGGAGAACCG CACGATCTTC
CTCGGCATGA ACCAGGGCGC GGCGGAGCTG CGCAGCGCGG ACGTGAAAGG CAGGAACCCC
TTCGCGGACA AGCGCGTGCG CGCGGCGATG AACATCGCGA TCAACCGCGA CGCGGTGAAG
CGCGTGGTGA TGCGCGGCCA GTCGGTGCCC GCCGGCATCG TCGCGCCGCC CTTCATCGAC
GGCTACGACA AGGCGATGGA CGTCGTGCCG GCGCCCGACG TCGCGCGCGC CAAGGCGCTG
CTCGCCGAGG CCGGCTACCC GAACGGCTTC GCGGTGACGC TGTCCTGCCC CAACGACCGC
TATGTGAACG ACGAGGCCAT CTGCCAGGCG GTGACCGGCA TGTTCGGCCA GATCGGCGTC
AAGGCGCGGC TGGACGCACG GCCCAAGAGC ATCCACTTCG CCGAGCTGCC CAAGGGCGAG
CTCGACCTCT ACATGCTCGG CTGGGGTGTG CCGACCATGG ACTCGCACTA CGTCTTCCAT
TACCTCTACG AGACCAGGAC CGACAAGGGC GGCTCGTGGA ACGTGACCGG CTATTCCAGC
GCGAAGGTGG ACGAGCTGAC CAAGGCGATG AACCGCGAGA TCGACCTCGG CAAGCGCGCC
GGCATGGTCG CCGAAGTGTG GAAGACGGTG CAGGACGACG TCGTCTACCT GCCGATCCAC
CATCAGATGC TGAACTGGGC GATGAAGGAC GACATCGACT TCCCGGTGCA GTCGGAGAAC
TATCCCTACT TCAAGCTGTT GAAGTACAGG AAGTGA
 
Protein sequence
MNKTRLALAL PFALATLASV AAAPAAAQTV RWAAAGDALT MDPHSQNEGP THVMNHQVYD 
SLVFRDQAMK LAPRLATAWK ITEDPNVWEF KLREGVKYHN GNPFTADDVV FSIQRAKHEN
SDMKGLLTSV VEVVKVDEHT VRMRTDGPNP LLPNNLTNLF IMDREWSEAN KVMLPQNYKA
GDETFAVRNA NGTGPFRLVK REPDVRTELE RNEDYWGKGT YPMEVAKVVF TPVRSAATRV
AALLSGEVDF LLDPPVQDLE RLSAAKGIVV RSGPENRTIF LGMNQGAAEL RSADVKGRNP
FADKRVRAAM NIAINRDAVK RVVMRGQSVP AGIVAPPFID GYDKAMDVVP APDVARAKAL
LAEAGYPNGF AVTLSCPNDR YVNDEAICQA VTGMFGQIGV KARLDARPKS IHFAELPKGE
LDLYMLGWGV PTMDSHYVFH YLYETRTDKG GSWNVTGYSS AKVDELTKAM NREIDLGKRA
GMVAEVWKTV QDDVVYLPIH HQMLNWAMKD DIDFPVQSEN YPYFKLLKYR K
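
The sequences above are printed in blocks of ten characters, so the spacing has to be stripped before any analysis. The following is a minimal sketch of how the gene sequence might be cleaned, its GC fraction computed, and its translation compared with the listed protein. The helper names are hypothetical. The codon assignments are those of the standard genetic code, which translation table 11 shares; table 11 differs in its permitted start codons, which does not matter here because this gene begins with ATG.

```python
from itertools import product

# Amino acids in NCBI's TCAG codon order (TTT, TTC, TTA, TTG, TCT, ...).
# "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean_sequence(dna)
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean_sequence(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First line fragment of the gene sequence above.
print(translate("ATGAACAAGA CCCGACTT"))  # prints "MNKTRL"
```

Passing the full gene sequence to `translate` and comparing the result with `clean_sequence` applied to the protein sequence would be one way to confirm that the two listings are consistent.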