Gene Tmz1t_2642 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_2642 
Symbol 
ID7873383 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp2851735 
End bp2854062 
Gene Length2328 bp 
Protein Length775 aa 
Translation table11 
GC content70% 
IMG OID643699565 
Productextracellular solute-binding protein family 5 
Protein accessionYP_002889621 
Protein GI237653307 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.295666 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCCCGTGC GCCGGTTCTC CTGCCCTCTC CGGACCCCCG GCCGCCCCGG CCCGATCCGC 
CCTGCGCACC CGCGCAGGGC GGGTGCGTCG TTACGTTCCG CGGCTGCGCT GTTGATCGCC
TGCCTGGCAC TCGCGGCCTG CGGCCCGGTG TGGAACGACC CCTATCCGGT GGCCGAGCGC
GGGGAACACA TCCTGTACAC CGCCTTCACC GAGCGCCCCA AGCATCTCGA TCCGGTGCAG
TCCTACAGCG AGGACGAGGC GAGCTTCCTG TACCAGATCG TCGAGCCGCC GCTGCAGTAC
CACTACCTGA AGCGCCCCTA TGTGCTCGAG CCCGCCACCG CCGAGGCCAT GCCGAGCCTG
CGCCGGCTCG ACGCTGGCGG CCGCGAGCTG CCCGCGAACG CCGACCCGGC CAAGGTGGTG
CGTACCGTGG TCGAGGTGCG TATCCGCCCG GGCATCCTGT ACCAGCCCCA CCCGGCCTTC
GCCCTCGATG AGGATGGCAG CCCGCGCTAT CTCGGCCTTG GCGAGGCCGA CCTGCGCGCA
GTGCGGGGTC TCGGCGACTT CGCCGCGACC GGCACGCGCG AGCTCGAGGC GCGCGACTAC
GTCTATCAGA TCAAGCGTCT CGCCCATCCG CGCCTGCACT CGCCGATCTT CGAGCTCATG
GCGGAGTATC TGCCCGGGCT CCAGGAGCTG CAGGCCGGGC TGGTTGCGGC AACCAAGGAA
GGCAGGGCGA GCGGTGCCGC GGCGAAGGAC GACCCGCCCG GCTGGCTGGA TCTCGACCGC
TTCGCGCTGC CCGGCGTGGA GGTGGTGGAC CGCCACACCT TGCGCATCAC CCTGCAGGGC
GCCTATCCGC AGTTCCTGTA TTGGCTTTCG ATGCCCTTCT TCAGCCCGGT GCCGCGCGAG
GTGGACCGCT TCTTCGCCCA GCCCGGCATG GCCGAGCGCA ACCTGACGCT GGACTGGTGG
CCGGTGGGCA CCGGGCCGTA CATGCTGGTC GAGAACAACC CCAACGCGCG CATGGTGCTT
GCGCGCAATC CGAACTACCG CGGCGACGCC TATCCCTGCG CGGGCGAGTC CGGCGATGCC
GAGGCTGGGT TGCTCGGCGA CTGCGGCAAG CCCATGCCAT TCATCGACAA GGTGGTGTTC
TCGCGCGAGC GCGAGGGCAT CCCATACTGG AACAAGTTCC TGCAAGGCTA TTACGACGCC
TCGGGCGTGT CTTCGGACAA CTTCGACCAG GCCGTCAGTC TCACCAGCCA GGGCGAGGTG
ACGCTCTCGG ACGACATGCG CGACAAGGGC ATCCGCCTGC TCACCTCGGT GTCGCCGTCG
ATCTTCTACC TCGGCTTCAA CATGCTCGAC CCGCTCGTGG GCGGCGGTGC CTCGAAGGCG
GAGAAGGAGC GCGCGCGCAA GCTGCGCCAG GCGATCTCGA TCGCGCTCGA CATGGAGGAG
TTCGTGTCGA TCTTCCTCAA CGGCCGCGGC CTGCCCGGCA TGAGCCCCTT GCCGCCGGGC
ATCTTCGGCG CGCGCGCGGG ACGGGCGGGG ATGAACCCGG TGGTGTACGA GTGGCAGGGC
AGCGAGGCGG ACGGCCGTCC CGTGCGCCGT GGCGTCGACG AGGCGCGCCG CCTGCTCGCC
GAGGCCGGCT GGCCAAATGG ACGCGATGCG CAGACCGGCG AGCCGCTGGT GATCCACCTC
GACACCACGC CCGGCGGGCT GGGCGACAAG GCGCGCTCGG ACTGGCTGGC GAAGAAGTTC
CGCGCGCTGG GCGTGCAGTT CGTCGTGCGG CCGACCGACT TCAACCGCTT CCAGGACAAG
ATCCGCCAGG GTAACGTGCA GCTCTTCTTC TTCGGCTGGA ACGCAGACTA CCCCGATCCG
GAGAACCTGC TCTTCCTGCT CCACGGCCCG CAGGGCAAGG TGAAGTTCAA CGGCCAGAAC
GCGGCCAACT ACGAGAACCC GGAATACGAC GCGCTGTTCG AGCGCATGAA GGCCATGCCC
GACGGCCCCG CGCGCCAGCA GATCATCGAC CGCATGGTGG CGATGCTGCA GCACGACGCG
CCGTGGATCT TCGCCTTCCA CCCGATGTCG TATTCGCTGC AGCATGGCTG GGTGCTCAAC
CGCAAGACCG GGGCCATGGT GCGCAACACC ATGAAGTACC AGCGCATCGA CCTCGAGCGC
CGTGCCGCTG CGCGCACGGA GTGGAACGCG CCGGTGACCT GGCCGCTGCT GCTGGTGGCG
GCGCTGCTCG CCGCGCTGGT GGCGCCGGCG GCGATCCACT GGCGCCGGCG CGAGACCGCG
ACCGCGGCGC CGGGCGCGGC CGGGTCCGGC GCGCGAGGAG GGCGCTGA
 
Protein sequence
MPVRRFSCPL RTPGRPGPIR PAHPRRAGAS LRSAAALLIA CLALAACGPV WNDPYPVAER 
GEHILYTAFT ERPKHLDPVQ SYSEDEASFL YQIVEPPLQY HYLKRPYVLE PATAEAMPSL
RRLDAGGREL PANADPAKVV RTVVEVRIRP GILYQPHPAF ALDEDGSPRY LGLGEADLRA
VRGLGDFAAT GTRELEARDY VYQIKRLAHP RLHSPIFELM AEYLPGLQEL QAGLVAATKE
GRASGAAAKD DPPGWLDLDR FALPGVEVVD RHTLRITLQG AYPQFLYWLS MPFFSPVPRE
VDRFFAQPGM AERNLTLDWW PVGTGPYMLV ENNPNARMVL ARNPNYRGDA YPCAGESGDA
EAGLLGDCGK PMPFIDKVVF SREREGIPYW NKFLQGYYDA SGVSSDNFDQ AVSLTSQGEV
TLSDDMRDKG IRLLTSVSPS IFYLGFNMLD PLVGGGASKA EKERARKLRQ AISIALDMEE
FVSIFLNGRG LPGMSPLPPG IFGARAGRAG MNPVVYEWQG SEADGRPVRR GVDEARRLLA
EAGWPNGRDA QTGEPLVIHL DTTPGGLGDK ARSDWLAKKF RALGVQFVVR PTDFNRFQDK
IRQGNVQLFF FGWNADYPDP ENLLFLLHGP QGKVKFNGQN AANYENPEYD ALFERMKAMP
DGPARQQIID RMVAMLQHDA PWIFAFHPMS YSLQHGWVLN RKTGAMVRNT MKYQRIDLER
RAAARTEWNA PVTWPLLLVA ALLAALVAPA AIHWRRRETA TAAPGAAGSG ARGGR