Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_2642 |
Symbol | |
ID | 7873383 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 2851735 |
End bp | 2854062 |
Gene Length | 2328 bp |
Protein Length | 775 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643699565 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_002889621 |
Protein GI | 237653307 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.295666 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCCCGTGC GCCGGTTCTC CTGCCCTCTC CGGACCCCCG GCCGCCCCGG CCCGATCCGC CCTGCGCACC CGCGCAGGGC GGGTGCGTCG TTACGTTCCG CGGCTGCGCT GTTGATCGCC TGCCTGGCAC TCGCGGCCTG CGGCCCGGTG TGGAACGACC CCTATCCGGT GGCCGAGCGC GGGGAACACA TCCTGTACAC CGCCTTCACC GAGCGCCCCA AGCATCTCGA TCCGGTGCAG TCCTACAGCG AGGACGAGGC GAGCTTCCTG TACCAGATCG TCGAGCCGCC GCTGCAGTAC CACTACCTGA AGCGCCCCTA TGTGCTCGAG CCCGCCACCG CCGAGGCCAT GCCGAGCCTG CGCCGGCTCG ACGCTGGCGG CCGCGAGCTG CCCGCGAACG CCGACCCGGC CAAGGTGGTG CGTACCGTGG TCGAGGTGCG TATCCGCCCG GGCATCCTGT ACCAGCCCCA CCCGGCCTTC GCCCTCGATG AGGATGGCAG CCCGCGCTAT CTCGGCCTTG GCGAGGCCGA CCTGCGCGCA GTGCGGGGTC TCGGCGACTT CGCCGCGACC GGCACGCGCG AGCTCGAGGC GCGCGACTAC GTCTATCAGA TCAAGCGTCT CGCCCATCCG CGCCTGCACT CGCCGATCTT CGAGCTCATG GCGGAGTATC TGCCCGGGCT CCAGGAGCTG CAGGCCGGGC TGGTTGCGGC AACCAAGGAA GGCAGGGCGA GCGGTGCCGC GGCGAAGGAC GACCCGCCCG GCTGGCTGGA TCTCGACCGC TTCGCGCTGC CCGGCGTGGA GGTGGTGGAC CGCCACACCT TGCGCATCAC CCTGCAGGGC GCCTATCCGC AGTTCCTGTA TTGGCTTTCG ATGCCCTTCT TCAGCCCGGT GCCGCGCGAG GTGGACCGCT TCTTCGCCCA GCCCGGCATG GCCGAGCGCA ACCTGACGCT GGACTGGTGG CCGGTGGGCA CCGGGCCGTA CATGCTGGTC GAGAACAACC CCAACGCGCG CATGGTGCTT GCGCGCAATC CGAACTACCG CGGCGACGCC TATCCCTGCG CGGGCGAGTC CGGCGATGCC GAGGCTGGGT TGCTCGGCGA CTGCGGCAAG CCCATGCCAT TCATCGACAA GGTGGTGTTC TCGCGCGAGC GCGAGGGCAT CCCATACTGG AACAAGTTCC TGCAAGGCTA TTACGACGCC TCGGGCGTGT CTTCGGACAA CTTCGACCAG GCCGTCAGTC TCACCAGCCA GGGCGAGGTG ACGCTCTCGG ACGACATGCG CGACAAGGGC ATCCGCCTGC TCACCTCGGT GTCGCCGTCG ATCTTCTACC TCGGCTTCAA CATGCTCGAC CCGCTCGTGG GCGGCGGTGC CTCGAAGGCG GAGAAGGAGC GCGCGCGCAA GCTGCGCCAG GCGATCTCGA TCGCGCTCGA CATGGAGGAG TTCGTGTCGA TCTTCCTCAA CGGCCGCGGC CTGCCCGGCA TGAGCCCCTT GCCGCCGGGC ATCTTCGGCG CGCGCGCGGG ACGGGCGGGG ATGAACCCGG TGGTGTACGA GTGGCAGGGC AGCGAGGCGG ACGGCCGTCC CGTGCGCCGT GGCGTCGACG AGGCGCGCCG CCTGCTCGCC GAGGCCGGCT GGCCAAATGG ACGCGATGCG CAGACCGGCG AGCCGCTGGT GATCCACCTC GACACCACGC CCGGCGGGCT GGGCGACAAG GCGCGCTCGG ACTGGCTGGC GAAGAAGTTC CGCGCGCTGG GCGTGCAGTT CGTCGTGCGG CCGACCGACT TCAACCGCTT CCAGGACAAG ATCCGCCAGG GTAACGTGCA GCTCTTCTTC TTCGGCTGGA ACGCAGACTA CCCCGATCCG GAGAACCTGC TCTTCCTGCT CCACGGCCCG CAGGGCAAGG TGAAGTTCAA CGGCCAGAAC GCGGCCAACT ACGAGAACCC GGAATACGAC GCGCTGTTCG AGCGCATGAA GGCCATGCCC GACGGCCCCG CGCGCCAGCA GATCATCGAC CGCATGGTGG CGATGCTGCA GCACGACGCG CCGTGGATCT TCGCCTTCCA CCCGATGTCG TATTCGCTGC AGCATGGCTG GGTGCTCAAC CGCAAGACCG GGGCCATGGT GCGCAACACC ATGAAGTACC AGCGCATCGA CCTCGAGCGC CGTGCCGCTG CGCGCACGGA GTGGAACGCG CCGGTGACCT GGCCGCTGCT GCTGGTGGCG GCGCTGCTCG CCGCGCTGGT GGCGCCGGCG GCGATCCACT GGCGCCGGCG CGAGACCGCG ACCGCGGCGC CGGGCGCGGC CGGGTCCGGC GCGCGAGGAG GGCGCTGA
|
Protein sequence | MPVRRFSCPL RTPGRPGPIR PAHPRRAGAS LRSAAALLIA CLALAACGPV WNDPYPVAER GEHILYTAFT ERPKHLDPVQ SYSEDEASFL YQIVEPPLQY HYLKRPYVLE PATAEAMPSL RRLDAGGREL PANADPAKVV RTVVEVRIRP GILYQPHPAF ALDEDGSPRY LGLGEADLRA VRGLGDFAAT GTRELEARDY VYQIKRLAHP RLHSPIFELM AEYLPGLQEL QAGLVAATKE GRASGAAAKD DPPGWLDLDR FALPGVEVVD RHTLRITLQG AYPQFLYWLS MPFFSPVPRE VDRFFAQPGM AERNLTLDWW PVGTGPYMLV ENNPNARMVL ARNPNYRGDA YPCAGESGDA EAGLLGDCGK PMPFIDKVVF SREREGIPYW NKFLQGYYDA SGVSSDNFDQ AVSLTSQGEV TLSDDMRDKG IRLLTSVSPS IFYLGFNMLD PLVGGGASKA EKERARKLRQ AISIALDMEE FVSIFLNGRG LPGMSPLPPG IFGARAGRAG MNPVVYEWQG SEADGRPVRR GVDEARRLLA EAGWPNGRDA QTGEPLVIHL DTTPGGLGDK ARSDWLAKKF RALGVQFVVR PTDFNRFQDK IRQGNVQLFF FGWNADYPDP ENLLFLLHGP QGKVKFNGQN AANYENPEYD ALFERMKAMP DGPARQQIID RMVAMLQHDA PWIFAFHPMS YSLQHGWVLN RKTGAMVRNT MKYQRIDLER RAAARTEWNA PVTWPLLLVA ALLAALVAPA AIHWRRRETA TAAPGAAGSG ARGGR
|
| |