Gene Information |
Locus tag | Tgr7_0947 |
Symbol | |
ID | 7314978 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. HL-EbGR7 |
Kingdom | Bacteria |
Replicon accession | NC_011901 |
Strand | + |
Start bp | 1021885 |
End bp | 1023753 |
Gene Length | 1869 bp |
Protein Length | 622 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643615832 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_002513022 |
Protein GI | 220934123 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
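The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1021885
end_bp = 1023753

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1869 622
```

Under these assumptions the computed values agree with the Gene Length (1869 bp) and Protein Length (622 aa) fields.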
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGGAAACAC GTTTTGCCGC CAAAGACCTG ACCCTGATCC TTGGCCTGGT CATCCTGGGC GTGATGCTGA TCCTGCTCAT GTACCAGGTG GACCGCCAGT GGAGCCGCAT GAGCGAGATG CAGCGGGCCA TGCAGGAGCA GTCCGACGAT CTGCGTCGGC TGCGCGTGGC CCTGCAGTCC CTGGAGACCC GGGTGCGCAG CGGCGTGGCC GTGGGTGCAG GGGAGGGGGA TCAGCTGGCC GATGTCCCGC CCGCCTTCCG GCGCGCGCAA CAGGCCGCCG CCCAGCCCGA CTTCGCCCCC GGCGACTGGC TGGTGCAGGC CTTCCCCACG GGGCTGAGCA CCATCACCCC TCTGGTGTCC GCCGACGCCT ATTCCCGGGA GGTGCAGAAC TACGTGCTGG AGTCCCTGAT CACCCGGGAT CCGGACACCC TGGAATGGCA GGGCCTGATC GCCCGGGACT GGACCATCAG CGAGGACGGA CTCACCATCA CCTTCCGCAT GCGCCCCGAC GTGAACTTCT CCGACGGCGT GCCGCTCACC GCCCACGACG TGGTGTTCAC CTGGGCCTTC ATCATGAACG AGGCCATCGC CGCGCCCCGT TCCCGGGCCT ACCTGGAGAA GCTGGAGAAG GTGGAGGCCC TGGATGATCA CACCGTGGCC TTCACCTTTG CCGAGCCCTA CTTCAACAGC CTGTCCCTGG CCGGCGGCCT GGAGATCATG CCGAGGCATT TCTACGCGCG TTTTCTGGAC GATCCCGAGG CCTTCAACCG TTCCCGCGGC CTGCTGCTGG GTTCCGGCCC CTACCGGCTG TCCGACCCCG AGGGCTGGAC TCCGGACCAG GGCCTGGTGG AGCTGGTGCG CAATCCGCGC TACTGGGGGC CGGTGGATCC GCCCTTCAAC CGGGTGCTCT GGCGTGTCAT CGAGAACGAC AGCGCCCGCC TGACCACCTT CCGCAACGGC GAGATCGACG CCTACGGCGC ACGTCCCCGT GAGTACCAGC GCCTGCTGGA GGACGAACAG TTGCGCAGCC GCACCCGGCA CTTCGAGTAC ATGAGCCCCA CCGCCGGCTA CAACTACATC GGCTGGAACC AGGAGCGGGA CGGGCGTGCA ACCCGCTTCG CCGATCCTCG GGTGCGCCAG GCCATGAGCC ACCTCACGCC GATCGAGCGC ATCATCGACG AGATCATGCT GGGCTATGCG GAGCCGGCGG TCAGCCCCTT CAACCCCCGC AGCCCGCAGC ACGACACGAG CCTCGAACCC TATGCCTTCG ACATCGAGCG GGCCACGCAG CTGCTGCACG AGGCGGGCTA CCGCAGCCGC AACCGTGACG GCATCCTGGT GGACGAACAG GGCAGGCCAT TCGAGTTCGA GCTGGTGTTC TTCCAGGACA ACGAGGACAC CCGGCGCATC GTGCTGTTCC TGCGCGACAT CTACGCCCGG GCCGGTATCC TGCTGCGTCC GCGGCCCACG GAGTGGTCGG TGATGCTGGA CCTGCTCACC CGCAAGGACT TCGACGCCAT CACCCTGGGC TGGACCAGCG GCGTGGAGAC CGACATCTAC CAGATGTTCC ACTCCAGCCA GACCGTGGCC GGCGGCGACA ACTTCATCAA CTACCGCAAC CCGGAACTGG ACCGGCTCAT CGACCAGGCC CGGGCCGAGG TGGACGAGGC CGCGCGCATG GCACTCTGGC AGCAGGTCGA GCGCATCCTG TACGAGGATC AGCCCTACAC CTTCCTCATG CGCCGCCAGA CGCTCGCCTT CATCGACCAG CGCCTGCACA ACCTGCAGAT CACCAATCTG 
GGCCTGAATC TCGGTGCGGT GCCCGTGGAG ACCTATGTGC CAGCGGATAT GCAGAGGTAC ACGAGATGA
Protein sequence | METRFAAKDL TLILGLVILG VMLILLMYQV DRQWSRMSEM QRAMQEQSDD LRRLRVALQS LETRVRSGVA VGAGEGDQLA DVPPAFRRAQ QAAAQPDFAP GDWLVQAFPT GLSTITPLVS ADAYSREVQN YVLESLITRD PDTLEWQGLI ARDWTISEDG LTITFRMRPD VNFSDGVPLT AHDVVFTWAF IMNEAIAAPR SRAYLEKLEK VEALDDHTVA FTFAEPYFNS LSLAGGLEIM PRHFYARFLD DPEAFNRSRG LLLGSGPYRL SDPEGWTPDQ GLVELVRNPR YWGPVDPPFN RVLWRVIEND SARLTTFRNG EIDAYGARPR EYQRLLEDEQ LRSRTRHFEY MSPTAGYNYI GWNQERDGRA TRFADPRVRQ AMSHLTPIER IIDEIMLGYA EPAVSPFNPR SPQHDTSLEP YAFDIERATQ LLHEAGYRSR NRDGILVDEQ GRPFEFELVF FQDNEDTRRI VLFLRDIYAR AGILLRPRPT EWSVMLDLLT RKDFDAITLG WTSGVETDIY QMFHSSQTVA GGDNFINYRN PELDRLIDQA RAEVDEAARM ALWQQVERIL YEDQPYTFLM RRQTLAFIDQ RLHNLQITNL GLNLGAVPVE TYVPADMQRY TR
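The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the codon-to-amino-acid assignments of translation table 11, which match the standard genetic code. It does not handle alternative start codons, which is sufficient here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of any database tool.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
# Assumption: table 11 shares the standard code's amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGGAAACAC GTTTTGCCGC CAAAGACCTG"))  # prints: METRFAAKDL
```

The output matches the first block of the protein sequence. Passing the full gene sequence to the same function would be expected to reproduce the full 622-residue protein, with the terminal TGA read as the stop codon.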