Gene Information |
Locus tag | Dtpsy_0666 |
Symbol | |
ID | 7384171 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidovorax ebreus TPSY |
Kingdom | Bacteria |
Replicon accession | NC_011992 |
Strand | + |
Start bp | 690685 |
End bp | 692256 |
Gene Length | 1572 bp |
Protein Length | 523 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 643653977 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_002552146 |
Protein GI | 222109882 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
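The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the final codon of the CDS is a stop codon. Both assumptions agree with the values in this record, but the record does not state them. The function names are illustrative only.

```python
# Values copied from the Gene Information table above.
START_BP = 690685
END_BP = 692256
GENE_LENGTH_BP = 1572
PROTEIN_LENGTH_AA = 523


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count of an unspliced CDS whose last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Compare the computed values with the fields reported in the record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 692256 - 690685 + 1 gives 1572 bp, and 1572 / 3 - 1 gives 523 aa, matching the Gene Length and Protein Length fields.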
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.404568 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGAAATA TGGCCCTGGC GCTTGTACTG CAAGCGCTAG CAGCTATATT TTTGGTAGCT AATGCGCAAA CCATCCGTGT GGCCAACCAG GGCGATGCGC TGTCGATGGA CCCACATGCG CTCAATGAAT CGCTGCAGCT CAGCCTGACC GGCAATGTGT ATGAGCCTCT GGTGGGACGC AACAAGGATC TGAGCCTGGT GCCTGCGCTG GCGCTGGCCT GGCGTGCGAC CACCCCCACC GTGTGGCGCT TCGAGTTGCG CCGCGGCGTG TCCTTCCACG ACGGTGCGCT TTTCACGGCC GACGATGCGG TGTTTTCGCT GCGGCGCGCG CAGTCCGAAG GCTCGGACAT GCGCAGTTAC CTGAGCGGCG TGCGCGAGGT GCGCAAGTTG GACGCGCACA CCATCGAGAT CGAAACCCGC GAGCCCACGC CGCTGCTGCC GGCCCTGCTG TCGCACGTCT ACATGATGAA CCAGCGCTGG AGCGAGGCGC AGGGTCTCGC GCAGGTAGGC GACGCGCGCG GGGCCGCGGG GCAGGCCGCC GCGCTGCGCG CCAATGGCAC GGGGCCGTTC CGCCTGGCCG AGCGTCGCCC GCAGCAGCGC ACCGTGTTCG AGCGCAACAT GCGCTACTGG GGCACTATCG AGAGCAACGC ACGCCAAGTC GTCTTCCTGC CCATTCCGGA CAACGAGGCG CGCGTGGCCG CGCTGCTGGC CGGCCGCGTG GACGTGATGG AGCCCGTGCC GGTACAGGAC ATCGAACGCG TGAGCGCCGC CGGACTGGTC CGCGTGGTCA CGGGGCCCGA GTTGCGCACG CTGTTCCTGG GCATGGACCA GCACAGCGAC GAGCTGCCGT ACGCCAGCGT GCGCGGCGCC AACCCCTTCA AGGACCGGCG CGTACGCCAG GCCTTCTACC AGGCCATCGA CATCGATGCG CTCATCAAGA ACGTGATGCG CGGCGCCGCC ACGCCCGCCG CGCTCATCGT GGGACCGGGC GTCAATGGCT TTCAGTCCGA CGTCAAGCGG CTGCCGCACG ACGTGGCGGC CGCGCGCGCG CTCATGGCGC AGGCCGGCTA TGGCGAGGGC TTCGCGCTCA CGCTGGACTG CCCCAGCGAC CGCTATGTGA ACGACGCGGC GCTGTGCACC GCCATCGCTG CACAGCTGGC GACCCTGCAG GTGCGCGTGA CGGTGCGCGC GGAGCCCAAG GCGCAGTACT ACCCGCGCAT CCTGCGACGC GACGCGGGCT TCTACCTGAT GGGCTGGACG CCCTCCACCT ACGACGCACA CGGTGCGCTG AACGCGCTGG CGGCCTGCCC GCGCGGCGAA GGCGCGGGCC ACTTCAACCT GGGGGGCTAT TGCAACCCGC GGCTGGACGC GCTGCTGCTG CAGGTCCAGA CGCAGACCGA CAAGACCCGG CGCGATGTGC TGCTGCGCGA GGCTTTGCTG CTGCAGGCCG CCGACATTGC CTACATCCCG CTGCACCAGC AGGCCCTGGC CTGGGGCGTG TCCAAGAAGA TCCGCCTGGT GCAGATGGCC GACAACACCA TGCCTTTCAA GTGGATGGGC GTGGGCCCCT GA
|
Protein sequence | MRNMALALVL QALAAIFLVA NAQTIRVANQ GDALSMDPHA LNESLQLSLT GNVYEPLVGR NKDLSLVPAL ALAWRATTPT VWRFELRRGV SFHDGALFTA DDAVFSLRRA QSEGSDMRSY LSGVREVRKL DAHTIEIETR EPTPLLPALL SHVYMMNQRW SEAQGLAQVG DARGAAGQAA ALRANGTGPF RLAERRPQQR TVFERNMRYW GTIESNARQV VFLPIPDNEA RVAALLAGRV DVMEPVPVQD IERVSAAGLV RVVTGPELRT LFLGMDQHSD ELPYASVRGA NPFKDRRVRQ AFYQAIDIDA LIKNVMRGAA TPAALIVGPG VNGFQSDVKR LPHDVAAARA LMAQAGYGEG FALTLDCPSD RYVNDAALCT AIAAQLATLQ VRVTVRAEPK AQYYPRILRR DAGFYLMGWT PSTYDAHGAL NALAACPRGE GAGHFNLGGY CNPRLDALLL QVQTQTDKTR RDVLLREALL LQAADIAYIP LHQQALAWGV SKKIRLVQMA DNTMPFKWMG VGP
|
| |
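The sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such text could be cleaned, translated, and scored for GC content. It assumes the standard codon assignments, which translation table 11 shares with table 1 for sense and stop codons. It does not implement the alternative start codons that table 11 permits, which is harmless here because this gene starts with ATG. All function names are hypothetical helpers, not part of any database tool.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon; a stop codon is emitted as '*'."""
    s = clean(seq)
    if len(s) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[s[i:i + 3]] for i in range(0, len(s), 3))


if __name__ == "__main__":
    # First three blocks of the gene sequence in this record.
    print(translate("ATGCGAAATA TGGCCCTGGC GCTTGTACTG"))
    # Last twelve bases of the gene sequence, ending in the TGA stop codon.
    print(translate("GTGGGCCCCT GA"))
```

The first call reproduces the opening block of the protein sequence (MRNMALALVL), and the second gives the final residues VGP followed by the stop marker. Passing the full gene sequence to `gc_content` would allow a comparison with the GC content field reported above.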