Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Xaut_4660 |
Symbol | |
ID | 5420766 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Xanthobacter autotrophicus Py2 |
Kingdom | Bacteria |
Replicon accession | NC_009720 |
Strand | - |
Start bp | 5150891 |
End bp | 5152513 |
Gene Length | 1623 bp |
Protein Length | 540 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640883924 |
Product | extracellular solute-binding protein |
Protein accession | YP_001419537 |
Protein GI | 154248579 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.845725 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.101671 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACCGCC GCGAATTCGT GAAGTCTGCG TCTGCCACCG CCGTTGCCAC CGGAACCGGT GTCGCCGCCC CGGCGGTCTT CTCCTCGGCA CAGGCGCAGG CGCGCAACGA GACGCTGCTG ATCGTCTCCG AGAGCGGCCC CAACAACCTC GACATCCATG GTGTCGGCAC CAACGTGCCG GGCTATGAGG CGAGCTGGAA CACCTACGAC CGCCTCATCA CCCACGAGAT GACCGAGAAG GACGGGGTCC GCTACTACGA CCGCGACAAG CTGAAGGGCG AGCTCGCCGA GGACATGAAC ATCGGCGACA TGTCGGTGAC CTTCAAGCTG AAGAAGAACG CCACCTTCCA GGACGGCACC CCGGTCACCG CCAAGGACGT GAAGTGGTCG CTGGACCGCG CCGTCTCCGT GGGCGGCTTC CCCACCTTCC AGATGAAGGC CGGCTCGCTG GAGAAGCCCG AGCAGTTCGT GGTGGTGGAT GACCACACGG TGCGCGTGGA CTTCATCCGC AAGGACCGCC TCACCATCCC CGATCTCGCC GTGATCGTGC CCTGCGTCAT CAATTCCGGG CTGGTGCAGA AGAACGCCAC CGAAAAGGAC CCCTGGGGCC TCGAATACAC CAAGCAGAAC ACCGCCGGCT CCGGCGCCTA TCGCGTCACC AAGTGGACCC CCGGCACCGA GGTGATCTTC GAGCGCTTCG AGGACTGGAA GGGCGGCCCG CTGCCCAAGA TCAAGCGCGT GATCTGGCGC ATGGTGCCCT CCGCCGGCAA CCGCCGGGCG CTGCTGGAGC GCGGCGACGC CGACATCTCC TACGACCTGC CCAACAAGGA TTTCGTGGAG CTGAAGCAGG CCGGCAAGCT GAACATCACG TCGGTGCCCT ATTCCAACGG TGTCCAGTAC ATCGGCATGA ACGTGAAGAA CCCGCCCTTC GACAATCTGA AGGTGCGCCA GGCCATCGCT TACGCCATCC CCTACCAGAA GATCATGGAC GCCGCCCTGT TCGGCCTCGC CAAGCCCATG TTCGGCGCCC CGGCGGATGC GCAGACCCAG GTCAAGTGGC CGCAGCCCAC CAAGTTCGTC ACCGACCTCG CCAAGGCCAA GCAATTGCTG GCGGAGGCGG GCTATCCCGA CGGGCTGGAG ACGACGCTGT CCTTCGACCT CGGCTTTGCC GGCGTGAACG AGCCGCTGTG CGTGCTGCTG CAGGAAAACC TGGCGCAGAT CGGCATCAAG ACCACCATCA ACAAGATCCC CGGCGCCAAC TGGCGCACCG AGCTGACGAA GAAGGTGCTG CCGCTGTTCA CCAACGTGTT CTCGGGTTGG CTGGACTATC CCGAATACTT CTTCTTCTGG TGCTACCACG GCAACAATTC GATCTTCAAC ACCATGAGCT ACCAGTCGGC GGCCATGGAT GCCTTCATCG ACGGCGCCCG TGCCGCCGCC GCCAACGGCG ACAAGGCGGC CTATGATGCG GACGTGAAGG GCATGGTGGA CCTCGCCTTC GCCGACGTGC CGCGTATCCC GCTCTACCAG CCCTATGTGA ACGTGGCGAT GCAGAAGAAC ATCACCGGCT ACGAATACTG GTTCCACCGC CGTCTCGACT ATCGCGCTTT CCAGAAGGGG TGA
|
Protein sequence | MNRREFVKSA SATAVATGTG VAAPAVFSSA QAQARNETLL IVSESGPNNL DIHGVGTNVP GYEASWNTYD RLITHEMTEK DGVRYYDRDK LKGELAEDMN IGDMSVTFKL KKNATFQDGT PVTAKDVKWS LDRAVSVGGF PTFQMKAGSL EKPEQFVVVD DHTVRVDFIR KDRLTIPDLA VIVPCVINSG LVQKNATEKD PWGLEYTKQN TAGSGAYRVT KWTPGTEVIF ERFEDWKGGP LPKIKRVIWR MVPSAGNRRA LLERGDADIS YDLPNKDFVE LKQAGKLNIT SVPYSNGVQY IGMNVKNPPF DNLKVRQAIA YAIPYQKIMD AALFGLAKPM FGAPADAQTQ VKWPQPTKFV TDLAKAKQLL AEAGYPDGLE TTLSFDLGFA GVNEPLCVLL QENLAQIGIK TTINKIPGAN WRTELTKKVL PLFTNVFSGW LDYPEYFFFW CYHGNNSIFN TMSYQSAAMD AFIDGARAAA ANGDKAAYDA DVKGMVDLAF ADVPRIPLYQ PYVNVAMQKN ITGYEYWFHR RLDYRAFQKG
|
| |