Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bxe_A3028 |
Symbol | |
ID | 4001961 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia xenovorans LB400 |
Kingdom | Bacteria |
Replicon accession | NC_007951 |
Strand | + |
Start bp | 1562307 |
End bp | 1563617 |
Gene Length | 1311 bp |
Protein Length | 436 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637946613 |
Product | putative prolidase (Xaa-Pro dipeptidase) |
Protein accession | YP_558003 |
Protein GI | 91782797 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1228] Imidazolonepropionase and related amidohydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.0496396 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0296694 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGCGGC GTGTCCGGCG GTATTGGCCG GAAGCATTTC CTCATCAAGT CAGGAACATA TCAATGAGCA TTACGGTAAT CAGCGGCGGC AACGTACTCG ATCTGGCCCA GGGCGTGTTG CTCGAGCATC AGTACGTCGT GATCGAAAAC GGCCATATCG TCGAAGTCAC CGATCGTCCC GTCGATCTGC CCAACGCGCG CGTAATCGAT GCTCGCGGCA AAACGGTGAT GCCCGGGCTG ATCGATTGTC ATGTGCACGT GCTGGCTTCG CGCGCGAACC TCGGTACGAA CGCGACGCAG CCGAACATTC TCACCGCGAT TCGCGCGCTG CCGATTCTCA AGGCCATGCT CGGCCGCGGC TTTACGAGTG TGCGCGATGC GGGCGGCGCG GATTGGGCTT TGACTCAGGC GCTGGAGAGC GGTCTGATTC CGGGCCCGCG TATTTTCCCG TCGGGCAAGG CGCTCTCGCA AACGGGCGGC CACGGCGATT TCCGTCCGCG TGGCGATATG CTCGAACCGT GCTCGTGCTG CTTTCGGGCC GGCGCGATTG CGCGCGTGGT CGACGGCGTG GATGCCGTGC GGCTCGCCGT GCGCGAAGAG ATTCAGAAAG GCGCCACGCA GATCAAGATC ATGGCCTCGG GCGGTGTTGC CTCGCCGACC GATCCGATCG GCAACACGCA GTATTCCGAG GACGAAATCC GCGCGATCGT CGCCGAAGCC GAGGCGGCCA ATACCTACGT GATGGCGCAC GCGTACACGG GCCGCGCCAT CTCGCGGGCG ATTCGCTGCG GCGTGCGCAC CATCGAGCAC GGCAACCTGG TCGACGAAGC CGCCGCGAAG CTGATGCGCG AACATGGCGC GTTCGTGGTA CCGACGCTGG TCACGTACGA CGCATTGGCC AAACACGGCG CCGACTATGG TTTGCCGGCC GACTCGATCG CCAAGGTCGA AACCGTGCGG CAAGCGGGGC GCGACTCGCT ACAGATCTAT GCGAATGCCG GTGTGCCGAT GGGTTTCGGT TCCGACCTGC TCGGCGAAAT GCACACGTTC CAGAGCGACG AATTGCGTAT TCGCGCCGAC GTGCTCGGCA ATCTGGAAGC CTTGCGTTCA GCCACGACGA TCGCCGCGGA TATCGTCGAT CTGAGCGGCA AGCTGGGCGT CATCAAGGCC GGCGCGATTG CCGACATTCT CGTGGTGGAC GGCGATCCGC TCACGGACAT CGGCGTGCTG ACCGGGCAGG GCGAGCGCAT CGAGTACGTA TTTCAGCGAG GCGAAGTGGT GAGCGAGCGA GGCAGCGTCG CGTCGCGCTG A
|
Protein sequence | MKRRVRRYWP EAFPHQVRNI SMSITVISGG NVLDLAQGVL LEHQYVVIEN GHIVEVTDRP VDLPNARVID ARGKTVMPGL IDCHVHVLAS RANLGTNATQ PNILTAIRAL PILKAMLGRG FTSVRDAGGA DWALTQALES GLIPGPRIFP SGKALSQTGG HGDFRPRGDM LEPCSCCFRA GAIARVVDGV DAVRLAVREE IQKGATQIKI MASGGVASPT DPIGNTQYSE DEIRAIVAEA EAANTYVMAH AYTGRAISRA IRCGVRTIEH GNLVDEAAAK LMREHGAFVV PTLVTYDALA KHGADYGLPA DSIAKVETVR QAGRDSLQIY ANAGVPMGFG SDLLGEMHTF QSDELRIRAD VLGNLEALRS ATTIAADIVD LSGKLGVIKA GAIADILVVD GDPLTDIGVL TGQGERIEYV FQRGEVVSER GSVASR
|
| |