Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_A1578 |
Symbol | |
ID | 4886678 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | + |
Start bp | 1509375 |
End bp | 1511129 |
Gene Length | 1755 bp |
Protein Length | 584 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640131516 |
Product | hypothetical protein |
Protein accession | YP_001062573 |
Protein GI | 126444883 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3653] N-acyl-D-aspartate/D-glutamate deacylase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.211733 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGCGCT ACGACACGAT CATCCGCAAC GGCCTATGGT TCGACGGCAC GCTCGCCGAG CCCCGCCCGC GCGAGCTCGG CATTCGCGAC GGGCGCGTCG CGGCGGTATC GGACACGCCG CTCGCCGCGG ACGGCGCGAG CGTGATCGAC GCGACCGGCA AATGGGTGAT GCCCGGCTTC ATCGACATCC ACACGCATTA CGACGCCGAG ATTCTCGTGT CGCCGGGGCT GCCCGAATCG GTGCGGCACG GCGTGACGAG CGTCTTCCTC GGCTCGTGCT CGCTGTCGAC CGTCCACGCG AACGCGCTCG ACTGCACCGA TCTCTTCAGC CGCGTCGAAG CGCTGCCGCG CGAGCAGATG CTCGCCGTGC TGTCGCGCGT GAAGACCTGG GACACGGCGG CCGCGTACGT GCGCCATCTC GAATCGCTGC CGCTCGGCCC GAACGTCGCG GCGTTCCTCG GCCATTCGGA CCTGCGCACG CACGTGCTCG GCCTCGGGCG CGCGGTGGAC GACCGCGTGC GGCCGCACGA GGCTGAGCTG CAACGGATGG AGCGGCTGCT CGACGACGCG CTCGACGCGG GCTTCGTCGG CCTGTCGTCG ATGACGACGC CGTGGGACAA GCTCGACGGC GAGCGCTACC GGTCGAAGTC GCTGCCGTCG ACGTTCGCGA CGTGGCGCGA GTACCGGCGG CTGAACCGCG TGCTGCGCCG CCGCGGGCGC GTGCTGCAAA GCGCGCCGAA CACGACCAAT CCGCTGAACG GCCTGCTCTT CATGGCCGAG AGTTGCGGCT ACTTCGTGCG CAAGCCGCTG CGCACGTCGC TGCTCGTCGC GGCCGACAGC AAGGCGGCGC CGCGCGGCAC CATCGACGTG CAGCTCGGCG GCGTGCGCGT CGCGAACGCG CTGTTTCGCG GCGAGCTCGT CTGGCAGCAT CTGCCGGTGC CGTTCGAGGT CTATGCGGAC GGCATCGATT TCGTGATCTT CGAGGAGTTC GGCGCGGGCC GCGCGGCGCT GCATCTGGCC GACGCGCTCG AGCGCAACAA GCTGCTGCAG AACGAAGGCT ACCGGCGCGA GTTCCGCCGG CAGGTCGGCA AGGGGTTCGA CCTGCGGCTC TGGACGCGCG ATCTGCACGA CACGCGGATC GTCGGCTGTC CGGATGCGTC GGTCGTCGGC AAATCGTTCG GCCAGGTCGC GAACGAGCGC GGCGTCCACC CGGCGGACGC GTTTCTCGAT CTCGTCGTCG CGCACGGGCA ACGGCTGCGC TGGTGCATGA CGATCGCGAA CCACCGCGCG GATGTGCTCG ACCGCATCGC GACCCATCCG GCGCTGCAGA TCGGCTTCGC CGATTCGGGC GCGCACCTGC GCAACATGGC GTTCTACAAC GCGCCGGTGC GCTTCCTGCG CCGCGTGCGC GAGGCCGAGC GCGCCGGCCG GCCGTTCATG TCGGTGCAGC AGGCGGTGCA TCGGCTGACG GGCGAGCTCG GCGCGTATTT CGGCGTCGAC GCCGGCACGC TGCGCGCCGG CGACCGCGCC GATATCGCGA TCGTCGATCC CGCGCATCTC GACGCGTCGG TCGACGCGTA TCACGAAGAG GACATGGCCG TGTTCGGCGG GCTGCGGCGG CTCGTGAACC GCAGCGGCGC GGCGATCGCG GCGACGCTCG TCAACGGGCA ACTCGTGTAT CGCGACGGCG CGTTCGCCGA AGGCTTCGGC GACACGCGGC GCTCGGGGCG GTTCCTGCGC GCGGCGGCGC GCTAG
|
Protein sequence | MTRYDTIIRN GLWFDGTLAE PRPRELGIRD GRVAAVSDTP LAADGASVID ATGKWVMPGF IDIHTHYDAE ILVSPGLPES VRHGVTSVFL GSCSLSTVHA NALDCTDLFS RVEALPREQM LAVLSRVKTW DTAAAYVRHL ESLPLGPNVA AFLGHSDLRT HVLGLGRAVD DRVRPHEAEL QRMERLLDDA LDAGFVGLSS MTTPWDKLDG ERYRSKSLPS TFATWREYRR LNRVLRRRGR VLQSAPNTTN PLNGLLFMAE SCGYFVRKPL RTSLLVAADS KAAPRGTIDV QLGGVRVANA LFRGELVWQH LPVPFEVYAD GIDFVIFEEF GAGRAALHLA DALERNKLLQ NEGYRREFRR QVGKGFDLRL WTRDLHDTRI VGCPDASVVG KSFGQVANER GVHPADAFLD LVVAHGQRLR WCMTIANHRA DVLDRIATHP ALQIGFADSG AHLRNMAFYN APVRFLRRVR EAERAGRPFM SVQQAVHRLT GELGAYFGVD AGTLRAGDRA DIAIVDPAHL DASVDAYHEE DMAVFGGLRR LVNRSGAAIA ATLVNGQLVY RDGAFAEGFG DTRRSGRFLR AAAR
|
| |