Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1106A_A0374 |
Symbol | |
ID | 4904010 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009078 |
Strand | - |
Start bp | 353866 |
End bp | 355146 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640143481 |
Product | hypothetical protein |
Protein accession | YP_001074417 |
Protein GI | 126455639 |
COG category | [R] General function prediction only |
COG ID | [COG3608] Predicted deacylase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.210307 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTTTGA CTCGGCGGGC ACGCCGATTT CGACGCAAGT CATTGATCGG TCGAGAAAAC GACTATCGAA CGGCACAGGT TTGGTGCGTT TCGCGACGAT CCACGTACAA TCGTTCATTC ACCGACATCG AACGACGCGC GGCGGCTGCG CCGCCGCCTC CCATCATGCA AACGCAGACC CATCCGCTGA TCTCGCCGGC CGTCGGCACG GCGCGCCACA TCACGAGTTT CCATTACGGC CCGCGCGGCG GGAAGAAGGT GTACATCCAG GCGTCGCTGC ACGCGGACGA GCTGCCCGGC ATGCTCGTCG CCACGCTGCT GCGCCGCAAG CTCGCGGCGC TCGAGGCGGC GGGCAGGCTG CGCGACGAGA TCGTCGTCGT GCCGGTCGCG AACCCGATCG GCCTCGCGCA GCACGTGTTC GGCGATCATC TCGGCCGCTT CGAGCTCGGC TCGATGCAGA ACTTCAACCG CAATTTCCAC GATCTCGCCG CGCTCGTGAT TCCGCGCATC GAAGGGCGCC TCACGCACGA CGCGGCCGCG AACCTCGCCG CCGTGCGCGG CGCGATGCGC GAGGCGCTTG CCGAGCAGAA GCCGCGCACC GAGCTCGAAT CGCAGCGGCT CGCGCTGCAG CGGCTGTCGT ATGACGCGGA CATCGTGCTC GATCTGCACT GTGACTGCGA CGCGGTGATG CACATCTACA CGAATCCGGA CCTGTGGGAC GACGTCGAGC CGCTGTCGCG CTATCTGGGC GCGAAGGCGT CGCTGCTCGC GCTGAACTCG GTCGGCAATC CGTTCGACGA AATCCACAGC TTCTGCTGGT CCGAGCTGCG CGGCCGCTTC GGCGAACGTC ATCCGATTCC GAACGGCACG ATCTCGGTGA CGGTCGAGCT GCGCAGCGAG CGCGACGTGT CGTACGAGCT CGCCGAGCAC GACGCGCAGG CGCTCGTCGA ATACCTGACG CTGCGCGGCG CGATCGACGG CACGCCCGCG CCGCAGCCGC CGCTCGAATT CGCGGCCACG CCGCTCGCGG GCACCGATCC GCTCGTCGCG CCGGTGTCGG GCGTGATCGT GTTCCACACG CCGGTCGGCG TATGGATCGA GGCGGGCCAG GACGTGGCCG ACATCGTCGA TCCGCTGACC GATCGCGTCG TCACGTTGAA GAGCAGCGTG TCCGGCGTGC TGTATGCGCG GCAGATCGCG CGCTTCGCGA CGGCCGGCAT GGAAGTCGCG CGGATCGCCG GCGCGACGCC GATCCGCACC GGATCGCTGC TGTCGGCTTG A
|
Protein sequence | MILTRRARRF RRKSLIGREN DYRTAQVWCV SRRSTYNRSF TDIERRAAAA PPPPIMQTQT HPLISPAVGT ARHITSFHYG PRGGKKVYIQ ASLHADELPG MLVATLLRRK LAALEAAGRL RDEIVVVPVA NPIGLAQHVF GDHLGRFELG SMQNFNRNFH DLAALVIPRI EGRLTHDAAA NLAAVRGAMR EALAEQKPRT ELESQRLALQ RLSYDADIVL DLHCDCDAVM HIYTNPDLWD DVEPLSRYLG AKASLLALNS VGNPFDEIHS FCWSELRGRF GERHPIPNGT ISVTVELRSE RDVSYELAEH DAQALVEYLT LRGAIDGTPA PQPPLEFAAT PLAGTDPLVA PVSGVIVFHT PVGVWIEAGQ DVADIVDPLT DRVVTLKSSV SGVLYARQIA RFATAGMEVA RIAGATPIRT GSLLSA
|
| |