Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1106A_3913 |
Symbol | |
ID | 4901754 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009076 |
Strand | - |
Start bp | 3819397 |
End bp | 3820497 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 640137139 |
Product | 2-nitropropane dioxygenase family oxidoreductase |
Protein accession | YP_001068133 |
Protein GI | 126453588 |
COG category | [R] General function prediction only |
COG ID | [COG2070] Dioxygenases related to 2-nitropropane dioxygenase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.841992 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTTCCC GTATCGCCCC GACTCCGTTC GCCGCCCGGT TCGACTTGCG CCTGCCGCTC GTGCAGGCGC CGATGGTCGG CGCGACGACG CCCGCGCTCG TCGCGGCCGC CTCCAACGCC GGTGCGCTCG GCAGCCTCGG CGGCGCGTCG TTCGCGCCGG AGAAGCTTGC CGCCGAAATC GCCGCGGTGC GTGCCGCGAC GCGCCGCGCG TTCGCCGTGA ACCTGTTCGT GCTGCCCGAC GCGCAGCCGG ACGACGCGGC CGTGCGTCGC GCGCTCGACG CGATCGATCC GCTGCGCGCG CGGTTCGGGT TGCCGCCCGG CGCGCCGCTG CCGCGCTACG CGCCGGATTT CCGCGCGCAA CTCGATGCGC TCGTCGACGC GCGCGTGCCG GTCGCGAGCT TCACGTTCGG CGTGCTCGAC AAGAAAGATG TCGTCCGGCT GCAGGCGGCG GGCACGTATG TGATCGGCAC GGCAACGCAT GTCGCCGAGG GCCTCGCGTG GCAGGCGGCG GGCGCCGACG CGATCTGCGC GCAAGGCGCG GAAGCGGGCG GCCATCGCGG CACGTTCATC GGTTCGGCCG AAGACGCGCT CGTCGGCACG ATCGCGCTCG TGCCGCAGCT CGTCGACGCG ACGAATCTGC CGGTGCTCGC GGCGGGCGGC ATCATGGACG GGCGTGGGAT CGCCGCCGCG CTCGCGCTCG GCGCGCAAGC CGCGCAGCTC GGCACCGCGT TTCTCACGTG CGCGGAAAGC GCGATTCCCG CGTGCTGGAA AGCGCGTCTG CTCGCGAGCG ACGATACGTC GACGTCCGTC ACGCGCGCGA TCACGGGCCG CCACGCGCGC GGCATCCGCA ATGCGCTGAT GGCGCAGCTG GCCGGACGGC CCGATTCGGT CGCGCCGTAT CCGGTGCAAA ACGCGCTGAC GCAGGAGCTG CGGCAAACCG CCGCGCGAGC GGGCGACGCC GAGTACTTGT CGTTGTGGTC CGGGCAAGGC GCGCCGCTCG GCAAGCACCG CGATGGCGCG CAAACCACCG CGCAATTGAT CGACGCGCTC GACGCCGAAT GGCGCGCTGC GCTTTCGCGC TCCGTTATTT CCCTGGTCTG A
|
Protein sequence | MTSRIAPTPF AARFDLRLPL VQAPMVGATT PALVAAASNA GALGSLGGAS FAPEKLAAEI AAVRAATRRA FAVNLFVLPD AQPDDAAVRR ALDAIDPLRA RFGLPPGAPL PRYAPDFRAQ LDALVDARVP VASFTFGVLD KKDVVRLQAA GTYVIGTATH VAEGLAWQAA GADAICAQGA EAGGHRGTFI GSAEDALVGT IALVPQLVDA TNLPVLAAGG IMDGRGIAAA LALGAQAAQL GTAFLTCAES AIPACWKARL LASDDTSTSV TRAITGRHAR GIRNALMAQL AGRPDSVAPY PVQNALTQEL RQTAARAGDA EYLSLWSGQG APLGKHRDGA QTTAQLIDAL DAEWRAALSR SVISLV
|
| |