Gene BURPS1710b_2314 details


Gene Information

Locus tag: BURPS1710b_2314
Symbol:
ID: 3689931
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1710b
Kingdom: Bacteria
Replicon accession: NC_007434
Strand:
Start bp: 2580938
End bp: 2582128
Gene Length: 1191 bp
Protein Length: 396 aa
Translation table: 11
GC content: 69%
IMG OID: 637728771
Product: 2-nitropropane dioxygenase
Protein accession: YP_333710
Protein GI: 76810674
COG category: [R] General function prediction only
COG ID: [COG2070] Dioxygenases related to 2-nitropropane dioxygenase
TIGRFAM ID:
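
The coordinate and length fields in this record are mutually consistent, which a short check can confirm. The sketch below assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon; the function names are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(2580938, 2582128)
print(length, protein_length(length))  # prints "1191 396"
```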


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCGGTT CCATTCCCTT TCCGCCGCTG ATGATCCGCG GCCGTTCGCT GTTGCCCATC 
GTGCAGGGCG GGATGGGCGT CGGCATCTCC GCGCATCGGC TCGCCGGAAG CGTCGCGCGC
GAAGGCGCGC TCGGCACGAT CGCGAGCATC GACTTGCGCC ATCACCATAC CGATCTGATC
GAGCGCTGCA AGCAGCATCC GGATCGCGAG ACGATGGAGG CGGCGAACCT CGAGGCGCTC
GCGCGCGAGA TCCAGCGCGC GAAGACGTGG GGCGAGGGGC GCGGCATGAT CGCGGTCAAC
GTGATGAAGG CGGTGCGCTC GCACGCCGAC TATGTGCGCA TCGCATGCGA GTTCGGCGCG
GACGCGATCG TGATGGGCGC GGGCTTGCCG CTCGATCTGC CGGACATGAC GCAGGGGCAC
GACATCGCGC TGATCCCGAT CCTGTCGGAC AGCCGCGGCA TCGCGCTCGT GCTGAAGAAG
TGGATGAAGA AAGGGCGTCT GCCCGATGCG ATCGTGATCG AGCATCCGGC CCGCGCGGGC
GGCCATCTCG GCGTGACGAG CCTCGACGAC ATGGACGATC CGCGCTTCGA ATTCGCGCGG
GTCATCGACG AAACCCGGCA GACGTTCGCC ACGCTCGGCC TCGAGCGCGA GCGCATCGCG
CTCGTCGTCG CGGGCGGCAT CAACAGCCAC GAGGCGGTGC GCGCGGCGCT CGCCGAAGGC
GCGAACGGCG TGCAGGTGGG CACGCCGTTC GCGGTCACCG AGGAGGGCGA TGCGCATCCG
AACTTCAAGC GCGTGCTCGC GAACGCGAAG CCGGACGACA TCGTCGAGTT CTTGAGCGTC
ACGGGGCTGC CGGCGCGCGC GGTGAAGACG CCGTGGCTCG AGCGTTATCT GCGGCACGAG
ACGCGCATTC GCGCGAAGAT CGGCGCGCTC AAGCAGCGCT GCCCGTCGGC GCTCGAATGC
CTGAGTGTGT GCGGCTTGCG CGACGGCATC GAGCGCTTCG GCCACTTCTG CATCGATACG
CGCCTGGCCG CCGCGCTGCG CGGCGACGTC GCGAACGGGC TGTTCTTCCG CGGCCGCGAA
GCGCTGCCGT TCGGGCAGGC GATTCGCAGC GTGCGCGATC TGCTCGAGCT GCTGCTCACG
GGCACCGCAC CCGAAGCTGC GGCAAACCGT CCCACTTTCT CGTTGTCGTA A
 
Protein sequence
MTGSIPFPPL MIRGRSLLPI VQGGMGVGIS AHRLAGSVAR EGALGTIASI DLRHHHTDLI 
ERCKQHPDRE TMEAANLEAL AREIQRAKTW GEGRGMIAVN VMKAVRSHAD YVRIACEFGA
DAIVMGAGLP LDLPDMTQGH DIALIPILSD SRGIALVLKK WMKKGRLPDA IVIEHPARAG
GHLGVTSLDD MDDPRFEFAR VIDETRQTFA TLGLERERIA LVVAGGINSH EAVRAALAEG
ANGVQVGTPF AVTEEGDAHP NFKRVLANAK PDDIVEFLSV TGLPARAVKT PWLERYLRHE
TRIRAKIGAL KQRCPSALEC LSVCGLRDGI ERFGHFCIDT RLAAALRGDV ANGLFFRGRE
ALPFGQAIRS VRDLLELLLT GTAPEAAANR PTFSLS
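
The protein sequence can be checked against the gene sequence by translating it codon by codon. The following is a minimal sketch, not the database's own method. It assumes the sequence is given 5' to 3' on the coding strand, strips the spacing used in the 10-base blocks, and uses the codon assignments of NCBI translation table 11, which match the standard code for every codon. Table 11's alternative start codons are not handled here, which does not matter for this gene because it starts with ATG. The names `CODON_TABLE`, `clean`, and `translate` are illustrative.

```python
# Codon table in NCBI layout: bases ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":  # stop codon ends the protein
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence listed in this record.
print(translate("ATGACCGGTT CCATTCCCTT TCCGCCGCTG"))  # prints "MTGSIPFPPL"
```

Passing the full gene sequence to `translate` and comparing the result with the cleaned protein sequence gives a complete consistency check.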