Gene BURPS1106A_2183 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_2183 
Symbol 
ID4901982 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp2168312 
End bp2169502 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content69% 
IMG OID640135412 
Product2-nitropropane dioxygenase family oxidoreductase 
Protein accessionYP_001066447 
Protein GI126455445 
COG category[R] General function prediction only 
COG ID[COG2070] Dioxygenases related to 2-nitropropane dioxygenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCGGTT CCATTCCCTT TCCGCCGCTG ATGATCCGCG GCCGTTCGCT GTTGCCCATC 
GTGCAGGGCG GGATGGGCGT CGGCATCTCC GCGCATCGGC TCGCCGGAAG CGTCGCGCGC
GAAGGCGCGC TCGGCACGAT CGCGAGCATC GACTTGCGCC ATCACCATAC CGATCTGATC
GAGCGCTGCA AGCGGCATCC GGATCGCGAG ACGATGGAGG CGGCGAACCT CGAGGCGCTC
GCGCGCGAGA TCCAGCGCGC GAAGACGTGG GGCGAGGGGC GCGGCATGAT CGCGGTCAAC
GTGATGAAGG CGGTGCGCTC GCACGCCGAC TATGTGCGCA TCGCATGCGA GTTCGGCGCG
GACGCGATCG TGATGGGCGC GGGCTTGCCG CTCGATCTGC CGGACATGAC GCAGGGGCAC
GACATCGCGC TGATCCCGAT CCTGTCGGAC AGCCGCGGCA TCGCGCTCGT GCTGAAGAAG
TGGATGAAGA AAGGGCGTCT GCCCGATGCG ATCGTGATCG AGCATCCGGC CCGCGCGGGC
GGCCATCTCG GCGTGACGAG CCTCGACGAC ATGGACGATC CGCGCTTCGA ATTCGCGCGG
GTCATCGACG AAACCCGGCA GACGTTCGCC ACGCTCGGCC TCGAGCGCGA GCGCATCGCG
CTCGTCGTCG CGGGCGGCAT CAACAGCCAC GAGGCGGTGC GCGCGGCGCT CGCCGAAGGC
GCGAACGGCG TGCAGGTGGG CACGCCGTTC GCGGTCACCG AGGAGGGCGA TGCGCATCCG
AACTTCAAGC GCGTGCTCGC GAACGCGAAG CCGGACGACA TCGTCGAGTT CTTGAGCGTC
ACGGGGCTGC CGGCGCGCGC GGTGAAGACG CCGTGGCTCG AGCGTTATCT GCGGCACGAG
ACGCGCATTC GAGCGAAGAT CGGCGCGCTC AAGCAGCGCT GCCCGTCGGC GCTCGAATGC
CTGAGTGTGT GCGGCTTGCG CGACGGCATC GAGCGCTTCG GCCACTTCTG CATCGATACG
CGCCTGGCCG CCGCGCTGCG CGGCGACGTC GCGAACGGGC TGTTCTTCCG CGGCCGCGAA
GCGCTGCCGT TCGGGCAGGC GATTCGCAGC GTGCGCGATC TGCTCGAGCT GCTGCTCACG
GGCACCGCGC CCGAAGCTGC GGCAAACCGT CCCACTTTCT CGTTGTCGTA A
 
Protein sequence
MTGSIPFPPL MIRGRSLLPI VQGGMGVGIS AHRLAGSVAR EGALGTIASI DLRHHHTDLI 
ERCKRHPDRE TMEAANLEAL AREIQRAKTW GEGRGMIAVN VMKAVRSHAD YVRIACEFGA
DAIVMGAGLP LDLPDMTQGH DIALIPILSD SRGIALVLKK WMKKGRLPDA IVIEHPARAG
GHLGVTSLDD MDDPRFEFAR VIDETRQTFA TLGLERERIA LVVAGGINSH EAVRAALAEG
ANGVQVGTPF AVTEEGDAHP NFKRVLANAK PDDIVEFLSV TGLPARAVKT PWLERYLRHE
TRIRAKIGAL KQRCPSALEC LSVCGLRDGI ERFGHFCIDT RLAAALRGDV ANGLFFRGRE
ALPFGQAIRS VRDLLELLLT GTAPEAAANR PTFSLS