Gene Bpro_4589 details


Gene Information

Locus tag: Bpro_4589
Symbol:
ID: 4012747
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Polaromonas sp. JS666
Kingdom: Bacteria
Replicon accession: NC_007948
Strand:
Start bp: 4862445
End bp: 4863713
Gene Length: 1269 bp
Protein Length: 422 aa
Translation table: 11
GC content: 65%
IMG OID: 637944238
Product: 2-nitropropane dioxygenase, NPD
Protein accession: YP_551370
Protein GI: 91790418
COG category: [R] General function prediction only
COG ID: [COG2070] Dioxygenases related to 2-nitropropane dioxygenase
TIGRFAM ID:
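
The coordinate and length fields above are consistent with each other, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is an untranslated stop codon; neither assumption is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the gene record.
start_bp = 4862445
end_bp = 4863713

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1269
print(protein_length)  # 422
```

Both results match the Gene Length (1269 bp) and Protein Length (422 aa) fields.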


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACAAAT CTTTTCCAGC TGACCCGGCC GAGACGGCCG GTGTCGACGC AGCGGCGGTG 
GCTGCCGCTT TGCAGGCGCT TTCGCGCGAT TCGGGCCTGC GGCCCTGGCA GCTTGGCAAC
AAGAGCCTGA TTCCCGTGGT GCAGGGCGGC ATGGGCGTGG GTGTTTCTGC CGGCGGGCTG
GCTGGTGCCG TGGCGCGCCT GGGCGGGGTG GGCACGCTTT CCTCGGTGGA TCTGCGGCGC
CTGCACCCCG ACCTGATGGC CCGGACCGGC CACCTTGACA AGGAGCCCGA CGCGCGCCAG
CTGATTGATG CGGCCAATCT GCAGGCCCTG GACCGCGAAA TCCGCCGGGC CCGAGAACTC
GCGCAGGGCC GCGGTTTGAT TGCCGTCAAC GTGATGAAGG CGCTCAGTGC CTACGAGGCC
TGTGTGCGCC AGGCGCTTGC CAGCGGGGCC GATGCGCTGG TGGTGGGTGC CGGCCTGCCG
CTTGATTTGC CGGAACTGGC GCGTGACTAT CCCAAAACCG CGTTGATCCC CATTCTTTCC
GATGCACGCG GCGTGCAATT GCTGGTGCGC AAGTGGCAAA AGAAAGGGCG CTTGCCCGAT
GCCATCGTGA TTGAGCACCC CGGCCTGGCA GGCGGCCATC TGGGCGCCGC GAAAGTTGCC
GATCTGCACG ATGCGCGTTT CGATTTCGAG GTGGTCATCC CGCAGGTGCT GGAGTTTTTC
AAGACCGCGG GCATCGAGCG CGAGATTCCG CTGATCGCGG CCGGGGGCAT CAATTGCCGG
GACGACATCC TGCGCCTGCA GGCCCTGGGT GCCTCGGCTG TGCAGCTGGG CACGGCGTTT
GCTGTCACCC TGGAGTGTGA TGCCGATCCG GCTTTCAAAA AGGTGCTGGC CGACGCAAAG
CCCGAAGACC TCATTGAGTT CATCAGCGTG GCTGGCCTGC CGGCCCGTGC GGTGCGCACG
CCCTGGCTGG ACAAATACAT CCGGCTGGAG CCCAGGCTCA AGGCCGTGGC GCATGTCAAG
GCCAAATGCA ACATGTCGTT CGATTGCCTG TCCCACTGCG GGCTGCGCGA TGGCGATGCC
AGCATAGGCC AGTTCTGCAT CGACAAGCAG CTCGGCCATG CGCTGGATGG CGACATTCAC
AAAGGCCTTT TTTTCCGGGG CGCCGGCCAC CTGCCGTTTG GCAACCAGAT CCGTTCGGTG
CAAGAATTGC TGCAGTGGCT GCTGGGCGGC ATTCGCCCCG CAGCGCTTAC CACCGGAGCG
CAGACATGA
 
Protein sequence
MDKSFPADPA ETAGVDAAAV AAALQALSRD SGLRPWQLGN KSLIPVVQGG MGVGVSAGGL 
AGAVARLGGV GTLSSVDLRR LHPDLMARTG HLDKEPDARQ LIDAANLQAL DREIRRAREL
AQGRGLIAVN VMKALSAYEA CVRQALASGA DALVVGAGLP LDLPELARDY PKTALIPILS
DARGVQLLVR KWQKKGRLPD AIVIEHPGLA GGHLGAAKVA DLHDARFDFE VVIPQVLEFF
KTAGIEREIP LIAAGGINCR DDILRLQALG ASAVQLGTAF AVTLECDADP AFKKVLADAK
PEDLIEFISV AGLPARAVRT PWLDKYIRLE PRLKAVAHVK AKCNMSFDCL SHCGLRDGDA
SIGQFCIDKQ LGHALDGDIH KGLFFRGAGH LPFGNQIRSV QELLQWLLGG IRPAALTTGA
QT
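
The protein sequence corresponds to the gene sequence read codon by codon. The sketch below is a minimal translation routine, not the database's own tooling. It assumes that the amino-acid assignments of translation table 11 are the same as those of the standard genetic code, with the two tables differing only in permitted start codons. The names `CODON_TABLE` and `translate` are hypothetical helpers introduced for illustration.

```python
# Standard codon table in NCBI order (bases T, C, A, G).
# Assumption: table 11 assigns the same amino acids as this table.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    # Remove the spaces and line breaks used in the blocked layout.
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":  # stop codon: end of the coding region
            break
        protein.append(residue)
    return "".join(protein)


# First 30 nucleotides of the gene sequence, copied from the record.
first_30_nt = "ATGGACAAAT CTTTTCCAGC TGACCCGGCC"
print(translate(first_30_nt))  # MDKSFPADPA
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to the same function should reproduce the full protein, provided the assumption about table 11 holds.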