Gene Dole_1526 details


Gene Information

Locus tag: Dole_1526
Symbol:
ID: 5694363
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfococcus oleovorans Hxd3
Kingdom: Bacteria
Replicon accession: NC_009943
Strand:
Start bp: 1819976
End bp: 1821043
Gene Length: 1068 bp
Protein Length: 355 aa
Translation table: 11
GC content: 62%
IMG OID: 641264121
Product: 2-nitropropane dioxygenase NPD
Protein accession: YP_001529407
Protein GI: 158521537
COG category: [R] General function prediction only
COG ID: [COG2070] Dioxygenases related to 2-nitropropane dioxygenase
TIGRFAM ID:
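
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the CDS ends in a single stop codon; neither assumption is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1819976
END_BP = 1821043
GENE_LENGTH_BP = 1068
PROTEIN_LENGTH_AA = 355


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based, inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one, assuming the CDS ends in a single stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both comparisons hold for this record under the stated assumptions.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)               # True
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)  # True
```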


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.172251
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATTTTC AAAAAGGAAA GCTTATGAAA ACAGCGGTGA CACAACTTCT GGAATGCGAA 
TATCCGGTAT TGCTTTCCGG CATGACCGGG GTCAGCACGC CGGAACTGGC CGGCGCGGTG
AGCAATGCCG GGGGACTGGG GCTGCTGGCC ACGGCCGATC TTACCCTGGA GCAGACCCGG
CAGGCGGTCC GGCGGACGCG CAGGATAACG GACCGGCCTT TTGGCGCCAA CGTGCCGCTT
CTGATTCCAG GCGCTGAAGA GAAGATGGCG GTGCTGGTGG AAGAAAAGGT GCCGGTGGTC
AACTACACGC TTGGCAGTGG AGAACAAGTG GCTGATGTGG TCCACCGGTA CGGCGGAAAA
GTCGTGGCCA CAGTGACCAC GGCAAAACAT GCCCTGTCAG CTGAAAAACA CGGGGCCGAC
GCCCTGATCG TGACCGGCCA TGAGGCCGCG GCCCATGGTG GAGCCGTGGC TTCCATGGTG
TTGATCCCCG GCATTGTGGA CCGGGTGAAC ATCCCGGTAA TCGCCGCCGG CGGCATTGCC
GACGGCAGGG GGCTGGCCGC GGCCCTTGTG CTGGGGGCCG AAGGCGTGGC CATGGGCACC
CGTTTCATGA ATACCCGGGA AAGCCCGGTT CACGACACCA TGAAGCAGCT CTGCAACCAG
AAAGCGGTGG AAGACACGGT TTATTCAGAC CGGATCGACG GCCTGCCCTG TCGGGCACTG
GACTCTTCGG GCGCCAGAAA AATGATGGCC GACCGGCTTT ACCTGTTCAA GGCCCTTGCC
AACTCCCGTT TCGCGGCCCG GGCATACGGG TTCCCATGGA TTGCGGCCAT GGCCGGCATC
CTGTTGTCCG GCTATCGCCG GTCGAAACAG CTGGCCCGCA TGGCCAATGC CTTTCGCGCC
GTCAAACTGG CCATTGATGA CGGGGATCAG AAACGGGGGG TCTTTCTCAT GGGCCAGGTG
ACCGGCCTTA TTGACGAAAC CCTGACCGTC GGCCAGGTCA TGGAGAAGAT TCTTATTGAA
GCGGCATCGG CCCGAGCAAG GCTGGCGGCA GGAATGGGGC AGGAATAG
 
Protein sequence
MNFQKGKLMK TAVTQLLECE YPVLLSGMTG VSTPELAGAV SNAGGLGLLA TADLTLEQTR 
QAVRRTRRIT DRPFGANVPL LIPGAEEKMA VLVEEKVPVV NYTLGSGEQV ADVVHRYGGK
VVATVTTAKH ALSAEKHGAD ALIVTGHEAA AHGGAVASMV LIPGIVDRVN IPVIAAGGIA
DGRGLAAALV LGAEGVAMGT RFMNTRESPV HDTMKQLCNQ KAVEDTVYSD RIDGLPCRAL
DSSGARKMMA DRLYLFKALA NSRFAARAYG FPWIAAMAGI LLSGYRRSKQ LARMANAFRA
VKLAIDDGDQ KRGVFLMGQV TGLIDETLTV GQVMEKILIE AASARARLAA GMGQE
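
The gene and protein sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how the two relate, the code below strips that whitespace and translates the DNA codon by codon. It uses the standard amino-acid assignments, which translation table 11 shares with the standard code; table 11 differs in its set of permitted start codons, which does not matter here because this gene starts with ATG. The helper names `clean` and `translate` are hypothetical and not part of any database tool.

```python
# Compact form of the standard codon-to-amino-acid assignments,
# with codons ordered by base in the sequence T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq_text: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks."""
    return "".join(seq_text.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from position 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence in this record.
first_line = "ATGAATTTTC AAAAAGGAAA GCTTATGAAA ACAGCGGTGA CACAACTTCT GGAATGCGAA"
print(translate(clean(first_line)))  # MNFQKGKLMKTAVTQLLECE
```

The printed residues match the first two blocks of the protein sequence listed above. Passing the full gene sequence through the same two functions would allow the whole record to be compared in the same way.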