Gene Noca_3653 details


Gene Information

Locus tag: Noca_3653
Symbol:
ID: 4595765
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardioides sp. JS614
Kingdom: Bacteria
Replicon accession: NC_008699
Strand:
Start bp: 3877396
End bp: 3878814
Gene Length: 1419 bp
Protein Length: 472 aa
Translation table: 11
GC content: 77%
IMG OID: 639778261
Product: 2-nitropropane dioxygenase, NPD
Protein accession: YP_924840
Protein GI: 119717875
COG category: [R] General function prediction only
COG ID: [COG2070] Dioxygenases related to 2-nitropropane dioxygenase
TIGRFAM ID:
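
The coordinate and length fields above are mutually consistent, which can be checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length = gene_length(3877396, 3878814)
print(length)                  # 1419
print(protein_length(length))  # 472
```

Both results match the Gene Length and Protein Length fields of the record.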


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.042009
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGCCCGGTA CGACACCGTC CATCATCCAA GGTGGGATGG GGGTCGGCGT CTCGTCCTGG 
CAGCTCGCCC GGTCCGTGGC CCTGGCCGGC CACCTCGGCG TGGTCTCCGG TACGGCGCTC
GACGCGACGC TCGCCCGCCG GCTGCAGGAC GGCGACCCGG ACGGGCACGC CCGGCGCGCC
CTGGCCGCGT TCCCCGACCA GGCGATGGTC GCCCGGGCGC TCGACCGCTA CCACCTCCCC
GAGGGCCGGA CGCCGGGCCG CCCGTACCGG CCGACGCCGA AGCCGTCGCT GCGGCCCAGC
CGGCACGCCC AGGAGCTGGC CGTCCTCGGC AACTTCGCCG AGGTGTGGCT GGCCAAGCAG
GGCCACGAGG GGACCGTCGG CATCAACTTC CTCGAGAAGA TCCAGCTGGC CACGCCCGCG
GCCGCGCTCG GCGCGATGCT CGCCGGGGTG GACGTCGTCC TGATGGGGGC CGGGGTGCCC
CGGGAGATCC CGCAGCTGCT CACCGACCTC GCGCGCGGCG ACGTCGGGGG GATCGGCGTC
GACGTGCACG GCGCACCCGA GCGGTCGCGG ATCGAGGTCG ACCCGCGGGG CCTGCTCGGT
CGGCACCTGC CGCCGCTGCG CCGGCCGCGG TTCCTGGCCA TCGTGTCCGC GACCGTCCTG
GCCGCCTACC TTGCCCGCGA CGACGCGACC CGGCCCGACG GGTTCGTGAT CGAGGGTCCG
GTCGCGGGCG GCCACAACGC GCCGCCACGC GGGACGCTCG TGGTCGACGA CGACGGGGAG
CCCGTGTACG GCGACCGCGA CGCGGTCGAC CTCGCCAAGG TCGCGGCCCT CGGCCTGCCG
TTCTGGCTGG CGGGCGGGCA CGGGACGCCC GAGGGCCTGC AGTCGGCCCG CGCCGCCGGG
GCTGCCGGCA TCCAGGTCGG GACGCTGTTC GCGCTCGCCG CAGAGTCGGG ACTGCGCCCC
GAGCTGCGTG AGCAGGTCGG CGCCCAGCTC GGCGCCGGGA CGCTGCGGGT ACGGACCGAC
GTGCGCTCCT CGCCGACCGG CTTCCCGTTC AAGGTCGCCC GGATCGAGGG CACGCTCTCC
GAGTCCGAGG TGTACGAGGA CCGCGAGCGG CTGTGCGACC TGGGCTACCT GCGCACGCCG
TACCTCACCG CGGCGGGCCG GATCGGGTAC CGGTGCCCGG GCGAGCCGGT GGCGGTCTAC
CAGCGCAAGG GCGGCGACCT CGCCGACACG GTCGGCCGCA GGTGCCTGTG CAACGCGCTG
ACCGCCGACG TCGGCCTGGG ACAGACCCGC CCGGACGGCT ACCGGGAGCC GGGCCTGATC
ACGCTCGGGA GCGACCTCAG CGGCCCGCGC CGCCTGCTCG AGCGCCATCC GGGCGGCTGG
ACGGCCGGGC AGGCGGTGGC CTGGCTGGAG GGCCGATGA
 
Protein sequence
MPGTTPSIIQ GGMGVGVSSW QLARSVALAG HLGVVSGTAL DATLARRLQD GDPDGHARRA 
LAAFPDQAMV ARALDRYHLP EGRTPGRPYR PTPKPSLRPS RHAQELAVLG NFAEVWLAKQ
GHEGTVGINF LEKIQLATPA AALGAMLAGV DVVLMGAGVP REIPQLLTDL ARGDVGGIGV
DVHGAPERSR IEVDPRGLLG RHLPPLRRPR FLAIVSATVL AAYLARDDAT RPDGFVIEGP
VAGGHNAPPR GTLVVDDDGE PVYGDRDAVD LAKVAALGLP FWLAGGHGTP EGLQSARAAG
AAGIQVGTLF ALAAESGLRP ELREQVGAQL GAGTLRVRTD VRSSPTGFPF KVARIEGTLS
ESEVYEDRER LCDLGYLRTP YLTAAGRIGY RCPGEPVAVY QRKGGDLADT VGRRCLCNAL
TADVGLGQTR PDGYREPGLI TLGSDLSGPR RLLERHPGGW TAGQAVAWLE GR
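
The sequences above are displayed in space-separated blocks of 10 characters, so they need cleaning before any analysis. Note also that the gene begins with GTG while the protein begins with M: under translation table 11, GTG can serve as a start codon and is then translated as methionine. The following is a minimal sketch, with hypothetical helper names, of how one might strip the display formatting, compute a GC fraction, and run a basic CDS sanity check. The start-codon set is an assumption limited to the most common bacterial starts, and the toy sequence is illustrative only, not the gene above.

```python
def clean_sequence(raw: str) -> str:
    """Remove display spaces and line breaks, and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# Assumption: only the most common table 11 start codons are checked here.
COMMON_STARTS = {"ATG", "GTG", "TTG"}
STOPS = {"TAA", "TAG", "TGA"}


def looks_like_bacterial_cds(seq: str) -> bool:
    """True if length is a multiple of 3 and the first and last codons
    are a common start codon and a stop codon, respectively."""
    return len(seq) % 3 == 0 and seq[:3] in COMMON_STARTS and seq[-3:] in STOPS


# Toy input in the same blocked layout as the record (not the real gene).
toy = clean_sequence("GTGCCC GGTTGA")
print(toy)                            # GTGCCCGGTTGA
print(looks_like_bacterial_cds(toy))  # True
print(round(gc_fraction(toy), 3))     # 0.667
```

Pasting the full gene sequence into `clean_sequence` would allow the same checks to be run against the reported length and GC content.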