Gene Noca_0050 details

Gene Information

Locus tag: Noca_0050
Symbol:
ID: 4600101
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardioides sp. JS614
Kingdom: Bacteria
Replicon accession: NC_008699
Strand:
Start bp: 56595
End bp: 58040
Gene Length: 1446 bp
Protein Length: 481 aa
Translation table: 11
GC content: 69%
IMG OID: 639774664
Product: Dyp-type peroxidase family protein
Protein accession: YP_921286
Protein GI: 119714321
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG2837] Predicted iron-dependent peroxidase
TIGRFAM ID: [TIGR01413] Dyp-type peroxidase family
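
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the gene length includes the stop codon (the gene sequence in this record ends in TGA). The function names are hypothetical and used only for illustration.

```python
# Values copied from the record; coordinates are ASSUMED to be 1-based and inclusive.
START_BP = 56595
END_BP = 58040


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature given inclusive start and end coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids encoded by a CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per amino acid, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1446
print(protein_length(length_bp))  # 481
```

Both results match the Gene Length and Protein Length fields listed in the record.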


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCGAGC AGCACCACAA CCCCACCGAA CTCACCGCCA GCACCCCTGG TGCGGCCAGA 
CATGCCCGCG GGGTCGTCCC CTTCCAGCCG GACGACCCCG TCATGAAGTC GCCCTACGCC
CACGGCGCAT TCGTCTTCGC GACCCTTCCC GCCGAGTGGG ACACCCCCGC CGTGACCACC
TGGCTCACCA CCATCGACAC CGCCCGACAG GCCCTGCGTG CTGCGTGCAC TCCTGACGTG
GTTCGCGTCG CGACGATCGC CGTGGGGTTC GGGCCTTCGT TCTTCTATCG GCCCGACGCC
ACTGTGCGCT TCGCCGGGGT GACTCCGCCG GCGGGATTCG CTCAGCTGCC TCCGATGCCG
CACGGCGCGG CGGTGCCGGC CGACGTGTGC TTCTACATCG TGGCGATCGC CGAGGCAGAG
ATCGCCAAGT TCGTCAACGC GCTCGCGGCC AGCGGCGTCA CCGGCCTGGC GATGGAGCAC
GGCTACAAGT CCTATCCGGA CGAGGAGGCG TTCGGGTACC GAGACGGGGT CCGCAACATC
CCCGTGTCCT CGCGCAACGA TTTCGTGTTC ATTGACGCCG ACCGCAACGC CGAGGAGCCG
GACTGGACCC ACCACGGCAC CTACATGGCC TACATGCGCA TCGCCCAGAA CCTGGCCGCG
TTCCAGGCGA TCCCGGCCGC CGAGCAGGAC CAGGTCATCG GCCGGGACCG GACGGGCCGC
CGCCTCGACC TCCCCGAGGG CACCAAAGCA AAGGACGAGC CGTCCTTCGC TACCGACGAC
CCGCGCCTGG ACTCCCACGT CCGCAAGGTC GGTCCGCGCG GGTTCGAGCA CCGCGACGAG
ACCCAGATCT TCCGTCGCGG CCTGCCGTTC TTCGAGGTGC GCGACGGCCA GGTCGTCCAG
GGGCTGCAGT TCGCCTCCTT CCAGGCCTCG CTCGACCAGT TCGACGCGGT GTTCAACCGG
TGGATGCTCA ACCCCGACTT CCCGCGGTCC GGGACAGGGG TCGACGCACT CGTGGCGCGC
GGCCTGATCA CGATCGAGAA GTGGGGTTTC TACTTCGTGC CTCCCGACAC CGACGGCCCC
ATCGGGATGG GCATGTTCGC GCCCGCCAAG GAGACGCGGA AGCCGAAGAC GGGCAGAGTG
GCGGTCCGCA AAGAGCTGGT CGACGCGAAC GGGACACGCG TCAACGGCGA CCTGGGCGGC
TTCACCTTCC AGATCACCGA CCTGGAGGGC AACCCGGTCG GAGAGTCGTT CACTTCAAAC
TCGCATGGCC ACGCGCTGTC GGGCGAGATC CCGCTCGGCG ACTACCAGCT GACCGAGCTG
CCCCCTCAGC CCCCGCAGCC GCCGATGCCA GCGGCCGGGC CGGTGTCGTT CACTCTCCGC
TCCGCCCAGG AGGTCGTGAA GGTCCGCAAC CAGCTCACCC CCGCCGCCGG ACCGTACAAC
GGCTGA
 
Protein sequence
MTEQHHNPTE LTASTPGAAR HARGVVPFQP DDPVMKSPYA HGAFVFATLP AEWDTPAVTT 
WLTTIDTARQ ALRAACTPDV VRVATIAVGF GPSFFYRPDA TVRFAGVTPP AGFAQLPPMP
HGAAVPADVC FYIVAIAEAE IAKFVNALAA SGVTGLAMEH GYKSYPDEEA FGYRDGVRNI
PVSSRNDFVF IDADRNAEEP DWTHHGTYMA YMRIAQNLAA FQAIPAAEQD QVIGRDRTGR
RLDLPEGTKA KDEPSFATDD PRLDSHVRKV GPRGFEHRDE TQIFRRGLPF FEVRDGQVVQ
GLQFASFQAS LDQFDAVFNR WMLNPDFPRS GTGVDALVAR GLITIEKWGF YFVPPDTDGP
IGMGMFAPAK ETRKPKTGRV AVRKELVDAN GTRVNGDLGG FTFQITDLEG NPVGESFTSN
SHGHALSGEI PLGDYQLTEL PPQPPQPPMP AAGPVSFTLR SAQEVVKVRN QLTPAAGPYN
G
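
The sequences in this record are printed in space-separated blocks of ten residues, so they need to be joined before any analysis. The sketch below shows one way to strip that layout and compute a GC percentage of the kind reported in the GC content field. The helper names are hypothetical, and the database's own rounding rule is not stated in the record, so the function returns an unrounded value.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block separators, line breaks) and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence given in any block layout."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence from this record, as printed (six blocks of ten bases).
first_line = "ATGACCGAGC AGCACCACAA CCCCACCGAA CTCACCGCCA GCACCCCTGG TGCGGCCAGA"
print(len(clean_sequence(first_line)))  # 60
print(gc_percent(first_line))
```

To compare against the GC content field, pass the full gene sequence (all lines joined) to `gc_percent` rather than a single line.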