Gene Noca_4183 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_4183 
Symbol 
ID4596697 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp4420495 
End bp4421799 
Gene Length1305 bp 
Protein Length434 aa 
Translation table11 
GC content74% 
IMG OID639778789 
ProductHNH endonuclease 
Protein accessionYP_925367 
Protein GI119718402 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAGCCC TTCCGCTCGA GGCACCTGGC CCGGACACCC CGGCCGAGGT CCTGGCTGCC 
CTGGAGACCA CGCACACCGC CCTGGCTGAT GCGGAGATCG GGCAGTTCCG GCTGGCGGTG
GAGTGGGCGA TCGCGCACCC GATCACCTCG GTCGCCGACA TCGCCACCGT CGAGGGCACC
GAAGGCGAGG TCGCGATCGC CGGGACGGGT GCGCCGTTGG TGGCGGAGTT CTGTGTCGCG
GACTTCGCCG CCGCGATCGG GGTGAGCACC GATGCCGGCC GGGCCTACCT GGGGGACGCG
GTCGAGGTCT GCTACCGGCT CCCCCTGCTG TGGGCCCAGG TCATGTCCGC GCGGGTGCCG
GTATGGAAGG CCCGCCGGAT CGCCGGGCAC ACCCACGGCC TGTCCTTCGA GGGCGCCGCC
TGTGTGGACC GGCACCTGGC CCCGATCGCG CACCGGGTCT CCTTCGCCCA GATCGAACGC
ACCGTCGAAG CCGCCCGCGC CATGCACGAC CCCGCCCAGG CCGAGCAACG CCGCCGCGAG
GCCGCCGACG GCCGGTTCTG TGACATCGAC ACCACCCAGG TCGGGCTGCA GGGCACCATG
ACCGTGCGCG GCGAGCTCGA CCTGGCCGAC GCCCTCGACC TCGACCAGGC CCTGCGACAC
GGCGCCCAAC AGCTCGCCGA CCTCGGCTGC ACCGAGACCC TCGACGTCCG CCGCGCCCTG
GCCGTCGGCG CCCTGGCCCG CGGCGACCTC ACCCTCGGAC TCGACACCCA CCCCGCCACC
CAGGCCGAGG CCGACACCGA CACCCAGCCC GAGAGCCAGG CCGGCCCCGA GGCGGAGCCG
CCGCCCAAGC GGCGGCGCGG GTTGATGCTG TACGCGCACC TGACCGACGA CGCGGTCCGC
GGCCTGCTCG CCACCGTGGA GAACACCCGC TCCCAGGTCC TGGTCGGCCA GGTCGCCGGC
TGGTGCGCCA CCGCGACCGG CCCGGTCACC ATCCGGCCGG TCCTGGACCT GCACGAACAC
CTGCAGGTGC CGGGCTACCG GCCCTCCCCA CGGTTGCGCG AACAAGTCCT CCTCACCCAC
CCCACCTGCG TGTTCCCCCA CTGCACCCGG CCATCACGGT CCTGCGACCT CGACCACGTG
ATCCCCTGGG CCGAGGGCGG CCCCACCTGC TCGTGCAACC TGGTCCCCGC CTGCCGGTTC
CACCACCGGC TCCGCACCCA CGGCGGCTGG CGGCTCCACC GCGTCGGCGA ACGACTCTTC
GTATGGACCA GCCCCCACGG ACGCATCTAC ACCCGACATC TGTGA
 
Protein sequence
MAALPLEAPG PDTPAEVLAA LETTHTALAD AEIGQFRLAV EWAIAHPITS VADIATVEGT 
EGEVAIAGTG APLVAEFCVA DFAAAIGVST DAGRAYLGDA VEVCYRLPLL WAQVMSARVP
VWKARRIAGH THGLSFEGAA CVDRHLAPIA HRVSFAQIER TVEAARAMHD PAQAEQRRRE
AADGRFCDID TTQVGLQGTM TVRGELDLAD ALDLDQALRH GAQQLADLGC TETLDVRRAL
AVGALARGDL TLGLDTHPAT QAEADTDTQP ESQAGPEAEP PPKRRRGLML YAHLTDDAVR
GLLATVENTR SQVLVGQVAG WCATATGPVT IRPVLDLHEH LQVPGYRPSP RLREQVLLTH
PTCVFPHCTR PSRSCDLDHV IPWAEGGPTC SCNLVPACRF HHRLRTHGGW RLHRVGERLF
VWTSPHGRIY TRHL