Gene Noca_0440 details


Gene Information

Locus tag: Noca_0440
Symbol:
ID: 4597339
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardioides sp. JS614
Kingdom: Bacteria
Replicon accession: NC_008699
Strand:
Start bp: 471452
End bp: 472498
Gene Length: 1047 bp
Protein Length: 348 aa
Translation table: 11
GC content: 73%
IMG OID: 639775054
Product: hypothetical protein
Protein accession: YP_921669
Protein GI: 119714704
COG category: [S] Function unknown
COG ID: [COG5282] Uncharacterized conserved protein
TIGRFAM ID: [TIGR03624] putative hydrolase
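The coordinate and length fields above are related by simple arithmetic, which the following minimal Python sketch illustrates. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon; both assumptions are consistent with the values in this record but are not stated by it. The function names are hypothetical, chosen only for this illustration, and the GC helper simply counts G and C bases in whatever sequence string it is given.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """GC percentage of a nucleotide string.

    Whitespace (such as the 10-base display grouping and line breaks
    used in the Sequence section) is ignored.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return 100.0 * sum(b in "GC" for b in bases) / len(bases)


length = gene_length(471452, 472498)
print(length, protein_length(length))  # prints "1047 348"
```

Under those assumptions the computed values match the Gene Length and Protein Length fields. Passing the full gene sequence to `gc_percent` would give a figure to compare against the reported GC content.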


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.0835114
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTAAGC ACTCCGGCCC CGAGCTGGTC GACTGGGACC TCGCGGTCGC CCTCGGCTCC 
CGGATCGCGG GCGACGGCCC GGTCGTGGAC CGCCGTACCG CGGACGCCGT GGTCGCCGAG
CTGCGCGCCG ACGCCGATCG GTCGACGGGC CTGGTCCGTG CGCACACCGG GCTGCTCGCC
GCCGAGCGGA CCGCCCCGGT CCTGGTCGTC GACCGGCCCG GCTGGGTCAT CGCCAACGCC
GACGGGTTCG CGACGATCCT GGCCCCCGTC GTCGACAAGC TGTCCTCGCG CAAGGGCGAG
CCGACCGGGC TGACGAAGGC GATCGGCTCG CGCGTCACCG GGGCCGAGGT CGGCGCGCTG
CTGGGCTTCC TGGCCGGGAA GGTGCTCGGC CAGTTCGACC CGTTCCACGC CCCGTCCGGG
CGGCTGCTGC TCGTCGCGCC GAACATCGTC CACGTCGAGC GCGAGCTCGG CGTCGTCCCC
CGGGACTTCC GGCTCTGGGT GTGCCTGCAC GAGGAGACCC ACCGGGTGCA GTTCACCGCC
ACCCCGTGGC TGGCCGACCA CCTGCTGGGA GAGATGCACG CGCTCGCCGA GAGCCTCGAG
CCGAGCGGCC TGCTCGACGA CGGGTTGACC CGGATCGCCG GGGCCGTCCG GAGCGCTCGC
GAGGGTGGCG GCAGCCTCCT GGACCTGATC GGAACCCCGG AGCAGAAGGA GATCGTCGAC
CGGGTCACCG GCGTCATGTC CCTGCTCGAG GGCCACGCCG ACGTGGTCAT GGACGACGTC
GGCCCGGGCG TGATCCCGAC GGTGGCCGAC ATCCGCAAGC GGTTCAACCG CCGCCGCCAG
GGCGTCGGGG TACTCGACCG GCTGCTGCGC CGGGTGCTCG GCCTGGACGC CAAGATGGCG
CAGTACCGCG ACGGCGCGAA GTTCGTCCGT GCGGTCGTCG ACAAGGCCGG CATGACCGAG
TTCAACGCGG TCTGGGAGCG GCCCGAGAAC CTGCCGTCGA AGGCCGAGAT CGCGGACCCA
CGAGCCTGGA TCAGCCGGGT GCTGTGA
 
Protein sequence
MSKHSGPELV DWDLAVALGS RIAGDGPVVD RRTADAVVAE LRADADRSTG LVRAHTGLLA 
AERTAPVLVV DRPGWVIANA DGFATILAPV VDKLSSRKGE PTGLTKAIGS RVTGAEVGAL
LGFLAGKVLG QFDPFHAPSG RLLLVAPNIV HVERELGVVP RDFRLWVCLH EETHRVQFTA
TPWLADHLLG EMHALAESLE PSGLLDDGLT RIAGAVRSAR EGGGSLLDLI GTPEQKEIVD
RVTGVMSLLE GHADVVMDDV GPGVIPTVAD IRKRFNRRRQ GVGVLDRLLR RVLGLDAKMA
QYRDGAKFVR AVVDKAGMTE FNAVWERPEN LPSKAEIADP RAWISRVL