Gene Noca_1942 details

Gene Information

Locus tag: Noca_1942
Symbol:
ID: 4599847
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardioides sp. JS614
Kingdom: Bacteria
Replicon accession: NC_008699
Strand:
Start bp: 2072359
End bp: 2073519
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 74%
IMG OID: 639776540
Product: nifR3 family TIM-barrel protein
Protein accession: YP_923139
Protein GI: 119716174
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0042] tRNA-dihydrouridine synthase
TIGRFAM ID: [TIGR00737] putative TIM-barrel protein, nifR3 family
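
The length fields above are linked by simple arithmetic. The reported gene length is consistent with inclusive start and end coordinates (End - Start + 1), and the protein length equals the codon count minus the terminal stop codon. A minimal Python sketch of that consistency check follows; the function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming inclusive start/end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Coordinates taken from the record above.
length = gene_length(2072359, 2073519)
print(length, protein_length(length))  # prints "1161 386"
```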


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.439583
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCCTCGC CCACCTCACC GACCGGCAGT CCGGCCGGGG GGCTGGCCCT GGGGTCGCTG 
CGCGTCGACA CCCCGGTGGT GCTCGCGCCG ATGGCCGGCA TCACCAACGC GGCCTACCGC
CGGCTGTGTG CCGAGCAGGG GGCCGGCCTC TACGTCTGCG AGATGATCAC CAGTCGGGGG
CTGGTCGAGG GCGACCAGCA CACCAAGGAC ATGCTGGTCT TCGACGAGCT CGAGACGATC
CGCTCCGTCC AGCTCTACGG CAGCGACCCG GCCTACGTCG GCAAGGCCGC CGAGATCCTC
TGCGCGGAGT ACGGCGTCGC GCACATCGAC CTCAACTTCG GCTGCCCGGT GCCCAAGGTG
ACCCGCAAGG GCGGCGGCGG CGCGCTGCCG TGGAAGCGCG GGCTGCTCGC CGAGATCCTG
GAGTCGGCCG TCGCCGCGGC CGCGCCGTAC GACGTGCCGG TCACGATGAA GACCCGCAAG
GGCATCGACG AGGATCACCT CACCTACCTC GACGCCGGCC GGATCGCGCA GGAGTCCGGA
TGTGCGGCGA TCGCGCTGCA CGGCCGGACG GTCGCGCAGG CCTACTCCGG CGCGGCCGAT
TGGGACGCGA TCGCGGCGCT GGTCGAGCAC GTGGACATCC CGGTGCTCGG CAACGGCGAC
GTCTGGGAGG CCGCGGACGC ACTGCGGATG GTCGAGGAGA CCGGCGTCGC GGGCGTCGTG
GTCGGCCGCG GCTGCCTGGG CCGGCCCTGG CTCTTCCGCG ACCTCGCCGC CGCGTTCGGT
GGCGAGGACG TCGCGACCCT GCCGGCCCTG GGCGAGGTGG CCGCGATGAT GCGCCGGCAC
GCCGAGCTGC TGTGCCAGCA CCTGGGGGAG GAGCGCGGCT GCAAGGAGTT CCGCAAGCAC
GTGACCTGGT ACCTCAAGGG TTTCGGCGCG GGCGGCGAGA TGCGGCGCTC GCTGGGCCTG
GTCGACAGTC TCGCGGCACT CGACCGGCTG CTGGCCGAGC TCGACCCGGA CGAGCCGTTC
CCGGAGCGCG AGCTGGGCGC CCCGCGCGGG CGCCAGGGAT CGCCGCGGGC CAAGGTCGCC
CTGCCCGAGG GTTGGCTCGA GGACGCGGAC GGTCGCGGCC GGCACGTCCA GGAGGACGCG
GACGAGACCA CCGGCGGGTG A
 
Protein sequence
MSSPTSPTGS PAGGLALGSL RVDTPVVLAP MAGITNAAYR RLCAEQGAGL YVCEMITSRG 
LVEGDQHTKD MLVFDELETI RSVQLYGSDP AYVGKAAEIL CAEYGVAHID LNFGCPVPKV
TRKGGGGALP WKRGLLAEIL ESAVAAAAPY DVPVTMKTRK GIDEDHLTYL DAGRIAQESG
CAAIALHGRT VAQAYSGAAD WDAIAALVEH VDIPVLGNGD VWEAADALRM VEETGVAGVV
VGRGCLGRPW LFRDLAAAFG GEDVATLPAL GEVAAMMRRH AELLCQHLGE ERGCKEFRKH
VTWYLKGFGA GGEMRRSLGL VDSLAALDRL LAELDPDEPF PERELGAPRG RQGSPRAKVA
LPEGWLEDAD GRGRHVQEDA DETTGG
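
The sequences above are printed in space-separated blocks of ten characters. The following Python sketch shows one way to strip that layout, compute GC content, and translate the gene sequence. It is a hedged illustration rather than the method the database used: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and the codon table is the standard one, which translation table 11 shares for elongation codons (table 11 differs only in the start codons it permits, and this gene begins with ATG).

```python
from itertools import product

# Standard codon assignments in TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above.
first_blocks = "ATGTCCTCGC CCACCTCACC GACCGGCAGT"
print(translate(first_blocks))  # prints "MSSPTSPTGS"
```

Passing the full gene sequence to `translate` and `gc_fraction` allows the result to be compared against the listed protein sequence and the reported GC content.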