Gene Noca_0612 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_0612 
Symbol 
ID4596029 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp647541 
End bp649913 
Gene Length2373 bp 
Protein Length790 aa 
Translation table11 
GC content64% 
IMG OID639775219 
Productxanthine dehydrogenase, molybdenum binding subunit apoprotein 
Protein accessionYP_921833 
Protein GI119714868 
COG category[C] Energy production and conversion 
COG ID[COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGTAGAC CTGGCGCTTC CGGTGGTCTC GGCCCTTCAG CCAAGCGATA CGTGGGACAG 
CCCGTGCGTC GCCGTGAGGA CGCGCGGCTC CTGCGGGGAG ATGCGCGTTT CGTGGACGAT
GTCGATATTC ACGGGCAGCT GTACATGAGC GTGGTGCGTT CATCTGAGGC CCATGCGCGC
ATCGTCTCCG TGGAGCCCTC TGCGGCCCGA CGGGCTCCCG GCGTCCGGCT AGTCATCACG
GCTGATGACA TCGATGCTGA AGTGCCGCTG GAACAGATCG GCTACCACGA GGTTTACCCG
CAGATCGACG ACTATCTGCA CCCGGTCTTC GCCGGTGACC GTGTCCGGTA CGTCGGGCAA
CCCGTCGCCG CCGTCATCGC TGACACGCCG TATCTGGCTG AAGACGCCGC CGGGCTGGTG
GAGGTCACCT ACGACCTCCT TCCGCCGGTG CTCGATCCTC AGTACGCCTT GACCGATGAG
GCCGAACCAC TCTTCGAGGG GCAGTCCAAC GAGGCCGTTC GCATCGTGAA GGCGTACGGC
GACGTCGCGG ACGCCTTTAA GAAGGCAGCG CATGTCGTGA AGGGTCGCTA CGTGGTCGGT
CGTCATTCCG GGGTGCCGAT GGAGACGAGA GGGTGCATCG CAGAGCCCGA CCGCGGCCGC
AAGCAACTCT TCATGTGGGG GCCGGTCCAC ACCCACGACT GTCAGCGCCT CATCGCGCAA
GTACTCGAAC TCCCGCTCGC GGATCTGCGC ATGAAACACG TCGACATCGG AGGGAACTTC
GGTGTGAAGG GCGGGGTCTT TCCGGAATAC ATCATGGTGG GCTGGGCGGC CATGCGTCTG
GGACGTCCTG TCAAGTGGAC CGAAGATCGC CTCGAGCACA TGGTGGCAAA CGCTCACGCT
CGCGAACAGG TGCACGAAAT GGCCGCGGCC TTCGACGCCG ACGGGGTCCT GCTGGCTTTG
AAGGACGAGA TCTGGCACAA CCACGGAGCG TTCATTCGGC AGGCCGAGCC TCTCGTCAGC
GACATCACAG TCGGGATGGT CCCAGGCCCC TACCGAGTGC CGGCGTACGA CGGGCTCCTT
CACGTCGTGG TGTCCAACAA GACGCCGCTG TCCGCCTACC GTGCTCCAGG CAGGTACGAA
GGGACCTTCG CTCGCGAGCG GCTCCTCGAC CTCGCGGCCG AACAGATCGG CATTTCGCAG
GTAGAGATTC GGCGCCGCAA TCTGCTGACC GAAGCCGACC TTCCATTCGC GCCGGGCATG
GACATCTGTT TCGAGCCCTA CCACTTCGAT TCGGGTGACG TCGTTGACCA CTTGGACAAA
GCGCTCGAGT CGGCGGGGTT CGACGATTGG GAGCGGGAGG CGGCAGAGCT CCGAGCGCAG
GGCCGCTTAG TCGGCAACGG AATCGGCATG CTGATGGACA AGGCTGGCCT TGGGCTCTAC
GAGACGAGCG CCATCGACGT AGACGCCTCT GGACGGATCA GAGTTCGAAC TGGGGCATCT
TCCGTCGGAC AGGGTATCGA AACAGTCCTG GCGCAGATCG TTGCCGATGA ACTCCAAGTC
GATCCAGAGC TCATCGACGT CGTTCACGGT GACACCGAGC TTGTGCCCGA GGGTGTCGGC
TCCTGGTCGA GTCGCTCGAC CGTTCTGGCC GGAGGAGCGG CGCGTCAGGC GGCCCTCGAC
ACGCTAGCCA AGGCCAAGAG GCTCGCCAGC GAGATGCTGG AAGCAAATGT GGACGATCTG
ATCCTCGTCG ATGGCCGCAT CGTGGTGTCC GGGCTCGAGC AGCAGGGGTT GAGTCTCGCC
GAGATCGCCG GACGGTGGGA CGGATGGTCC GCGAGGTTGG CCAACGACGA GCCAGGCCTC
GGCGCGCAGG CCGTCTACCT GGATGAGCAC ATGAACTATC CCTACGGGGT CACTCTCGTG
CAGATCGAGA TTGATCCCGC CACGGGAGGT CACACCTTGA GGCGTTTCCT TACCAGCACC
GAGGCCGGAC GAGCGATCAA TCCAATGACC ACCCGCGGCC AAGTGATCGG TGCGGCAGCG
CAAGGCATCG GAGGTGCCCT CTACGAGGAG TTCCTGTACG ACGAATCCGG TCAACCGTTG
GCCACGTCCT TCATGGACTA CCTTCTGCCG ACTTCGCTGG ACGTGCCAGA TGTCGACTTC
TTCATGACGG AGGACGCGCC TACACCCAAC AACCCGTTCG GGGCCAAGGG CTTGGGGGAG
GTCGGCCTCA TCGCTGTGGG CGCGGCCATC GCGGGCGCCA TAGACGATGC CTTTGGCGAA
GGTGTGCGCA CGATCAAGGT GCCCGTCCCG CCCGAGACGC TCTGGCGGAG GTCGCATTCG
ATCGCGTTCA GTGATGAACC CGAGTCGATG TGA
 
Protein sequence
MSRPGASGGL GPSAKRYVGQ PVRRREDARL LRGDARFVDD VDIHGQLYMS VVRSSEAHAR 
IVSVEPSAAR RAPGVRLVIT ADDIDAEVPL EQIGYHEVYP QIDDYLHPVF AGDRVRYVGQ
PVAAVIADTP YLAEDAAGLV EVTYDLLPPV LDPQYALTDE AEPLFEGQSN EAVRIVKAYG
DVADAFKKAA HVVKGRYVVG RHSGVPMETR GCIAEPDRGR KQLFMWGPVH THDCQRLIAQ
VLELPLADLR MKHVDIGGNF GVKGGVFPEY IMVGWAAMRL GRPVKWTEDR LEHMVANAHA
REQVHEMAAA FDADGVLLAL KDEIWHNHGA FIRQAEPLVS DITVGMVPGP YRVPAYDGLL
HVVVSNKTPL SAYRAPGRYE GTFARERLLD LAAEQIGISQ VEIRRRNLLT EADLPFAPGM
DICFEPYHFD SGDVVDHLDK ALESAGFDDW EREAAELRAQ GRLVGNGIGM LMDKAGLGLY
ETSAIDVDAS GRIRVRTGAS SVGQGIETVL AQIVADELQV DPELIDVVHG DTELVPEGVG
SWSSRSTVLA GGAARQAALD TLAKAKRLAS EMLEANVDDL ILVDGRIVVS GLEQQGLSLA
EIAGRWDGWS ARLANDEPGL GAQAVYLDEH MNYPYGVTLV QIEIDPATGG HTLRRFLTST
EAGRAINPMT TRGQVIGAAA QGIGGALYEE FLYDESGQPL ATSFMDYLLP TSLDVPDVDF
FMTEDAPTPN NPFGAKGLGE VGLIAVGAAI AGAIDDAFGE GVRTIKVPVP PETLWRRSHS
IAFSDEPESM