Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_1634 |
Symbol | |
ID | 4600013 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | + |
Start bp | 1734138 |
End bp | 1737056 |
Gene Length | 2919 bp |
Protein Length | 972 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 639776233 |
Product | xanthine dehydrogenase, molybdenum binding subunit apoprotein |
Protein accession | YP_922834 |
Protein GI | 119715869 |
COG category | [C] Energy production and conversion |
COG ID | [COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs [COG2080] Aerobic-type carbon monoxide dehydrogenase, small subunit CoxS/CutS homologs |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type [TIGR03313] probable selenate reductase, molybdenum-binding subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.024681 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCACCA TGATCGACCT CGAGCTCCAC CTCAACGGCG CTCCGCGCAC CGTGACCTGC GCGCCGGGCG ACTCGCTGCT GAGCGTGCTG CGGCGGCTGG GCTGCTTCAG CGTGCGGTTC GGCTCCGAGA CCGGCGAGAC CGGCGCCGCC GCGGTCCTGG TGGACGGGCG GCTGCTGGCC GCCGACGTGC TGCTCGCCGC CCAGGCCGCC GGCCACGAGG TGACGACCGT GGAGTCACTG AACGTCGCCA CCGGCGAGCT GCACCCCATC CAGGCCGCGT TCGTGGAGTC CGGCGCGCTG CAGTCCGGCT ACTCCGCCGG GGCCATGGTG CTCGCCACCC AGGCCCTGCT CGCGCGCGAC CCCGACCCCT CGGAGTGGGA GATCCGCGAC GCGTTCTCCG GGATCCTCGA CCGGGAGACG GGCTACGTGA AGGTCGTCGA CGCCGTGCAG CGCGCGGCCG CGCTGCTCCG TGGCGAGAAG CCGGAGCCGA TGGCGCCCGT CGTCGTCGGG CGGCTCACCG ACGGCGACGG TGGGACCGTC CCGGACGCTC CCGCTGCCAT GCCGCGCCTC GTGCCCGCGG CGGACGTCCC CTCCATGCAG GTGGTCGGGA CGCCCGCGCC CAAGGTCGAC GCGCTCAAGC TGGTCAAGGG GAATCCGGCC TTCGTCGACG ACATCGAGCT GCGCGGGATG CTCTACGCCA AGGTGCTGCG CAGCCCGCAC GCGCACGCCC GGATCGTCGC CATCGACGAC TCCGAGGCCC GCGCGCTCCC AGGCGTGCAC GCCGTGCTGC ACACCTTCAA CACGGCCCGC GTGAAGTACG CCTCCGGCGG CCAGAGCTGG CCGAACCCGC GCCCCTGGGA CCAGGTGAGC TTCGACGACA AGGTGCGCCA CGTCGGTGAC CGCGTGGCCG CGGTGGCCGC GGAGACCGTG GCGATCGCCG AGGAGGCCTG CCGGCGGATC AAGGTGACGT ACGACGTCCT GCCGGCGGTC TTCGACGAGG TCGAGGCGAT CGGCCCCGGC GCTCCCGTCA TCCACGACGA GACCGACACG GAGGGGATCG CCGACGCGGC CCGCAACATC CCCCGCCGGG TCAACGGGCA GACGGTCACC GACGACGAGC TGGCGGCCGC GTTCGCCGAC GCCGAGCACG TCTTCGAGCA GACCTTCCAC GTCCAGCAGC AGCAGCAGAC CCCGATCGAG CCGCACATCA CCATCGGCTG GCTCGACGCG GACGAGCGGC TGACGCTGCG CAGCTCCACC CAGGTCCCGT TCCACACCCG CCGGATGGTC GCCCCTCTGC TCGGCCTGCC GGTCAAGCAG ATCCGGGTGA TCAAGCCGCG GCTCGGCGGT GGCTTCGGCG GCAAGCAGGA GATGCTGCTC GAGGACGTGG TCGGGCACCT GGTGCTCGCG ACCCGCCGAC CGGTCCGGCT GGAGCTGACC CGGGAGGAGG AGTTCGTCTC CTCGCGGATC CGTCACGCCC AGACGATCAC CTTCCGCTCC GCCGTCGACG CCGACGGCCG ACTGCTCGCC CTGGACCACC ACGTGGTCGG TAACACCGGC CCGTACGCGA CGCATGGGTT CACGGTGCAG TCCGTCTCCG GCCAGCGCGG CCTGTCGCAG TACAACTGCC CCGCCAAGCG GTACACGGCC GACGTCGCCT ACACGAACCG GCCGGTGGCC GGGGCGTTCC GCGGCTACGG CGCGCCCCAG GCGCTGTTCG CGCTGGAGTC CCACATGGAC GACATCGCCC GCAGCCTCGG CCTCGACCCG ATCGAGCTCC GTCGGCGGAA CTGGGTGCGC ACCGGCGACC CCCTCGACAT CCTGCCCACG CTGGGCGAGC GGGGCGCCGC GGAGATCCAG GGCGAGGTGC CCCGGGTGAC CAGCTGCGCG ACCGACGAGT GCGTGGCGCA GGCCAGGCGC GCGATCGGCT GGGAGCGGCG CAACGACCCC GCGTGGAAGC AGCCCGCGGA CCGGCCGAGC ATCCGCAGGG GGATCGGCTT CGCGCTGTGC ATGCAGGCCA CCGCGATCCC CAACCTCGAC ATGGGTGGTG CCTCGATCAA GATGAACGAC GACGGCTCGT TCAACCTGCT GGTCGGTGCG ACGGACCTCG GCACGGGCGC CGACACGGTG CTGGCCCAGA TCGCCGCCGA GGTGCTCGGG GTCGAGGCCA GTGACGTGGT CGTGTACGCC GCCGACACCG ACCTGACGCC GTTCGACGTC GGGGCCTACG CCGACTCCAC GACGTACATC TCCGGGATGG CGGTCAAGAA GGCCGCCGAG GCCGTGCGCG AGCGGATCGT GCTGCGCGCG GCCCGGCTGC TCGAGCTGGC CGGCCCCGCC AAGGTCGAGC TGCGGAACAA GCAGGCCTGG GCGCCCGACG GGCGCTGCGT GACGCTCCAG GACGTGGCCC TCCATGCGCT GCACATCGAG GACCAGGAAC AGATCATGGC GACGGCCTCG CACGTCTCGG GTGAGTCGCC GCCGCCCTTC GCCGCCCAGA CGGCCGAGAT CGAGGTGGAC GTGGAGACCG GCCAGGTGAC CGTCACGAAG CTGGTGATGG CCGTCGACGC CGGGGTCGTC ATCAACCCGA CCACGGCGAG CGGCCAGGTC GAGGGCGCGA TGGTGCAGGC GCTGGGCTAC GCGCTCACCG AGGACCTCGT CCTCGACGAC GACGGCCGTG CGGTGAACGC CCGGTTCGGG CCGTACTGGA TCTTCCGGCC CGACGACACG CCCCCGATGG AGGTGTTCCT GGTGCAGACC ATGGAGCCGA GCGGGCCGTT CGGCGCCAAG TCGGTCGGCG AGATCGCCAT CGACGGGGTG GCACCCGCCG TGCGCAACGC GATCCTCGAC GCGACCGGGG CCACCGTCAA CGAGCTGCCG CTCACCCCCG AACGGGTCTG GCGGGCGCTG CACGCCTGA
|
Protein sequence | MSTMIDLELH LNGAPRTVTC APGDSLLSVL RRLGCFSVRF GSETGETGAA AVLVDGRLLA ADVLLAAQAA GHEVTTVESL NVATGELHPI QAAFVESGAL QSGYSAGAMV LATQALLARD PDPSEWEIRD AFSGILDRET GYVKVVDAVQ RAAALLRGEK PEPMAPVVVG RLTDGDGGTV PDAPAAMPRL VPAADVPSMQ VVGTPAPKVD ALKLVKGNPA FVDDIELRGM LYAKVLRSPH AHARIVAIDD SEARALPGVH AVLHTFNTAR VKYASGGQSW PNPRPWDQVS FDDKVRHVGD RVAAVAAETV AIAEEACRRI KVTYDVLPAV FDEVEAIGPG APVIHDETDT EGIADAARNI PRRVNGQTVT DDELAAAFAD AEHVFEQTFH VQQQQQTPIE PHITIGWLDA DERLTLRSST QVPFHTRRMV APLLGLPVKQ IRVIKPRLGG GFGGKQEMLL EDVVGHLVLA TRRPVRLELT REEEFVSSRI RHAQTITFRS AVDADGRLLA LDHHVVGNTG PYATHGFTVQ SVSGQRGLSQ YNCPAKRYTA DVAYTNRPVA GAFRGYGAPQ ALFALESHMD DIARSLGLDP IELRRRNWVR TGDPLDILPT LGERGAAEIQ GEVPRVTSCA TDECVAQARR AIGWERRNDP AWKQPADRPS IRRGIGFALC MQATAIPNLD MGGASIKMND DGSFNLLVGA TDLGTGADTV LAQIAAEVLG VEASDVVVYA ADTDLTPFDV GAYADSTTYI SGMAVKKAAE AVRERIVLRA ARLLELAGPA KVELRNKQAW APDGRCVTLQ DVALHALHIE DQEQIMATAS HVSGESPPPF AAQTAEIEVD VETGQVTVTK LVMAVDAGVV INPTTASGQV EGAMVQALGY ALTEDLVLDD DGRAVNARFG PYWIFRPDDT PPMEVFLVQT MEPSGPFGAK SVGEIAIDGV APAVRNAILD ATGATVNELP LTPERVWRAL HA
|
| |