Gene Noca_4171 details


Gene Information

Locus tag: Noca_4171
Symbol:
ID: 4596685
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardioides sp. JS614
Kingdom: Bacteria
Replicon accession: NC_008699
Strand:
Start bp: 4403943
End bp: 4406273
Gene Length: 2331 bp
Protein Length: 776 aa
Translation table: 11
GC content: 73%
IMG OID: 639778777
Product: molybdopterin oxidoreductase
Protein accession: YP_925355
Protein GI: 119718390
COG category: [C] Energy production and conversion
COG ID: [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing
TIGRFAM ID:
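
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 4403943
end_bp = 4406273


def gene_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(start_bp, end_bp)
print(length_bp)                  # 2331
print(protein_length(length_bp))  # 776
```

Both results match the Gene Length (2331 bp) and Protein Length (776 aa) fields.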


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0561971
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAGTGCCA ACACCGAACA GGCCAGCACC GACCGGACCC AGGACGTCTG GGGAGAGCGC 
ACGCCGCACC CCCGCGCCAC CCCGTGGCCG ACCCGGGTGG ACCTCCACCT CGAGGACGGC
CTGGCCGAGG ACCAGGTGGA CTCCTGGGTC GCCTCGGCGT GCCTGCTGTG CAGCAACGGC
TGCGGCGTCG ACATCGCCGT CAAGGACGGG AAGATGGTCG GCGTCCGCGG CCGGGAGACC
GACCGGGTCA ACCACGGCCG GCTCGGCCCC AAGGGCCTCT ACGGCAGTAC GCCGTGGTCG
CACTCCCCCG ACCGGCTCAC CACGCCGCTG GTCCGCGAGC ACGGCGAGCT GGTCGAGACC
GACTGGGACA CCGCGATGGG CCGGATCGTG GCCCGCTCGA AGCAGCTGCT CGAGGAGCAG
GGCCCGCTCA GCCACGGCTT CTACACCAGC GGCCAGCTGT TCCTCGAGGA GTACTACACG
CTCGCGGTGA TCGGGAAGGC CGGGCTCGGC ACCCCGCACA TGGACGGCAA CACCCGGCTC
TGCACCGCGA CCGCCGCGGC CGCGCTCAAG GAGTCCTTCG GCGCCGACGG CCAGCCGGGA
TCCTACGAGG ACATCGAGCA CTGCGACGCG CTGTTCCTCT ACGGCCACAA CATGGCCGAG
ACCCAGACCG TGCTGTGGTC GCGGATCCTG GACCGCACCC GCGGCGAGGA CCCGCCGAAG
GTCGTCTGCG TCGACCCGCG CCGCACCGCG GTCGCCGCCG AGGCCGAGCG CACCGGCGGC
GTCCACCTGG CCCCGAAAGT GGGCACCAAC CTGGCGCTGA TGAACGCCCT GATCCGCGAG
CTGCTCGAGC ACGACGCGTG GGTCGACCAC GCCTGGGTGG ACGCGCACAC GATCGGGCTC
GAGGGCCTGC GCGCCGCGGT CGCGCCGTAC ACCCCCGAGC AGGCGGCGGA GATCTGCGGC
GTCGACCCCG ACGAGGTACG CCGGGCGGCG CGGATCTTCG GGGAGTCCGA GGCGGTCCTC
TCCACGGTGC TGCAGGGCTT CTACCAGTCC CACCAGGCGA CGGCCGCGTC CGTGGCGGTC
AACAACCTGC ACCTGCTGCG CGGCATGATC GGCCGCCCGG GGGCCGGGCT GCTGCAGATG
AACGGCCAGC CGACCGCGCA GAACAACCGC GAGACCGGCG CCGACGGCGA CCTGACCGGG
TTCCGCAACT GGGACAACCC GACGCACGTC CAGGAGCTCG CCGACCACTG GAACGTCGAC
CCGATGACGA TCCCGCACTG GGCGCCGCCG ACCCACGCGA TGCAGATCTT CTCCTACGCC
CAGGCCGGCA CCGTCGGGCT GCTGTGGATC TCCGCGACCA ACCCGGCCGT GTCGATGCCG
GAGACCGAGC GGATCAGGAG CATCCTGTCC GGCGACCAGT GCTTCGTGGT CGTGCAGGAC
CTGTTCCTGA CCGAGACCGC GCAGCTCGCC GACGTGGTGC TGCCCGCGGC CGGGTGGGGC
GAGAAGACCG GCTGCTTCAC CAACGTCGAC CGCACCGTGC ACCTCTCGCA ACAGGCCGTC
GACCCGCCCG GCGAGGCGCG CAGCGACCTG GACATCTTCC TGGCCTACGC CGACGCCATG
GGGTTCGAGG ACCAGGACGG CGGGCCGCTG ATCACCTGGC GTACCCCCGA GGAGACCTTC
GCGCACTGGT CGGCGGCGAC CCGCGGCCGC CCCTGCGACT ACACCGGCAT CACCTACGAG
CTGCTGAGCG GGCCCACCGG CGTGCAGTGG CCGCTCGGCG TCGAGCGGCT GTACGCCGAC
GCGCAGTTCC CCACCCACAC CGAGCAGTGC GAGACCTTCG GCCACGACCT GACCACGGGC
GCGACCGTCA CCGAGCAGGA GCACCGGGCC CAGGCGCCGG CGGGGCGCGC GTTCCTCAAG
GGCGCGCCGT ACTCCCCGCC GCACGAGGAG CCCAGCGCGG AGTATCCGTT CCGGCTGACC
ACCGGCCGCA CCGTCTACCA GTTCCACACC CGCACCAAGA CCGCCCGGTC CCGGCCGCTC
GACGCGTGCG CCCCGCACGC GTGGGTGGAG CTCGCCCCGG CCGACTCCCA GCGGCTCGGG
ATCGCCGACG GCGACCTGGT CCGGCTGGAG TCGCCGCGCG GCAGCGTCGA CGTGCCGGCC
CGGGTCACCG AGGTGACCGA GGGGGCGGTG TTCGTGCCGT TCCACTACGG CGACCACCCC
GCCAACGAGC TGACGATGAC CGTCTGGGAC CCGGTGTCCA AGCAGCCCAC GTTCAAGACC
GCTGCGTGCC GGGTCACCCG CCTCGGCGCC GGCACGGGAG AGGAGGGCTG A
 
Protein sequence
MSANTEQAST DRTQDVWGER TPHPRATPWP TRVDLHLEDG LAEDQVDSWV ASACLLCSNG 
CGVDIAVKDG KMVGVRGRET DRVNHGRLGP KGLYGSTPWS HSPDRLTTPL VREHGELVET
DWDTAMGRIV ARSKQLLEEQ GPLSHGFYTS GQLFLEEYYT LAVIGKAGLG TPHMDGNTRL
CTATAAAALK ESFGADGQPG SYEDIEHCDA LFLYGHNMAE TQTVLWSRIL DRTRGEDPPK
VVCVDPRRTA VAAEAERTGG VHLAPKVGTN LALMNALIRE LLEHDAWVDH AWVDAHTIGL
EGLRAAVAPY TPEQAAEICG VDPDEVRRAA RIFGESEAVL STVLQGFYQS HQATAASVAV
NNLHLLRGMI GRPGAGLLQM NGQPTAQNNR ETGADGDLTG FRNWDNPTHV QELADHWNVD
PMTIPHWAPP THAMQIFSYA QAGTVGLLWI SATNPAVSMP ETERIRSILS GDQCFVVVQD
LFLTETAQLA DVVLPAAGWG EKTGCFTNVD RTVHLSQQAV DPPGEARSDL DIFLAYADAM
GFEDQDGGPL ITWRTPEETF AHWSAATRGR PCDYTGITYE LLSGPTGVQW PLGVERLYAD
AQFPTHTEQC ETFGHDLTTG ATVTEQEHRA QAPAGRAFLK GAPYSPPHEE PSAEYPFRLT
TGRTVYQFHT RTKTARSRPL DACAPHAWVE LAPADSQRLG IADGDLVRLE SPRGSVDVPA
RVTEVTEGAV FVPFHYGDHP ANELTMTVWD PVSKQPTFKT AACRVTRLGA GTGEEG
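
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any further processing. The following is a minimal sketch of that clean-up together with a GC percentage calculation. The record does not state how its GC content figure was derived, so treating it as the share of G and C bases in the gene sequence is an assumption. The helper names are illustrative.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence from the record, used as a small sample.
sample = "GTGAGTGCCA ACACCGAACA GGCCAGCACC GACCGGACCC AGGACGTCTG GGGAGAGCGC"
seq = clean_sequence(sample)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Passing the full gene sequence instead of the sample line gives a value that can be compared with the GC content field. Note that the gene begins with GTG while the protein begins with M: under translation table 11, GTG is an accepted alternative start codon and is read as methionine at the initiation position.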