Gene Noca_4801 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_4801 
Symbol 
ID4595405 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008697 
Strand
Start bp124926 
End bp127316 
Gene Length2391 bp 
Protein Length796 aa 
Translation table11 
GC content63% 
IMG OID639772588 
Productmolybdopterin oxidoreductase 
Protein accessionYP_919248 
Protein GI119714106 
COG category[C] Energy production and conversion 
COG ID[COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing 
TIGRFAM ID[TIGR00509] molybdopterin guanine dinucleotide-containing S/N-oxide reductases 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGACCT TTCCGCTGAC ACTCACTCAT TGGGGCGCAT ACCGGATGCG GCACGTCCCC 
GGTGGGGCCA CCGAGGCATT GCCATTCGAG AATGACCCAG ATCCGTCGTC ACTGGGGAGG
TCTATGGCTG GGGCGTGGAA TTCCCAAGCC AGGATCCTTC GGCCAGCGGT ACGCCAGGGC
TACCTGAAGT ACGGACCCGG CGCTGGGGTA CGCGGCAAGG AACCTTTCGT CGAGGTGCCT
TGGGAGCTCG CGGTGGAACT TGTGTCGGGC GAGCTCACGC GTGTGACCAA GGCATTCGGC
AGCGAGGCGA TCTTTGGTGG TTCGTACGGC TGGAGCAGTG CTGGCCGCTT CCACCACGCA
CAGGGCCAAC TGCACAGATT CCTGAACTGT GCTGGCGGGT ACACCAGCTC GGTCAACACG
TACTCGGTTG CGGCAGGTGA AGTGATCCTC CCGCACGTGC TCGGGCTCGA CCCTCAGGAC
ATGATGTACA ACGGGATGCA ACCGAGTTGG AAGGACATGG CGGAGAACGC AGAGCTCGTG
GTCGCGTTCG GTGGCATGTC GATGAAGAAC CGACAAGTGG GCGCGGGCGG TCCGGTACGG
CACCTGGCGT CGAGCGCGGT CAAGTCGATC TCGGATGCGG GCGTTCGCTT CGTCAATGTC
AGTTTCACAC GCGACGACGC ACCGCCGCTC GACAAGATGA CCCACTTCGC TGTGCGCCCG
TGCACCGACG TGCCGCTGAT GCTCGCGCTC GCTCACACGA TCATCGTGGA AGGACTGTTC
GACGAGCAGT TCGTCGCCCG TTGCACGGTT GGGTTTGACG AGTTCTTGCC CTACCTGATG
GGCGTTTCCG ATGGCATCCC CAAGGACGCG GCGTGGGCCA GTGGTGTGTG CGGATGTAGT
GCTGGGTCGA TCCGCGCACT TGCCCGCGAC ATGGCGCGAT CGCGGACACT CATCACGGCC
GCCATGTCCT TGCAGAGACA AGAGCACGGG GAACAGACCT GGTGGATGGC CGTCGTCCTT
GCGGCCCTGT TAGGGCAGAT TGGTCTTCCC GGACGTGGGA TCGGCTTCGG ATACGCGAGC
CTGAGCCCCG TGGGAAACGA CGAAACGCCA ACGGCATGGC CCCACTTGCC TCAGGGACTC
AACGCGGTGA AGACCTTCAT TCCCGTAGCC CGGATGGCTG ACATGTTGTT GAACCCCGGC
GGCCATCACC AGTACAACGG TCGGGAACTC ACGTTTCCCG ACATCAGACT TGTCTGGTGG
GCAGGCGGCA ACCCGTTCCA CCACCACCAG GATCTCAACC GGCTCATCAA GGCGTGGCAG
AAACCCGAGA CTGTGATCGC CCACGAAGTC TTCTGGAACG CCCACGCACG CCATGCCGAC
ATCGTGTTGC CCGCGACGAC CGCTCTGGAA CGCAATGACC TTGGCTGCGC ACACCTCGAC
CCCCACCTGA TCGCAATGAA GCAAATGTCA GAGCCACTGG GCGAGGCCCA GTCGGACTAC
GCGATCCTGA CCCAGATCGC GCGTGCGGTC GGGATCGCCG ACGAGTACAC CGAGGGTCGC
AACGAGGCCG AGTGGCTCCG TTACCTGTAT AAGCAACTCG AGCACAGCCC TGCCCTCGGT
GGAAAGACCA TTCCGTCCTT CGACGAGTTC TGGGATCAGG GGTGGCTCGA GGTTCCATTC
GACCGCGCCG GGCAACGTGA ACGACTCGGC GAGGCGCTAC GCCGTGATCC GGATGCCAAC
CCCCTCGACA CACCCTCGGG ACGCATCGAG ATCTTCTCCT CGACCATCGA CGGGTTCGGC
TACGCCGACT GCCCAGGCCA CCCCATGTGG CTGGAGCCTC TGGAATGGCC AGGAGCGGAC
ATCGCGGCCC GCTTCCCCCT CTATCTGAGC TCCAACCAGC CGGCGCATCG ACTACACAGC
CAGTACGACC AGGGCGTTGT GAGCGTCGAC GCAAAAGTAG AAGGGCGCGA GCGGATCAGG
TTGAGTATTC AAGACGCGGC AGAACGATCC ATTGAGAACG GAATGATCGT GCGAGTCTTC
AATGATCGCG GCGCCTGTCT CGCCGCAGCA TGGGTAGACG AGGGTCTCGA GGCGGGAGTG
GTTCAGCTCC CCACCGGGGC TTGGTACTTC CCTTCGGTCT CCGACACCGG GCCAATCGAG
AGTCACGGCA ACCCAAATGT TCTGACAGCC GATCGGCCGA CGTCACGCCT AGCCGCAGGA
CCGTCCATCA ATGCCCTCGT GCAGGTTGAA GCCTGGACTT ACCCCCTGCC CGACCTCGCG
CCCTTTGAAG CGCCGCAGTT CGTCGTCCCA CGAACATTCC CCGCGAACGC GTGGGCTCGT
CGCACAGGCC TCGCGCCTCT GTCCGTAATC CCTTCAAGCC GTCAGCCGTA G
 
Protein sequence
MTTFPLTLTH WGAYRMRHVP GGATEALPFE NDPDPSSLGR SMAGAWNSQA RILRPAVRQG 
YLKYGPGAGV RGKEPFVEVP WELAVELVSG ELTRVTKAFG SEAIFGGSYG WSSAGRFHHA
QGQLHRFLNC AGGYTSSVNT YSVAAGEVIL PHVLGLDPQD MMYNGMQPSW KDMAENAELV
VAFGGMSMKN RQVGAGGPVR HLASSAVKSI SDAGVRFVNV SFTRDDAPPL DKMTHFAVRP
CTDVPLMLAL AHTIIVEGLF DEQFVARCTV GFDEFLPYLM GVSDGIPKDA AWASGVCGCS
AGSIRALARD MARSRTLITA AMSLQRQEHG EQTWWMAVVL AALLGQIGLP GRGIGFGYAS
LSPVGNDETP TAWPHLPQGL NAVKTFIPVA RMADMLLNPG GHHQYNGREL TFPDIRLVWW
AGGNPFHHHQ DLNRLIKAWQ KPETVIAHEV FWNAHARHAD IVLPATTALE RNDLGCAHLD
PHLIAMKQMS EPLGEAQSDY AILTQIARAV GIADEYTEGR NEAEWLRYLY KQLEHSPALG
GKTIPSFDEF WDQGWLEVPF DRAGQRERLG EALRRDPDAN PLDTPSGRIE IFSSTIDGFG
YADCPGHPMW LEPLEWPGAD IAARFPLYLS SNQPAHRLHS QYDQGVVSVD AKVEGRERIR
LSIQDAAERS IENGMIVRVF NDRGACLAAA WVDEGLEAGV VQLPTGAWYF PSVSDTGPIE
SHGNPNVLTA DRPTSRLAAG PSINALVQVE AWTYPLPDLA PFEAPQFVVP RTFPANAWAR
RTGLAPLSVI PSSRQP