Gene Ndas_4111 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4111 
Symbol 
ID9247985 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4908002 
End bp4909945 
Gene Length1944 bp 
Protein Length647 aa 
Translation table11 
GC content72% 
IMG OID 
Productmolybdopterin oxidoreductase 
Protein accessionYP_003682013 
Protein GI297563039 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.191585 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGATCCG ACGAACCCTC CGCACCCCTC ACCCGACTCA CCCGACCCCT CGTCCGTGAC 
GGCGGCGAGC TGCGCCCGGC CTCCTGGGAG GAGGCCCTGG ACCGCGCGGC CGCCGGGTTC
CGGCGCGGGG TGGCCGAGCA CGGCCCCAAC TCCTTCGGCA TGCTCTCCTG CGCCCGCGCC
ACCAACGAGA TGAACTTCAT GGCGCAGAAG TTCACCCGCG TCGTCGTCGG CACCAACAAC
GTCGACTCGT GCAACCGCAC CTGCCACGCG CCCAGCGTCG CGGGACTGGC CGCGGCCTTC
GGCTCGGGCG GGGGCACCTC CTCCTACACC GAGATCGAGG ACACCGACCT CATCGTGATC
TGGGGCGGCA ACCCGCGCTT CGCCCACCCG ATCTTCTTCC AGCACGTGCT CAAGGCCGTG
CGGCGCGGGG CGCGCCTGTT CGTCGTGGAC CCCCGGCGCA CCCCCACCGC CGAGTGGGCC
GACCGCTGGC TGGGCCTGAA CGTGGGCACC GACATCCCCC TGGCGCACGC CATCGGCCGC
GAGATCCTGC ACGCCGGGCT GGCCAACGAC ACCTTCGTCC AGCGCGCCAC CACCGGTCTG
GAGGAGTACC GGGCGCTGGT CGAACCGTGG ACGCTGGCCG CCGCCGAGGC CGAGACCGGC
GTGCCCGCCG AGGCCATCCG CGAGCTGGCG CACGCCTACG CCCGCGCCGA GCGCGCCCAG
ATGTGCTGGA CGCTGGGCAT CACCGAGCAC CACAACGCCA CCGACAACGT CCGTTCGCTG
ATCAACCTGT CCCTGCTCAC CGGGCACGTG GGGCGGTACG GCTCCGGGCT CAACCCGCTG
CGCGGCCAGA ACAACGTGCA GGGCGGCGGC GACATGGGCG CCATCCCCGA CCGGCTGGTC
GGGTTCCAGG ACATCCTGGA CGCCCGGGTG CGCGCGCCCT TCGAGGCCGC CTGGGGACGC
GAGATCCAGC CCGTCAAGGG CCTGAACCTG ACCCAGATGT TCGAGGCGAT GGACGAGCGC
GAGCTCAGGA CGCTCTACGT GATCGGGGAG AACCCCGTCC AGTCCGAGCC CGAGACGCAC
AAGACCACCC GACGCCTGCG CGGCCTGGAC CACCTGGTCG TGCAGGACAT CTTCCTGACC
CGCACCGCCG AACTGGCCGA CGTGGTGCTC CCGGCCAGCG CGTCCTGGTG CGAGTCCGAC
GGCACCTTCA CCAACAGCGA GCGCCGCGTG CAGCTGGTGC GCAGGGCGCT GGACCCGCCG
GAGGGCGCCC GCGACGACAT CGAGATCATG TGCGACCTGG CCACCCGGCT GGGCCACGAC
TGGACGCGGC CCTCGGCCGA GGAGATCTGG GACGAGGTCC GCTCGCTCTC CCCCATGCAC
CGGGGCATGA GCTACGCGCG CCTGGCCGAG CTGCACGGCA TCCAGTGGCC GTGCTACTCC
GAGGACACCG TGGAGCCGAG CTACCTGCAC GCGCGGCTGT GGTCGGAGGA CCCCGCCGAG
CGCGGCGAAC CCGCGCCGTT CGGGATCGTG GGCCACTCGC CGCCGGTGGA CCTGCTGGAC
GAGGACTTCC CCTTCCGCCT GACCACCGGC CGACGGCTGG ACGACTACAA CACCGGCGTG
CAGACCAGCG GGTTCTCCTC GCCGCTGCGC CGGGGCGAGC TGCTGGACCT GTCCCCCGAG
GACGCGGCCA AGCTCGGCGT CGCCGACGGG GAGACGGTCC GGGTCACCTC CCGGCGGGGC
TCGGTGCTGG TCCCGGTCTC GGTCACCGAG GCCATGCGGC CCGGCCTGGT GTTCATGACC
TTCCACTTCC CCGACCAGGT GGACACCCAG CTGCTGACCA TCGACGCCAC GGACCCCATC
GCGGGGACCG CGGAGTACAA GGCCGCCGCC GTGCGGATCG AACGGGTGGA GCGCTCCGCC
CGCCAGACCG CCGCCGCGGG CTGA
 
Protein sequence
MRSDEPSAPL TRLTRPLVRD GGELRPASWE EALDRAAAGF RRGVAEHGPN SFGMLSCARA 
TNEMNFMAQK FTRVVVGTNN VDSCNRTCHA PSVAGLAAAF GSGGGTSSYT EIEDTDLIVI
WGGNPRFAHP IFFQHVLKAV RRGARLFVVD PRRTPTAEWA DRWLGLNVGT DIPLAHAIGR
EILHAGLAND TFVQRATTGL EEYRALVEPW TLAAAEAETG VPAEAIRELA HAYARAERAQ
MCWTLGITEH HNATDNVRSL INLSLLTGHV GRYGSGLNPL RGQNNVQGGG DMGAIPDRLV
GFQDILDARV RAPFEAAWGR EIQPVKGLNL TQMFEAMDER ELRTLYVIGE NPVQSEPETH
KTTRRLRGLD HLVVQDIFLT RTAELADVVL PASASWCESD GTFTNSERRV QLVRRALDPP
EGARDDIEIM CDLATRLGHD WTRPSAEEIW DEVRSLSPMH RGMSYARLAE LHGIQWPCYS
EDTVEPSYLH ARLWSEDPAE RGEPAPFGIV GHSPPVDLLD EDFPFRLTTG RRLDDYNTGV
QTSGFSSPLR RGELLDLSPE DAAKLGVADG ETVRVTSRRG SVLVPVSVTE AMRPGLVFMT
FHFPDQVDTQ LLTIDATDPI AGTAEYKAAA VRIERVERSA RQTAAAG