Gene Ndas_0799 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_0799 
Symbol 
ID9244644 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp982877 
End bp984046 
Gene Length1170 bp 
Protein Length389 aa 
Translation table11 
GC content75% 
IMG OID 
Productdihydroorotate dehydrogenase 
Protein accessionYP_003678749 
Protein GI297559775 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.795428 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGTGT TCTACCAGAT GCTCTTCCGC TCGGTCCTGC GCCACATGGA CGCGGAGAAG 
GTCCACAGGC TCAGCTTCGC CGCGCTGCGC GGCGTGACCT CCCTGCCCGC CGTGGCCTCC
GCCATGAAAG GCGTGCTCGG GCCGCGCGAG CCGGAGCTGA CCGTGCACGC CCTCGGGCAG
GAGTTCCCCG GCCCGCTGGG GCTCGCGGCC GGTTTCGACA AGAACGCGGA GAGCCCCTCC
GGGCTGGCGG CGCTCGGCTT CGGCTTCGTG GAGGTGGGCA CCGTCACCGC CCAGCCCCAG
CCGGGCAACC CGCGGCCGCG CCTGTCCCGG CTGGTGGCCG ACCGCGCGAT CGTCAACCGC
ATGGGCTTCA ACAACGAGGG GTCGGCCCTG GTCGCAGAAC GCCTCCACCA CCGGCGCGGC
GGCCGCCGTC CCGTGCTCGG CGTCAACATC GGCAAGACCA AGGTCACGCC CGAGGAGGAG
GCCCCCGCCG ACTACGCCCT CAGCGCCCGG CGCCTGGCCC GCTACGCCGA CTACCTGGTG
GTCAACGTCA GCTCGCCCAA CACCCCCGGG CTGCGCAACC TCCAGGGCGT GGAGCGGCTG
CGCCCGCTCC TGGCCGCCGT GCGCGAGGCC ATGGCCGAGG CCGGTCGCCC GGACCTGCCC
CTCCTGGTGA AGATCGCCCC CGACCTCGCC GACGAGGACG TCGACGCCGT CGCCGACCTC
GCGCTGGCCG AGGGACTCGA CGGCATCATC GCCACCAACA CCACCATCTC CCGTGAGGGC
CTGACCACCC CCGCGGCGCA GGTGGAGGCG GCCGGTGCGG GCGGCCTGTC CGGCGCCCCC
CTCAAGCGGC GCTCCCTGGA GGTGCTGCGC CGCCTGCGCG CCCGTGTGGG CGACCGGGTG
ACCCTGATCG CCGTCGGCGG CATCGAGACG CCCCTGGACG CCTGGTTCCG CATCCGGGCG
GGGGCCAGCC TCGTGCAGGG CTACACCGGC CTCATCTACG GCGGTCCGCT CTGGCCCCGC
CGCATCAACC GCGGCCTCGC CCGGCTGGTG CGGGCCTCCG GCCACCGTTC CATCAACGAG
GTCGTCGGAG CCGACGTCCC CTCCCCGGCC GCCCCGGCGG CCGACACCGG GCAGGGGGCG
GACCCCGCCG CGGCCACAGC CAAGGGCTGA
 
Protein sequence
MTVFYQMLFR SVLRHMDAEK VHRLSFAALR GVTSLPAVAS AMKGVLGPRE PELTVHALGQ 
EFPGPLGLAA GFDKNAESPS GLAALGFGFV EVGTVTAQPQ PGNPRPRLSR LVADRAIVNR
MGFNNEGSAL VAERLHHRRG GRRPVLGVNI GKTKVTPEEE APADYALSAR RLARYADYLV
VNVSSPNTPG LRNLQGVERL RPLLAAVREA MAEAGRPDLP LLVKIAPDLA DEDVDAVADL
ALAEGLDGII ATNTTISREG LTTPAAQVEA AGAGGLSGAP LKRRSLEVLR RLRARVGDRV
TLIAVGGIET PLDAWFRIRA GASLVQGYTG LIYGGPLWPR RINRGLARLV RASGHRSINE
VVGADVPSPA APAADTGQGA DPAAATAKG