Gene Ndas_3104 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3104 
Symbol 
ID9246960 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp3718479 
End bp3719780 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content75% 
IMG OID 
Productdihydroorotase, multifunctional complex type 
Protein accessionYP_003681019 
Protein GI297562045 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0529728 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCGACA CCCACGGACC GCACCTGATC CGCGGCGCCC GCCCCCTCGG CGGCGACCCC 
GTCGACCTGC TCCTGGCCGA CGGCCGGATC GCCGCGACCG GCCGGGACCT GGACGCCCCC
GAGGGCGCCC GGACCGTCGA CGCCACCGGC CTGATCGCCC TGCCCGGTCT GGTGGACCTG
CACACCCACC TGCGCGAGCC CGGCCGCGAG GACGCCGAGA CGGTCGCCAG CGGCTCCCGC
TCCGCCGCCA TGGGCGGCTA CACCGCCGTC CACGCCATGG CCAACACCGA CCCCGTCGCC
GACACCGCCG GGGTCGTCGA ACAGGTCTGG CGCCTGGGCC GCGAGGCCGG GTACTGCGAC
GTGCGCCCCG TCGGCGCCGT CACCGTCGGC CTGGCCGGGG AGCGCCTCTC CGAGATCGGC
GCCATGGCCG ACTCCGCCGC GGGCGTGCGC GTCTTCTCCG ACGACGGCAT CTGCGTCTCC
GACGCCCTGC TCATGCGCCG CGCCCTGGAG TACGTCAAGG CCTTCGACGG CGTCATCGCC
CAGCACGCCC AGGAGCCGCG CCTGACCCAG GGCGCCCAGA TGAACGAGGG CTCCGTCTCC
GACCGCCTCG GCCTGCCCGG CTGGCCCGCG GTCGCCGAGG AGGCGATCAT CGCCCGCGAC
TGCCTGCTCG CCCAGCACGT GGGCTCGCGC CTGCACGTGT GCCACGTCTC GACCAGGGGC
TCGGTCGACA TCATCCGCTG GGCCAAGGCC CGCGGCTGCG ACGTCACCGC CGAGGTCACA
CCCCACCACC TCCTGCTGAC CGAGGAGCTG GTGGAGGGCT ACGATCCCGT CTACAAGGTC
AACCCGCCCC TGCGCACCGC CGAGGACGTC CAGGCGCTGC GCGAGGGCCT GGCCGACGGC
ACCATCGACA TCGTCGCCAC CGACCACGCC CCCCATCCCG CGGAGGCCAA GGAGACCGAG
TGGTCCACCG CCGCCATGGG AATGGTCGGG TTGGAGACGG CGTTGGCGGT CGTGCAGCGC
ACGATGGTCG ACACCGGCCT GCTGACCTGG GCCGACGTCG CCGACCGCAT GTCCGCGGCA
CCGGCCCGCA TCGGCCGGGT CCACGACCAC GGGCGTGACC TGGCGGTCGG CCAGCCCGCC
AACGTGGTCC TGTACGACCC GGCCGCCGCC GTCGACGTGG ACGGCGCGGC CATGGTCAGC
AAGAGCGGCA ACACCCCCTT CCGGGGCATG ACCCTGCCCG GCCGGGTGCG CGCCACGTTC
CTGCGCGGCA CCCCGACCGT CCTGGAGGGG AAGATCCAGT GA
 
Protein sequence
MPDTHGPHLI RGARPLGGDP VDLLLADGRI AATGRDLDAP EGARTVDATG LIALPGLVDL 
HTHLREPGRE DAETVASGSR SAAMGGYTAV HAMANTDPVA DTAGVVEQVW RLGREAGYCD
VRPVGAVTVG LAGERLSEIG AMADSAAGVR VFSDDGICVS DALLMRRALE YVKAFDGVIA
QHAQEPRLTQ GAQMNEGSVS DRLGLPGWPA VAEEAIIARD CLLAQHVGSR LHVCHVSTRG
SVDIIRWAKA RGCDVTAEVT PHHLLLTEEL VEGYDPVYKV NPPLRTAEDV QALREGLADG
TIDIVATDHA PHPAEAKETE WSTAAMGMVG LETALAVVQR TMVDTGLLTW ADVADRMSAA
PARIGRVHDH GRDLAVGQPA NVVLYDPAAA VDVDGAAMVS KSGNTPFRGM TLPGRVRATF
LRGTPTVLEG KIQ