Gene Ndas_4104 details


Gene Information

Locus tag: Ndas_4104
Symbol:
ID: 9247978
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 4901089
End bp: 4902435
Gene Length: 1347 bp
Protein Length: 448 aa
Translation table: 11
GC content: 75%
IMG OID:
Product: Coproporphyrinogen dehydrogenase
Protein accession: YP_003682006
Protein GI: 297563032
COG category:
COG ID:
TIGRFAM ID:
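
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 4901089
END_BP = 4902435
GENE_LENGTH_BP = 1347
PROTEIN_LENGTH_AA = 448


def span_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for the values listed in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 4902435 - 4901089 + 1 = 1347 bp, and 1347 / 3 - 1 = 448 aa, which agrees with the listed gene and protein lengths.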


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.104304
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.524988
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACCACCA CCCTGACCCC GCGCCCCGAA CCGGTGCGCG CCGACTCCCC CTACCAGTCC 
TACGTCTACG CCTACCCGCA CAAGAGCGCC TACCGGCCCT TCACCGAGCG CCCCGCGCTC
GCCGACCTGT GGCGCGGCGA GGACGTCGGC GCGCTGTCGC TGTACGCGCA CATCCCGTTC
TGCGAGATGC GCTGCGGGTT CTGCAACCTG TTCACCCGCT CCACTCCCCC GGCCGAGCAG
GTCACCGCCT ACCTGGACGC GCTGGAGCGC CAGGCGGAGG CGGTGGCCGG GGCGCTGCCG
GAGGGCGCCG CCTTCGCCCG GGCCGCCCTG GGCGGCGGCA CCCCCACCTA CCTGACCGCC
GAGGAGCTCA CCCGGGTCTA CGACCTCACC GAGAGCGCCT TCGGGGTGGA CCTGTCGGCG
ATCCCGGTGT CGGTGGAGAC CTCTCCGGCC ACGGCCACGC CCGACCGGCT GGCGGTGCTC
GACGCGCGCG GCGCCACCCG GATCAGCATG GGCGTGCAGA GCTTCCTGGA CGCCGAGGCG
CACGCGGCCG GGCGTCCGCA GAAGCGCGCC GAGGTGGACC GGGCGCTGGC CGCGATCCGC
GAGCACGCCT CGGCCGACCT CAACCTCGAC CTCATCTACG GCATCGACCG CCAGGACGCC
CGCACCTGGG CCTACTCCCT GGACACGGCG CTGGAGTGGG AGCCCGAGGA GGTCTACCTG
TACCCGCTGT ACGTGCGCCC GCTCACCGGG CTGGGCCGCC GCGCCCGCGC GTGGGACGAC
CACCGGCTGG GCCTGTACCG GCAGGGCCGC GACCACCTGC GCGAACGCGG CTACGAGCAG
GTGTCCATGC GCATGTTCCG GCGGGCGGAC GCCCCGAAGA CGCAGGCCCC GGACCACTCC
TGCCAGACCG ACGGCATGGT GGGGCTGGGC TGCGGGGCGC GGTCCTACAC CTCCGCCGCG
CACTACTCCT TCGACTACGC CGTGGGCGTG GGGCAGGTGC GGTCGATCAT CGCCGACTAC
ACGAGCCGCC GCCAAGCGGA CTTCGGACGG GCCGAGGTGG GGTTCCGCAT GGACGAAGGC
GAGCGGCGCC GCCGCCACCT GCTCCAGTCG CTGCTGCTCG CGGAGGGGAT GGACACCGCC
GCCTACGCCG ACCGGTTCGG CTCCCGCCCC GAGGAGGACT TCGCCGCGAC CCTGGCGGTG
CTCGACGGGC GCGGCTGGCT GGAGCGGGAC GGCGCGCCGG ACCTGCTGCG GCTGACCCCG
GAGGGGCTCG CGCACTCCGA CGCGGTGGGG CCGATGTTCT TCTCCGCCGG GGTGGCGGCC
CTGATGGCCG ACTACGAGGC CCGGTGA
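
The GC content reported for this gene can be recomputed from a nucleotide sequence laid out as above. This is a minimal sketch, assuming the value is the plain fraction of G and C bases; `gc_fraction` is an illustrative helper, and only the first row of the gene sequence is used as sample input.

```python
def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence.

    Whitespace (the 10-base grouping and line breaks) is ignored.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First row of the gene sequence above, used as sample input.
FIRST_ROW = "GTGACCACCA CCCTGACCCC GCGCCCCGAA CCGGTGCGCG CCGACTCCCC CTACCAGTCC"

if __name__ == "__main__":
    print(f"{gc_fraction(FIRST_ROW):.0%}")
```

Passing the full gene sequence instead of `FIRST_ROW` gives the figure for the whole gene.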
 
Protein sequence
MTTTLTPRPE PVRADSPYQS YVYAYPHKSA YRPFTERPAL ADLWRGEDVG ALSLYAHIPF 
CEMRCGFCNL FTRSTPPAEQ VTAYLDALER QAEAVAGALP EGAAFARAAL GGGTPTYLTA
EELTRVYDLT ESAFGVDLSA IPVSVETSPA TATPDRLAVL DARGATRISM GVQSFLDAEA
HAAGRPQKRA EVDRALAAIR EHASADLNLD LIYGIDRQDA RTWAYSLDTA LEWEPEEVYL
YPLYVRPLTG LGRRARAWDD HRLGLYRQGR DHLRERGYEQ VSMRMFRRAD APKTQAPDHS
CQTDGMVGLG CGARSYTSAA HYSFDYAVGV GQVRSIIADY TSRRQADFGR AEVGFRMDEG
ERRRRHLLQS LLLAEGMDTA AYADRFGSRP EEDFAATLAV LDGRGWLERD GAPDLLRLTP
EGLAHSDAVG PMFFSAGVAA LMADYEAR
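
The relationship between the gene sequence and the protein sequence can be sketched with a small translator for translation table 11. This is a hedged illustration and not the database's own tooling: the codon assignments and alternative start codons follow the NCBI table 11 definition as I understand it, and `translate_cds` is a hypothetical helper. Note that the gene begins with GTG, which is read as M when it is the initiation codon.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); table 11 uses the same
# codon assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons listed for table 11 (assumption based on the NCBI definition).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(sequence: str) -> str:
    """Translate a CDS, reading a valid start codon as M and stopping at '*'."""
    bases = "".join(sequence.split()).upper()
    if len(bases) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(bases), 3):
        codon = bases[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # initiation codon is translated as methionine
        if aa == "*":
            break  # stop codon ends translation
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First row of the gene sequence; the result matches the first 20
    # residues of the protein sequence above.
    first_row = "GTGACCACCA CCCTGACCCC GCGCCCCGAA CCGGTGCGCG CCGACTCCCC CTACCAGTCC"
    print(translate_cds(first_row))  # MTTTLTPRPEPVRADSPYQS
```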