Gene Ndas_4370 details

Gene Information

Locus tag: Ndas_4370
Symbol:
ID: 9248245
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 5204160
End bp: 5205848
Gene Length: 1689 bp
Protein Length: 562 aa
Translation table: 11
GC content: 75%
IMG OID:
Product: Uroporphyrin-III C/tetrapyrrole (Corrin/Porphyrin) methyltransferase
Protein accession: YP_003682265
Protein GI: 297563291
COG category:
COG ID:
TIGRFAM ID:
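
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the Gene Information fields above.
START_BP = 5204160
END_BP = 5205848


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Residue count of a CDS, assuming its final codon is a stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1689
print(protein_length(length_bp))  # 562
```

Both results agree with the listed Gene Length (1689 bp) and Protein Length (562 aa).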


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAACGCCA ACGCCAACCC CCAGCCGGAG GAGCCGCGCG CCGCAGCGCG GGCCGGGCAC 
GTCGCCCTGG TCGGTTCCGG TCCGGGCGGC GCCGACCTGC TCACCCTGCG CGGCGCCGAG
CTGCTGAGCC ACGCCGACGT GGTCATCACC CTGCGCGAGC CCGCCGCCGA GGAGCTCTCG
GGCTTTCGCG CCGAACTCCT CTCCCACTGC TCCGAGGACG TCTCCGTGAT CGAGGCGGGC
GAGTGCGGCG GCGACGTCAA CGCCCTGGCG GTGGCGCGCG CCCGCGACGG GCAGCGGGTG
GTCCGCCTCT ACTCCGGCGA CCCGTTCTTC GGCTGCCGCG GCGCCGAACT GGTCTCCGCC
TGCCACGAGG CCGGGGTCGA GGTCGAGGTC GCCCCCGGCG TCTCCGCGAT CACCTCCGTG
CCGACCTTCG CGGGCGTGCC GCTGCTGGAC GCGGACAGCC CCGAGGTGCG CGTGCTCAAC
GCCCGCGCGC ACGGCGGGGG AGGGGTCGAC TGGCACGAGG CCGCCGCCAG CGGCGCCACC
CTGGTCGTCA TGGGCGGCGA CGCGCCCGCC GGTGAGCCGG TCGAGCACCA GGGACCCGGC
TTCGACGTGC TGTGCAAGAC CCTCATCGCC GGAGGGCGGC CCGCCTCGAC CCCGGTCGCG
GTCGTGCGCG CGGGCGGCAC CACCCGTCAG ACCACGGTCT CCTCGACCCT GGGGCGGCTC
GTCGCCGACC TCAAGTCCAA GGACGCCAAG GGCCACCACG TCACCGCGCC CGCGCTGATG
GTCGTCGGCC CCGCCGCGGG CCGCCACCCC GAGCTGTCCT GGTACGAGAG CCGTCCGCTG
TTCGGCTGGC GCGTCCTGGT GCCCCGCACC AAGGAGCAGG CCGCGGCCCT GTCCGACCAG
CTGCGCGGCT ACGGCGCGGT GCCCGAGGAG GTGCCCACCA TCTCCGTGGA GCCGCCGCGC
ACCCCGCAGC AGATGGAGCG CGCCGTCCGC GGCCTGGTCA CCGGCCGCTA CCAGTGGGTG
GCCTTCACCT CCGTCAACGC GGTCCGCGCC ATCCGGGAGC GCCTGGAGTC CTACGGCCTG
GACGCGCGCG CGTTCGCCGG GGTCAAGGTC GCCGTCGTCG GCGAGGCCAC CGCGCGCGCC
GTGCGCGAGT TCGGCATCCA GCCCGACCTG GCCCCGCCCG AGGAGGAGCA GTCCAGCTCG
GGCCTGGTCT CGGTGTGGCC GCCCTACGAC GCCGAGATCG ACCCGATCGA GCGGGTCCTG
CTGCCGCGCG CCGACATCGC CACCGAGACC CTGTCCGCCG GGCTGGACAA GCTCGGCTGG
GAGGTCGACG ACGTCACCGC CTACCGCACC GTGCGCGCCG CGCCCCCGCC CGCGCCCGTC
CGGGAGGCGA TCAAGGGCGG CGGCTTCGAC GCGGTGCTGT TCACGTCCTC CTCCACGGTG
CGCAACCTGG TGGGGATCGC GGGCAAGCCG CACAACACCA CCGTCATCGC CGTCATCGGT
CCCGAGACGG AGAGGACCGC GATCGAGTTC GGCCTGCGCG TCGACGTCGT GGCGCCCAAA
GCCTCGGTTT CCGCCCTCGC ACAGGCCCTT TCGGAGTACG GTGCCGAGAA GAGGCGCGAG
GCGGTCGAGG CGGGCAAGCC CGTCCTCAAG CCCAGTCAGA AGAGACGCGG TCGCCGCCGC
AAGCTCTGA
 
Protein sequence
MNANANPQPE EPRAAARAGH VALVGSGPGG ADLLTLRGAE LLSHADVVIT LREPAAEELS 
GFRAELLSHC SEDVSVIEAG ECGGDVNALA VARARDGQRV VRLYSGDPFF GCRGAELVSA
CHEAGVEVEV APGVSAITSV PTFAGVPLLD ADSPEVRVLN ARAHGGGGVD WHEAAASGAT
LVVMGGDAPA GEPVEHQGPG FDVLCKTLIA GGRPASTPVA VVRAGGTTRQ TTVSSTLGRL
VADLKSKDAK GHHVTAPALM VVGPAAGRHP ELSWYESRPL FGWRVLVPRT KEQAAALSDQ
LRGYGAVPEE VPTISVEPPR TPQQMERAVR GLVTGRYQWV AFTSVNAVRA IRERLESYGL
DARAFAGVKV AVVGEATARA VREFGIQPDL APPEEEQSSS GLVSVWPPYD AEIDPIERVL
LPRADIATET LSAGLDKLGW EVDDVTAYRT VRAAPPPAPV REAIKGGGFD AVLFTSSSTV
RNLVGIAGKP HNTTVIAVIG PETERTAIEF GLRVDVVAPK ASVSALAQAL SEYGAEKRRE
AVEAGKPVLK PSQKRRGRRR KL
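
The gene and protein sequences above are related through translation table 11, and the GC content is the fraction of G and C bases in the gene sequence. The sketch below shows one way to reproduce both from the sequence text as printed, including the spaces between blocks of ten. The function names are hypothetical, and the start-codon set is a simplifying assumption: it lists only ATG, GTG and TTG, and reads any of them as methionine when it is the first codon.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; the amino-acid
# assignments for table 11 are the same as in the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a simplified set of start codons for illustration only.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove whitespace (block spacing, line breaks) and upper-case."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residues.append("M")  # an initiating codon is read as Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases of the gene sequence, pasted with their block spacing.
print(translate_cds("GTGAACGCCA ACGCCAACCC CCAGCCGGAG"))  # MNANANPQPE
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate_cds` and `gc_fraction` would give values to compare against the listed protein sequence and GC content.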