Gene Ndas_2178 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_2178 
Symbol 
ID9246028 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp2601899 
End bp2603074 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content77% 
IMG OID 
ProductUroporphyrinogen III synthase HEM4 
Protein accessionYP_003680106 
Protein GI297561132 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000567169 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000562153 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGAACACCT CCCTGGACGA CGACACCGCG GCCCCGCCCG CCCACCGCCC GGCCGAACCC 
GCCACCACCG CGCCGCTGGC CGGGTTCACC GTCGCCGTCA CCGCCGCCCG CCGCGCCGAG
GAGATCAGCG CCCTGCTGCG CCGCAAGGGC GCCCAGGTCC TGGCCGCACC GGCCCTGCGC
ATCGTGCCGC TCAGCGACGA CCAGCGCCTG GCCTCGGTCT CCGAACAGCT CGCCCGCCGC
CCCGCCGACG TCGTCGTGGC CACCACCGGC ATCGGCTTCC GCGGCTGGGT GGAGGCGTGC
GAGACCTGGG GCACCGTCGA CCCGCTCCTG GCGAGCCTGC GCACCTCGCG CCTGCTGGCC
CGCGGCCCCA AGGCCAAGGG CGCCATCCGC GCCGCCGGAC TCACCGAGGA GTGGTCGCCG
CCCTCGGAGT CCTCCGCCGA GGTCCTGGAC TACCTGCTCG CCCGGGGCGT GCGGGGTCTG
CGCGTGGCCA TCCAGCTGCA CGGCGAACCC CTGCCCGACT TCACCGCCGC CCTGCGCCTG
GCCGGAGCCG ACGTCGTCGA GGTCCCCGTC TACCGCTGGA CCCTGCCCGA GGACACCGCG
CCCCTGGACC GCCTCATCGA GGCCGTCACC AACGGGGGAG TGGACGCGGT CACCTTCACC
AGCGCCCCGG CCGCGGCGGG GCTGCTGGCC CGCGCCCACA CCACCGGCCA CCAGGCCGCC
CTGGTCCGGG CCCTGCGCGG CGACGTCCTG GCCATGTGCG TGGGAGCGGT CACCGCACGC
CCCCTCATGG CCCACGACAT CCCCACCGTG TGGCCCCAGC GCGCCCGCGT CGGCGCCCAG
GTCCGCGCGC TCGCCGAGGA GCTGCCCGCA CGCTTTCCGA CCCTGTCCGT GGCCGGACAC
CGCCTGCGCC TGCGCGGCCA CGCCGTCCTG GTCGACGGCA CCGTGCGCAC CCTCTCGCCC
ACCCTGATGC GGGTGCTGCG CGAGCTGGCG CGCAGGCCCG GCCAGGTCCT GGACCGCACC
CGCCTGCTCA CCTGCCTGGG CGAGGACGCC GACGCCCACG CCGTGGAGAC GGCCGTGGCC
CGGCTGCGCA CCGCGCTGGG CGACCCCCGC ATCATCCAGA CCGTGGTCAA ACGCGGCTAC
CGGCTGGCCC TGGACCCGGC CGAACGCACC CTCTGA
 
Protein sequence
MNTSLDDDTA APPAHRPAEP ATTAPLAGFT VAVTAARRAE EISALLRRKG AQVLAAPALR 
IVPLSDDQRL ASVSEQLARR PADVVVATTG IGFRGWVEAC ETWGTVDPLL ASLRTSRLLA
RGPKAKGAIR AAGLTEEWSP PSESSAEVLD YLLARGVRGL RVAIQLHGEP LPDFTAALRL
AGADVVEVPV YRWTLPEDTA PLDRLIEAVT NGGVDAVTFT SAPAAAGLLA RAHTTGHQAA
LVRALRGDVL AMCVGAVTAR PLMAHDIPTV WPQRARVGAQ VRALAEELPA RFPTLSVAGH
RLRLRGHAVL VDGTVRTLSP TLMRVLRELA RRPGQVLDRT RLLTCLGEDA DAHAVETAVA
RLRTALGDPR IIQTVVKRGY RLALDPAERT L