Gene Ndas_2889 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_2889 
Symbol 
ID9246740 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp3449446 
End bp3450465 
Gene Length1020 bp 
Protein Length339 aa 
Translation table11 
GC content72% 
IMG OID 
Productferrochelatase 
Protein accessionYP_003680806 
Protein GI297561832 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00549104 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.445564 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCTACT ACGACGCGTT TTTGCTGATC TCCTTCGGCG GCCCGGAGGG GGAGGACGAG 
GTCATCCCGT TCCTGGAGAA CGTCACCCGG GGCCGCAACA TCCCCCGTGA GCGGCTGGCC
GAGGTCGGCG AGCACTACTT CCTCTTCGGC GGTGTCAGCC CGATCAACCA GCAGTGCCGC
GACCTGCTCG CCGCGGTCAC CGCCGACTTC GCCGCCAACG GGATCGACCT GCCCGTCTAC
TGGGGCAACC GGTTCTGGGC CCCGATGCTC ACCGACACCC TCGCCCGGAT GAAGGAGGAC
GGCGTGCGCC GCATCCTGGC GCTGCCCACC TCCGCGTACT GCAACTGGTC CAGCCGCACC
GCCTACGAGG AGGACATCGC CCGCGCCCGC GCCGCGCTGG GCGAGGACGC CCCCGAGGTG
GACCTCATCC GGCCCTTCTA CGACCACCCG GGGTTCGTCG AACCGCTGGT CGAGCACACC
CGCGCCGCCC TGGACGCCCT GCCCGCGGAG CAGCGCGAGG GCGCGCACCT GCTCTTCTCC
GCGCACTCCA TCCCCACGGC GATGGCCGAG GCCAGCGGCG GTTCCGGACA GGCCTTCGGC
CCCGAGGGGG CCTACGGCGC CCAGCTGGCC GAGGTGGCGC GTCTGGTGGC CGGGCGGCTC
GACCGCGACT ACCCGTACGA GGTCGTCTAC CAGAGCCGCA GCGGCCCGCC CAGCCAGCCC
TGGCTGGAGC CCGACGTCAA CGACCGCATC GAGGAACTGG CCGCCGAGGG CGTGCCCGCC
GTGGTCGTGG TCCCGCACGG CTTCGTCTCC GACCACATGG AGGTCAAGTT CGACCTCGAC
GTGGAGGCCC GCGACACCGC CAAGAAGCTC GGCGTCGGCT ACGAGCGCGC GCTCAGCCCC
GGCACGCACC CCGCGTTCGT GTCCATGGTG ACCGACCTGG TCCGCGAGTA CGCCGAACGG
GCCGAACCCC GGCGCCTGAG CTCGCTCGCG CGCTGCACCG GCTGCTCGTG CCCGGCCTGA
 
Protein sequence
MSYYDAFLLI SFGGPEGEDE VIPFLENVTR GRNIPRERLA EVGEHYFLFG GVSPINQQCR 
DLLAAVTADF AANGIDLPVY WGNRFWAPML TDTLARMKED GVRRILALPT SAYCNWSSRT
AYEEDIARAR AALGEDAPEV DLIRPFYDHP GFVEPLVEHT RAALDALPAE QREGAHLLFS
AHSIPTAMAE ASGGSGQAFG PEGAYGAQLA EVARLVAGRL DRDYPYEVVY QSRSGPPSQP
WLEPDVNDRI EELAAEGVPA VVVVPHGFVS DHMEVKFDLD VEARDTAKKL GVGYERALSP
GTHPAFVSMV TDLVREYAER AEPRRLSSLA RCTGCSCPA