Gene Ndas_0232 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_0232 
Symbol 
ID9244066 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp288456 
End bp289979 
Gene Length1524 bp 
Protein Length507 aa 
Translation table11 
GC content77% 
IMG OID 
ProductMg chelatase, subunit ChlI 
Protein accessionYP_003678188 
Protein GI297559214 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCTCG CCCGCACCCG CTGCATCGCC CTGCTGGGCG TCCAGGGGCA CATCGTCGAG 
GTGGAGGTGC ACCTGGGCGG CGGCGACTTC GGGGTGACCC TGGTGGGGCT GCCCGACGCC
GCGCTCCGCG AGGCCAGGGA CCGCATCCGC GCCGCCGTCG TCAACAGCGG CGAGGAGTGG
CCCAGCGGCC AGATCGTCAT CAGCCTGTCC CCGGCCAGCC TGCCCAAGTC CGGGAGCCTG
TTCGACCTGG CCATCGCCGC CGCCGTCCTC GCCGCGGCGG GCTCGGTGCC GGTGGAGCGG
CTGGAGGACA GCGTCCTCAT CGCGGAACTC GGCCTGGACG GCCGGGCACG GCCGGTGCTG
GGGGTCCTCC CGGCGGTCGT GTCGGCCGCC GAGCAGGGCT ACCGCAGGTT CGTCGTCGCG
CGGGGGAACG CCGCCGAGGC CCGGCTGGTG CCCGGCGCCG AGGTGACGGC GGTGGACAGC
CTGATGGACC TGTGCCAGTG GCTGCGCGGG GAGTACTTTC CCGAGGCGCA CGAGGCGGAG
GCCGTGCCCC GTCCCCGGGA GCCGCGCGGC CAGGGGCCCG ACCTGTCCGA CGTCCTGGGG
CAGCCGGTGG CCCGCCGCGC GGTGGAGATC GCCGCGGCCG GCGGCCACAA CCTCATGATG
CTGGGGCCTC CCGGCACCGG CAAGAGCCTG CTCGCCGAAC GCCTGCCCAC CGTGCTGCCC
CCGCTCAGCC CCGCCGAGGC GCTGGAGGCC ACCGCGATCC ACTCGGTGGC GGGCATGCTG
CCGCCCGGCG CGCCCCTGGT CACCGCGCCC CCCTTCGCGG CTCCGCACCA CACCTCCACC
CGGGCCTCGA TCATCGGCGG GGGCAGCGGC TACCCCACGC CCGGCTGGGT GTCCAAGGCC
CACCGCGGCG TGCTCTTCGT CGACGAGGCC CCGCAGTTCG GGCGCGGGGT GCTGGACTCC
CTGCGCGAAC CCCTGGAGCG CGGCGAGGTG GTCCTGGCCC GCGCCTCCTC GACGGTGACC
TTCCCGGCCC GCTTCCAGCT GGTGATGGCC GCCAACCCCT GCCCCTGCGC CAAACCCGGC
GCCCTGTGCA CCTGCCCCGC CGGTGAGCGG CGCCGCTACT TCTCCCGCCT GTCCGGGCCC
CTGCTGGACC GGATCGACCT CAAGGTGGAG CTGCAACCGG TGTCGAGGGC CGAGCTGCTC
GCCGACCGCG CCTTCGCCGA GTCCTCCGAG GTGGTGGCCG CCCGGGTGGA GAAGGCCCGC
GCCCGCGCCG CCGAGCGGCT GGCGCACACG CCCTGGACCA CCAACGCGGC CATCCCCGGG
GCGCAGCTGC GCCGGGAGTT CCCGGTGGAG ACCGCCGCGC TGCGGGTGCT GGGCAGAGCG
ATGGACCTCG GACAGATCAG CGCCCGGGGG GTGGACCGCG CCCTGCGGGT CGCCTGGACC
CTGGCCGACC TCGCCGACCG GGACCGCCCC GGCGAGGAGG AGGCGGCCTA CGCCTTCGCG
CTCTGGGCGG GGCGCGCGTG GTGA
 
Protein sequence
MTLARTRCIA LLGVQGHIVE VEVHLGGGDF GVTLVGLPDA ALREARDRIR AAVVNSGEEW 
PSGQIVISLS PASLPKSGSL FDLAIAAAVL AAAGSVPVER LEDSVLIAEL GLDGRARPVL
GVLPAVVSAA EQGYRRFVVA RGNAAEARLV PGAEVTAVDS LMDLCQWLRG EYFPEAHEAE
AVPRPREPRG QGPDLSDVLG QPVARRAVEI AAAGGHNLMM LGPPGTGKSL LAERLPTVLP
PLSPAEALEA TAIHSVAGML PPGAPLVTAP PFAAPHHTST RASIIGGGSG YPTPGWVSKA
HRGVLFVDEA PQFGRGVLDS LREPLERGEV VLARASSTVT FPARFQLVMA ANPCPCAKPG
ALCTCPAGER RRYFSRLSGP LLDRIDLKVE LQPVSRAELL ADRAFAESSE VVAARVEKAR
ARAAERLAHT PWTTNAAIPG AQLRREFPVE TAALRVLGRA MDLGQISARG VDRALRVAWT
LADLADRDRP GEEEAAYAFA LWAGRAW