Gene Ndas_3699 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3699 
Symbol 
ID9247568 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4437326 
End bp4438735 
Gene Length1410 bp 
Protein Length469 aa 
Translation table11 
GC content71% 
IMG OID 
Productmetal dependent phosphohydrolase 
Protein accessionYP_003681603 
Protein GI297562629 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.849083 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCATGG GTCCCTACTC GGGCCTGGAT CTGCGCACAC TGGTCCTGAT CGCGATTCTC 
TTCGTGGTCG CGGAGTCGAT CAGTACGGCG GTCCTGACGG GCGTCAAGGC GGGGATCTCG
CCGAGTTCGG CGGTCTCCCT GGCCGCCGTC GTCCTGGTCG GACCGGTCGG GGCGGCGGTG
GTCGGTTTCG CCTCCTGCTT CGCCATCCGC AGGCAGAACC TCGTCAAGAG GGCGTTCAAC
GGAGCACAGT TCGCACTGGC GGCCTACGCG GCCGGACACG TCTACCTTCT GCTCGGCGGA
ACGGTGGGAG AACCCGGCCG GGAGGACTTC CCGGCCATCG TCCTCCCCTA CGTCGCCGCC
GTCCTGACCC ACTCCCTGGT CAACTCGGTG CTCGTGGCCG TCCTGATGTG GACCCTGGGC
GTCGTGCGCC CCGGCGGCTC GCGTGGCCGG TTCGACTGGC GTCCGCTGGT GGGGGTCGAG
CTGTCCTCCA TGGGCTACCA GATGCTCGGC CTGGCCATCG CCGCGGTCTG GGGCGGGGTC
GGCCTCATCG CGCCGCTGCT GGTGCTGCTC CTGCTGTTCA TCGCGCGCTG GACCTTCGCC
CAGCAGCTGG ACGAGGCGCG CGCGCACGAG GCCACCCTGG CCACCCTGTG CCAGGCGGTG
GAGACCAAGG ACTACTACAC CCGGGGCCAC TGCATGCGCG TGGCCGAGGG CGCCGCCATG
ATCGCACGCG AACTGGGCAT GCCCGCCGAC CGGGTGCAGA AGATCCGCTA CGCCGGGATG
CTGCACGACA TCGGCAAGCT CGGAGTGCCC ACCAAGGTGC TGCAGAAGAC CGGCAAGCTC
ACCGACGACG AGTACGCGGC CATCAAGCTG CACCCCACGC GCGGCTACGA GATCGTGCGG
GAGATCAGCT TCCTGGACGA GGCGCTGGCC GGGATCCGGC ACCACCACGA GCGCCTGGAC
GGGCGCGGGT ACCCCATGGG CCTGGTCGGG ATGGAGATCC CCGAGTCCGC GCGCATCATC
AGCGTCGCCG ACGTCTTCGA CTGCCTCACC TCCACGCGCT CCTACCGCAG GGCGTGGTCG
GTGGAGGACG CCGTCGCCGA ACTGCGGGCC TGCGCCGGGA CCCAGTTCGA CCCCAGGATG
GTCGAGGCGC TCGTGCGCGC CGTCGAACGC GAGGGCTGGG AGACACCCGA CATCGCCGAG
CCTCCGGGCG GCTGCTACGG GTCGGAGGCG GCCGAACGCT CCGTGGAGGG GTCCGGCCGA
GCCGGGGAAC CGGAGGAGGT GCCCGCCGAC GCGTCCGCCG AGGAGAGCGC CGCCGGGGAG
CCGTCCTCCG CAGGGGCGGC GGCCGACGAG GCCGCCTCCG CGGACGCCAC GGCCTCCGCC
GAGGCAGTGA CGGGCGGGCG TGAACGATGA
 
Protein sequence
MLMGPYSGLD LRTLVLIAIL FVVAESISTA VLTGVKAGIS PSSAVSLAAV VLVGPVGAAV 
VGFASCFAIR RQNLVKRAFN GAQFALAAYA AGHVYLLLGG TVGEPGREDF PAIVLPYVAA
VLTHSLVNSV LVAVLMWTLG VVRPGGSRGR FDWRPLVGVE LSSMGYQMLG LAIAAVWGGV
GLIAPLLVLL LLFIARWTFA QQLDEARAHE ATLATLCQAV ETKDYYTRGH CMRVAEGAAM
IARELGMPAD RVQKIRYAGM LHDIGKLGVP TKVLQKTGKL TDDEYAAIKL HPTRGYEIVR
EISFLDEALA GIRHHHERLD GRGYPMGLVG MEIPESARII SVADVFDCLT STRSYRRAWS
VEDAVAELRA CAGTQFDPRM VEALVRAVER EGWETPDIAE PPGGCYGSEA AERSVEGSGR
AGEPEEVPAD ASAEESAAGE PSSAGAAADE AASADATASA EAVTGGRER