Gene Ndas_4240 details

Gene Information

Locus tag: Ndas_4240
Symbol:
ID: 9248114
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 5056229
End bp: 5057392
Gene Length: 1164 bp
Protein Length: 387 aa
Translation table: 11
GC content: 55%
IMG OID:
Product: DNA-cytosine methyltransferase
Protein accession: YP_003682137
Protein GI: 297563163
COG category:
COG ID:
TIGRFAM ID:
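
The gene and protein lengths listed above follow from the start and end coordinates. The short Python sketch below shows that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon. The record does not state either assumption, and the variable names are illustrative.

```python
# Coordinates as listed in the record (assumed 1-based, inclusive).
start_bp = 5056229
end_bp = 5057392

# An inclusive interval contains end - start + 1 positions.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the last codon is assumed
# to be a stop codon, which adds no residue to the protein.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp")     # 1164 bp
print(protein_length, "aa")  # 387 aa
```

Both values agree with the Gene Length and Protein Length fields of the record.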


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.481849
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.893444
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTCACGC CTTCACCGCC TCCCAACCAA CAGCCCCAGG TCCTCGAGTT CTTCGCGGGG 
ATCGGCTTGG CCAGAATCGG ACTCGAAGCA GCGGGTCTCA GGGTCTCGTG GTCCAACGAT
TACGAGACGA GCAAGAAGAA CATGTACGAA GGTCACTTTG GGACATCCAG CGACCACACA
TATGTGCTCC GCGATATTCG GAAAGTCTAC GCTGACCAGC TCCCGGCTGG CGCATCCGTG
GCATGGGCTT CATCACCTTG CACAGACCTG TCACTCGCAG GAGCCAGGGC CGGCCTGGCA
GGGGCAGAGT CAGGAACCTT CTGGGAATTC ATTAGAATCC TTAAAGATTT CAATGAATCA
CGCCCTCCTA TCGCCGTCCT TGAAAATGTT GTTGGACTAG CGACTTCACA TGCAGGCGAA
GATCTAGCTG CCGCAGTAAA GGCATTCAAT GAGCTCGGTT ACTCAGTCGA CGTCCTAGTG
ATCGACGCGC GCCGCTTCAT TCCACAGTCA CGACCGCGTC TTTTTCTTGT AGCCGCCCAA
AACCCTCCAA ATGGCCGCCC TCAGACAGAT TCCACACTAC GCCCCGACTT TCTTCAGCCA
GTCTTTGGGG ATCCAACACT CACAACGCAT AGGGCACATC TTCCTGAACC TCCGGCACTC
CTAACGTCCG GTTTTGGGAT GTGCGTCGAA GAGATGCCCT TGAATGACGA GCGATGGTGG
GATGAAGAAC GAACAGAGGC ATTCATGTCC TCGCTATCAC CCACACAATA CCAGCGCGTG
ATGCAGATGC ATTCCTCACC GGGCGTTAAG TACCGAACAG CATATAGGCG AACTCGTAAG
GGAATCCCCG TATGGGAGGT TCGCCCTGAT GACGTATCGG GGTGCTTGCG AACTGCACGC
GGTGGCTCTT CCAAGCAAGC TGTTCTAAGG GTCGATAACT CATCGCTTCA TGTCAGGTGG
ATGACCCCTC GAGAATATGC CCGCTTGATG GGAGCAGGCG AGTATAAGCT TGACGGGATC
CGAGCCAATA AGGCATTGTT CGGCTTCGGC GACGCTGTCG CCGCACCTGT CGTGCAGTGG
CTAAGCGAGA AATATCTTTT GCCTCTCCTT CGAGAAGAAA ATTTTACCGA GCCCGAGATG
ATGGAGATCC CTCTTGGCCA GTAG
 
Protein sequence
MFTPSPPPNQ QPQVLEFFAG IGLARIGLEA AGLRVSWSND YETSKKNMYE GHFGTSSDHT 
YVLRDIRKVY ADQLPAGASV AWASSPCTDL SLAGARAGLA GAESGTFWEF IRILKDFNES
RPPIAVLENV VGLATSHAGE DLAAAVKAFN ELGYSVDVLV IDARRFIPQS RPRLFLVAAQ
NPPNGRPQTD STLRPDFLQP VFGDPTLTTH RAHLPEPPAL LTSGFGMCVE EMPLNDERWW
DEERTEAFMS SLSPTQYQRV MQMHSSPGVK YRTAYRRTRK GIPVWEVRPD DVSGCLRTAR
GGSSKQAVLR VDNSSLHVRW MTPREYARLM GAGEYKLDGI RANKALFGFG DAVAAPVVQW
LSEKYLLPLL REENFTEPEM MEIPLGQ
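
The gene and protein sequences above are printed in blocks of ten characters, so the spaces and line breaks have to be removed before any analysis. The following Python sketch shows one way to do that, to compute GC content, and to translate the coding sequence. It uses only the standard library. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical, and the codon table is the amino-acid assignment shared by the standard code and translation table 11 (the two differ in permitted start codons, which this sketch does not model).

```python
from itertools import product

# Codon table in the conventional TCAG order. The amino-acid assignments
# are the same for the standard code and bacterial translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the record."""
    return "".join(seq.split()).upper()

def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)

def translate(seq: str) -> str:
    """Translate complete codons in frame 1; '*' marks a stop codon."""
    s = clean(seq)
    usable = len(s) - len(s) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[s[i:i + 3]] for i in range(0, usable, 3))

# First and last lines of the gene sequence as printed in the record.
first_line = "ATGTTCACGC CTTCACCGCC TCCCAACCAA CAGCCCCAGG TCCTCGAGTT CTTCGCGGGG"
last_line = "ATGGAGATCC CTCTTGGCCA GTAG"

print(translate(first_line))  # MFTPSPPPNQQPQVLEFFAG
print(translate(last_line))   # MEIPLGQ*
```

The two translated fragments match the beginning and end of the protein sequence in the record, with the terminal `*` corresponding to the TAG stop codon. Passing the full gene sequence to `gc_percent` or `translate` works the same way.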