Gene Ndas_0098 details


Gene Information

Locus tag: Ndas_0098
Symbol:
ID: 9243929
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 122232
End bp: 124265
Gene Length: 2034 bp
Protein Length: 677 aa
Translation table: 11
GC content: 76%
IMG OID:
Product: protein of unknown function DUF255
Protein accession: YP_003678055
Protein GI: 297559081
COG category:
COG ID:
TIGRFAM ID:
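
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the last codon of the CDS is a stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 122232
END_BP = 124265
REPORTED_GENE_LENGTH = 2034      # bp
REPORTED_PROTEIN_LENGTH = 677    # aa


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count, assuming the final codon is a stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


if __name__ == "__main__":
    computed_gene = gene_length(START_BP, END_BP)
    computed_protein = protein_length(computed_gene)
    print("gene length consistent:", computed_gene == REPORTED_GENE_LENGTH)
    print("protein length consistent:", computed_protein == REPORTED_PROTEIN_LENGTH)
```

Under those two assumptions, both reported lengths agree with the coordinates.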


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCAACC GCCTGAGCGA CGCGACCAGC CCGTACCTGT TGCAGCACGC CGACAACCCC 
GTGGAGTGGT GGCCCTGGGG TGAGGAGGCC CTCGCCGAGG CGCGTCGGCG CGACGTGCCC
CTCCTCGTCT CCGTCGGCTA CGCCGCCTGC CACTGGTGCC ACGTGATGGC CCACGAGTCC
TTCGAGGACG AGGCGACCGC GGCCCTGATG AACAGCCTGT TCGTCAACGT CAAGGTGGAC
CGCGAGGAGC GCCCCGACGT CGACGCCGTG TACATGGAGG CGACCCAGGC CATGACCGGC
CAGGGGGGCT GGCCCATGAC CGTGTTCGCC ACCCCCGACG GCGCCCCGTT CTACTGCGGC
ACCTACTTCC CGCGCGAGCA CTTCCAGCGC CTGCTGCGGG GCGTGGCCGA CGCCTGGCGG
GACCAGCGCA CCGAACTGGT CGGCCAGGGC GCGCGCGTGG TGGAGGCGCT GAGCGGCCCG
CGCACCCTGG CCGCCGCGCC CCCGCCCTCC GCGGACCGGC TCGACCTGGC CGTCCGCGCG
CTGGTGCGCG ACTACGACAG CGCCCACGGC GGTTTCGGCA CCGCGCCCAA GTTCCCGCCG
TCGATGCTGC TCTCCTTCCT CACCGCCCAG GACGAGCGCA CCCGGCCCCT GCAGAGCGCG
GACGAGTCCA CGCCCGCCTG GCTCATGGCC AGCGGCACCG CCCTGGCCAT GGCGCAGGGC
GGCATGTACG ACCAGCTCGG CGGCGGTTTC GCCCGATACT CGGTGGACCG CGAGTGGACC
GTGCCGCACT TCGAGAAGAT GCTGTACGAC AACGCCCTGC TGCTGCGCGC CTACGCCCGG
ATGGGCCGCC GCCCCTCGGG TCCGGGGGTC TCCGACGCCG CCACCCACGC CCTGCTGCGC
CGGGTCGCCG GGGAGACCGC CGACTGGATG CTGCGCGACC TGCGCACGCC CGAGGGCGGG
TTCGCCTCGG CGCTGGACGC CGACAGCGAG GGCGAGGAGG GCACCTACTA CGTGTGGACG
CCCGCCCAGC TGCGGGAGGT CCTGGGCGAG GAGGACGCCG CCTTCGCCGC CGAGGTGTTC
GGCGTGACCG AGGAGGGCAC CTTCGAGCGC GGCGCCTCCG TGCTCCAGCT GCCCGCCCCG
CCCGCCGACG CCTGGCGCTA CCAGCGGGTC CGTGAGGCCC TGCTGGCGGC CCGCGCCGAA
CGGGTCGCCC CCGCGCGCGA CGACAAGGTG GTGGCCGCCT GGAACGGCCT GGCGGTCGCC
GCGCTGGCCG AGGCCGGGGT GCTGCTGGAG CGGCCCGACC TGGTGGAGGC CGCCCGCGCG
GCCGCTGACC TGCTGCTGCG CGTGCACCTG CGGGACGGGC GCCTGGTCCG CACCTCCCGG
GACGGGCGCG CGGGCACCAG CGCCGGGGTG CTGGAGGACT ACGCCGACGT CGCCGAGGGG
CTGCTCGTCC TGCACGGTGT GACCGGGGAG GCGCGCTACG CGCACGAGGC CGGGCGCCTG
CTGGACACCG TCCTGGAGCG CTTCGGAGAC GGCTCCGGCG GGTTCTACGA CACCGCCGAC
GACGCCGAGC GCCTCTTCAA CCGGCCCCAG GACCCCACCG ACAACGTCAC ACCGTCCGGC
CGGTCGGCGG CGGCGTCCGC GCTGCTCTCC TACGCCGCGC TGACCGGATC CGAGCGCCAC
CGCACGGCCG CTGAGGAGGC GCTGTCCCCG GTGGCGGTGC TGGCGGAGAA GGCCGCCCGG
TTCGCCGGGT GGGGCCTGGC CACCGGCGAG GCCCTCCTGA CCGGGCCGCG CGCCGTGGCG
GTGGTGGGCG ACCCCGACGA CCCGAGGACC GCGGAGCTGG TGCACGCCGC GCTGGTCTGG
GCGCCGCTGG GCACCGTGCT CTCACGCGGC GACGGCCGCG ACGACGGAGG GGTGCCGCTG
CTGCGCGACC GCGCGCCGGT GGGCGGGCGA CCGACCGCCT ACGTGTGCGA GGGCTTCGTC
TGCAAGCTCC CGGTCACCTC GCCCGAGGAC CTGCGGGAGC AGCTGCTGGC CTGA
 
Protein sequence
MSNRLSDATS PYLLQHADNP VEWWPWGEEA LAEARRRDVP LLVSVGYAAC HWCHVMAHES 
FEDEATAALM NSLFVNVKVD REERPDVDAV YMEATQAMTG QGGWPMTVFA TPDGAPFYCG
TYFPREHFQR LLRGVADAWR DQRTELVGQG ARVVEALSGP RTLAAAPPPS ADRLDLAVRA
LVRDYDSAHG GFGTAPKFPP SMLLSFLTAQ DERTRPLQSA DESTPAWLMA SGTALAMAQG
GMYDQLGGGF ARYSVDREWT VPHFEKMLYD NALLLRAYAR MGRRPSGPGV SDAATHALLR
RVAGETADWM LRDLRTPEGG FASALDADSE GEEGTYYVWT PAQLREVLGE EDAAFAAEVF
GVTEEGTFER GASVLQLPAP PADAWRYQRV REALLAARAE RVAPARDDKV VAAWNGLAVA
ALAEAGVLLE RPDLVEAARA AADLLLRVHL RDGRLVRTSR DGRAGTSAGV LEDYADVAEG
LLVLHGVTGE ARYAHEAGRL LDTVLERFGD GSGGFYDTAD DAERLFNRPQ DPTDNVTPSG
RSAAASALLS YAALTGSERH RTAAEEALSP VAVLAEKAAR FAGWGLATGE ALLTGPRAVA
VVGDPDDPRT AELVHAALVW APLGTVLSRG DGRDDGGVPL LRDRAPVGGR PTAYVCEGFV
CKLPVTSPED LREQLLA
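
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any computation. The following is a minimal sketch of joining the blocks and computing a GC fraction; the helper names `join_blocks` and `gc_fraction` are hypothetical, and only the first row of the gene sequence is used here as sample input.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First row of the gene sequence, copied from the record above.
FIRST_ROW = "ATGTCCAACC GCCTGAGCGA CGCGACCAGC CCGTACCTGT TGCAGCACGC CGACAACCCC"

if __name__ == "__main__":
    seq = join_blocks(FIRST_ROW)
    print(len(seq), "bases, GC fraction", round(gc_fraction(seq), 3))
```

Applying the same two functions to the full gene sequence should give a length and GC fraction that can be compared with the Gene Length and GC content fields of the record.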