Gene Ndas_1112 details


Gene Information

Locus tag: Ndas_1112
Symbol:
ID: 9244962
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 1364575
End bp: 1365756
Gene Length: 1182 bp
Protein Length: 393 aa
Translation table: 11
GC content: 79%
IMG OID:
Product: MoeA domain protein domain I and II
Protein accession: YP_003679059
Protein GI: 297560085
COG category:
COG ID:
TIGRFAM ID:
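The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (which is consistent with the listed gene length) and that the unspliced CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(1364575, 1365756)
print(length)                     # 1182
print(protein_length_aa(length))  # 393
```

Both values agree with the Gene Length and Protein Length fields above.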


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.363631
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.36117
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGAGCGG GCGGCTGCAC CGGGGGACAC GGCGGGCACC GGTCGTGGCC GCGGGCGCGG 
GAGGCCGCCC GGCTCCTGGG CGGCGGCAGG CCCCCGGCTC CGCGGGAGCT CCCCCTGGAG
GACGCCCTGG GCGCCGTCCT CGCCGCGGAC GTGCACGCGT TGACGGGCCT GCCCGCCTTC
GACGCCTCGG CCATGGACGG CTTCGCGGTG TCGGGCCCGG GGCCCTGGCG GCTGGTCGGC
AGGCGGCTGG CCGGGGCGGC GGAGGCACCC GTGGGCCTGC GCCCGGGCGA GGCCGTCGAG
ATCGCCACCG GCGCCCGTGT GCCCAAGGAC ACGGAGTGCG TCGTGCCCTA CGAGCTGGCC
GCCGTTCGGG ACGGGACGGT GGACGGGCCC GCGGAGGCGG GCCGCCACGT GCGCTGGGCG
GGGGAGGAGA CCGCCCCCGG CGAGACCGTG CTCGAACGGG GAGCGGTGGT CGGCCCCGCC
GCGCTGGGTC TGGCCGCCAG CCTGGGCCAC GACACCCTGC CGGTGCTGCG CCCCCGGGTC
TCGGTCCTGG TCACCGGTGA GGAGATCACC ACATCGGGGC TGCCCGGCGA CGGCAGTGTG
CGCGACGCCA TCGGTCCGGC GCTGCCCGGG GTCGTCGAGC GCGCGGGCGG CCGGATCGCG
TCCCTGCGCC ACCTGGGCGA CGAGCGGCGT CCGCTGCTCG ACGCGCTGGA GGGCGCGGAC
GGCGACGTGG TCGCGGTGTG CGGTTCCTCG TCCCGGGGGC CGGCCGACCA CCTGCGTTCG
GTCCTGGAGG AGCTGGGCGC CGAGGTCGCG GTGGACGGGG TCGCCTGCCG CCCGGGCCAC
CCGCAGCTGC TCGCGCACAC CGACCGGACC GTGTTCGTGG GCCTTCCCGG CAACCCCGGG
GCCGCCCTGG TCGCCGCGGC CACCCTGCTG GTCCCCCTGC TGGCCGCCAT GACCGGACGC
CCGGACCCCG GAACGGGTCT GGCCCGAGCC GTCCTGGAGG GCGCCGTCAC CGCTCACCCC
CGGGACACCC GGCTGGTCGC GGTGCGCCTG GACGGCGGCC GGGCGCGGCC GGTGGGCCAC
GACCGTCCGG GCAGTCTGCG CGGCGCCGCG CTGGCCGACG CCTACGCGGT GGTGCCGCCG
GACTGGGACG GCGGCGAGGT GGAACTGCTC CGCGTGCCCT GA
 
Protein sequence
MGAGGCTGGH GGHRSWPRAR EAARLLGGGR PPAPRELPLE DALGAVLAAD VHALTGLPAF 
DASAMDGFAV SGPGPWRLVG RRLAGAAEAP VGLRPGEAVE IATGARVPKD TECVVPYELA
AVRDGTVDGP AEAGRHVRWA GEETAPGETV LERGAVVGPA ALGLAASLGH DTLPVLRPRV
SVLVTGEEIT TSGLPGDGSV RDAIGPALPG VVERAGGRIA SLRHLGDERR PLLDALEGAD
GDVVAVCGSS SRGPADHLRS VLEELGAEVA VDGVACRPGH PQLLAHTDRT VFVGLPGNPG
AALVAAATLL VPLLAAMTGR PDPGTGLARA VLEGAVTAHP RDTRLVAVRL DGGRARPVGH
DRPGSLRGAA LADAYAVVPP DWDGGEVELL RVP
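
The sequences above are displayed in space-separated blocks of 10 characters, 60 per line. A minimal sketch for turning such a display block back into a contiguous string, assuming whitespace is the only separator; the helper name is hypothetical, and the example uses only the final line of the protein sequence.

```python
def join_sequence_blocks(text: str) -> str:
    """Remove all spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split())


last_line = "DRPGSLRGAA LADAYAVVPP DWDGGEVELL RVP"
residues = join_sequence_blocks(last_line)
print(residues)       # DRPGSLRGAALADAYAVVPPDWDGGEVELLRVP
print(len(residues))  # 33
```

Applied to the full protein block (six lines of 60 residues plus this final line of 33), the joined length should match the listed Protein Length of 393 aa.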