Gene Ndas_4109 details


Gene Information

Locus tag: Ndas_4109
Symbol:
ID: 9247983
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 4905581
End bp: 4906669
Gene Length: 1089 bp
Protein Length: 362 aa
Translation table: 11
GC content: 81%
IMG OID:
Product: MoeA domain protein domain I and II
Protein accession: YP_003682011
Protein GI: 297563037
COG category:
COG ID:
TIGRFAM ID:
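
The gene and protein lengths listed above can be reproduced from the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive and that the CDS ends in a stop codon; both are assumptions about this record's conventions, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4905581, 4906669)
print(length)                  # 1089
print(protein_length(length))  # 362
```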


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCGAGG AGCCGCTCGA CGGCGCCCTC GGTGCCACCC TCGGCGCCGA CCTCGTCTCG 
GTGGTCGACG TCCCGGTGCT GGACAGCGCG GCCATGGACG GGTACGCGGT GGCCGGTGAG
GGCCCCTGGA CCGTCCTCGG CCGCTCGCTC GCCGGGCGGC GGGGGCCCGT CGTCCGCCTG
AACCCGGGGG AGGCGGTGGA GGTCGCCACG GGCGCCGTCG TGCCGGAGGG CACCACCGCC
GTCCTCCCCT GGGAGCGGGC GGCGGCCTCC TCCGGGCGCG TGCGCGGCGC GGCCGAGGCC
GGCAGGCACA TCCGCCGCAA GGGCGAGACC ACCCCCGCCG GGGCGCTGGC CGCGCGCCGG
GGCAGCCCCG TCACCCCCGC CCTCCTGGGG CTGGCCGCGA GCCTGGGCCT GGACACGCTG
CCCGTGGTCC GCCCCGCGGT GCGCGTCCTG GTCACCGGGG ACGAGGTCGT GCGCGAGGGG
AGACCGCGCC CGGGCACCGT GCGCGACGCG ATCGGCCCGC TGCTCCCCGG CCTGGTCGCC
TGGGCCGGAG GGCGCTGCCT GCCCCCGCTG GCGGTCGCCG ACCGGGGCCG GGACACGGCC
CGCGCCCTGG AGGCGTCCGG GCCCTCCGAG GTCGTCGCGG TCTGCGGCTC CTCCTCGGCC
GGGCCCGCCG ACCACCTGCG CCCGGTGCTC ACCGCGCTCG GCGCGCGGAT GGTCGTCGAC
GGGGTGGCCT GCCGCCCGGG GCACCCGCAG GTGCTGGCCG TGCTGCCCTC GGGAACCGTG
GTCGTGGGAC TGCCGGGCAA CCCCGGCGCC GCGCTGGCCG CCGCGCTCAC CCTGCTCGTC
CCGGTGCTCG CCGGTCGCGC GGACCGGCGC GACCCCGCCC ACCTCGGCCG ACGGGTCCGG
CTCATCGGGC CGACCCGGCC GCACCCGACC GACACCCGCC TGGTGCCGGT GCGCGTCAGC
CGCGACCTGG CGGTGGAACT GCCCGGCACC GGCTCGGCCG ACCTGCGCGC CGCCGCCGTC
GCCGACGCCC TCGCGGTCGT GCCGCCCGGC CGCCGGACGG GGCGCGTCGA ACTCGTGGAG
CTGCCGTGA
 
Protein sequence
MAEEPLDGAL GATLGADLVS VVDVPVLDSA AMDGYAVAGE GPWTVLGRSL AGRRGPVVRL 
NPGEAVEVAT GAVVPEGTTA VLPWERAAAS SGRVRGAAEA GRHIRRKGET TPAGALAARR
GSPVTPALLG LAASLGLDTL PVVRPAVRVL VTGDEVVREG RPRPGTVRDA IGPLLPGLVA
WAGGRCLPPL AVADRGRDTA RALEASGPSE VVAVCGSSSA GPADHLRPVL TALGARMVVD
GVACRPGHPQ VLAVLPSGTV VVGLPGNPGA ALAAALTLLV PVLAGRADRR DPAHLGRRVR
LIGPTRPHPT DTRLVPVRVS RDLAVELPGT GSADLRAAAV ADALAVVPPG RRTGRVELVE
LP
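
The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such text could be cleaned, checked for GC fraction, and translated is given below. It assumes the NCBI translation table 11 codon assignments, which match the standard code for internal codons; table 11 also allows alternative start codons that are read as M at initiation, a case this sketch does not handle and which does not arise here because the gene starts with ATG. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already cleaned nucleotide sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First display line of the gene sequence shown above.
first_line = clean(
    "ATGGCCGAGG AGCCGCTCGA CGGCGCCCTC GGTGCCACCC TCGGCGCCGA CCTCGTCTCG"
)
print(translate(first_line))  # MAEEPLDGALGATLGADLVS
```

Passing the full cleaned gene sequence to `translate` and comparing the result with the cleaned protein sequence is one way to check that the two listings agree.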