Gene Ndas_0546 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_0546 
Symbol 
ID9244387 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp675949 
End bp677325 
Gene Length1377 bp 
Protein Length458 aa 
Translation table11 
GC content74% 
IMG OID 
ProductLPXTG-motif cell wall anchor domain protein 
Protein accessionYP_003678499 
Protein GI297559525 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGAGAA GTCCCCAGAC CCCCTCACTG CTCCTGAGAG GGGTCTGCGC CTCCACCTCC 
GCCCTCCTGG CCTTCGGGCT GCTGCCCGCC GCCCCCGCGC TGGCGGAGAC CGCCGTCAAC
CCCGTGTCCG GCAACCAGGG CTTCCTCGTG GTCACCGAAG GCGACGCGGT CCTGGCGGGC
AACGAGTCCG AGGGAGCCGT CGCGGTCGGC GGCGACCTCG TGTTCGGCGA CTACAACGTG
GCCGCCCACA CAGCCGGTTC CCACACCGCC GAGGGGGACG AGCACCCCAC CGCCCTCCTG
GTGAACGGCC GGGTGGACTT CGGCGCCAGC GAGGGCGACC TCAAGGTGCT CCAGAGCGGG
TACGTCAAGG TCGGCGCGGC AGCGGACGCC GACATCAGCA CGACGGACCA CAACGGCGCC
CAGGTCAACA CCGTGGTCGC CGGCCCCGAG GGCCGTGAGA GCACACCGCG GGTCTCGCTC
ACCGTCCGCC AGCCGCAGGA GTCCGTCACC GGCCCGGCCC TCGACGTCGC CGGGCTGTTC
CCCGCCTACC GCGAGCGCTC GGCCGGACTC GCCGCGTGCG AGGGCGGGGT CGTCCCGCTG
CGGACGGGCG AGGGTTCCGA GGCCCTGCCG CCCTTCGAGC AGGGCGCCAA CGTCACCCTG
CCGCTGGCCC AAGGCGCCCA GAACGTGTGG AACGTCGCCG CCGAGGACCT GGCCGCCCTG
GAGGTCATCA CCTTCGAGGA CAAGCCCGGA CCGGACGCGC CGCTCCTGGT CAACGTCGAC
ACCTCGGGCG TGGGCGGACG GTTCGAGTGG CGCACCCCCA ACATGGCCGG GATCGGGCTG
GAGGAGGCCC GCTACGTGCT GTTCAACTTC GGGGAGGCCG CCTCCGTCAC CTTCACCCCC
GACAGCCGGA CCCTGGAGGG CACCGTCTTC GCCCCCGACG CGAAGGTGAG CTGGCTGTCG
CCGAGCAACA TCGAGGGCAA CGTCGTGGCC GCCTCCTTCG AGCACGGCTC CCTGTCGGCG
GGCGTCGTCG GCAACGGTGA GCTGCACAAC GGCGACTTCT CCGCGGAGCT GACTCCGTGC
GGGCCCGGCG GGGGCGAGAC CCCCGAGGAG CCGGGAGAAC CCGAGCCGAC TCCCGACGGT
GAGGAGCCCG GGGCCCCCGA GGAGGAGCAG CCGACCGAGG CTCCCGTCGC GGAGGAGACG
CCCGGCGCGG AGCCCTCCGC GTCCCCCTCA CCCGAGCAGG CGGCCGGATC GGACGGCGGT
CTGGCCGTGA CCGGCGCGAG CCTGTGGGGG CTGGTCGGCG CGGCGGTGGT GGCGATCGGG
GCCGGTGTGG CGGCCTTCGT GTTCACCAGG CGCAGGAAGT CCGGGTCCTC CGTCTGA
 
Protein sequence
MPRSPQTPSL LLRGVCASTS ALLAFGLLPA APALAETAVN PVSGNQGFLV VTEGDAVLAG 
NESEGAVAVG GDLVFGDYNV AAHTAGSHTA EGDEHPTALL VNGRVDFGAS EGDLKVLQSG
YVKVGAAADA DISTTDHNGA QVNTVVAGPE GRESTPRVSL TVRQPQESVT GPALDVAGLF
PAYRERSAGL AACEGGVVPL RTGEGSEALP PFEQGANVTL PLAQGAQNVW NVAAEDLAAL
EVITFEDKPG PDAPLLVNVD TSGVGGRFEW RTPNMAGIGL EEARYVLFNF GEAASVTFTP
DSRTLEGTVF APDAKVSWLS PSNIEGNVVA ASFEHGSLSA GVVGNGELHN GDFSAELTPC
GPGGGETPEE PGEPEPTPDG EEPGAPEEEQ PTEAPVAEET PGAEPSASPS PEQAAGSDGG
LAVTGASLWG LVGAAVVAIG AGVAAFVFTR RRKSGSSV