Gene Ndas_5349 details

Gene Information

Locus tag: Ndas_5349
Symbol:
ID: 9249252
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014211
Strand:
Start bp: 525690
End bp: 526736
Gene Length: 1047 bp
Protein Length: 348 aa
Translation table: 11
GC content: 73%
IMG OID:
Product: LPXTG-motif cell wall anchor domain protein
Protein accession: YP_003683235
Protein GI: 297564262
COG category:
COG ID:
TIGRFAM ID:
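
The coordinate and length fields above are mutually consistent, which a short check makes explicit. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS length includes the stop codon (the gene sequence in this record ends in TAG). The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 525690
END_BP = 526736


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumption)."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumption)."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1047
print(protein_length(length_bp))  # 348
```

Both results match the Gene Length and Protein Length fields reported above.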


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.584645
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.183228
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCAACA CACTCCGTTA CTCCTTCGCC ACCGTCATGA CCGCCGGGCT GGTCGTGGCG 
CCCGTGGCGG CGGCCTTCGC CGACACCACC ACCAGCGGCT CCGGCGGGAT CGGCAGCGGG
AACCAGGTCG TCGTGCCCGT GGACGTCGAA GCCGAGCTGT GCGGCAACTC GATCGCGATC
CTCGGCATCT CCAGCGCCAC GTGCACCCAG GTCTCGGAGG TCCTCTACGA GGCCAGCGGC
CAGGGCGGGG CCTCCACGGA CGGCTCCGGC GGTGTGGCCA GCGGCAACCA GATCATCGTC
CCCGTGGACG CCGCCATCGA CGCCTGCGGC AACGCGGCCG CGGTCGGCGG CATCAGCCAG
GCCGAGTGCG TCGAGGTGGT CGAGGTCCTG GAGGAGGAGA GCGCCGACGC GCCCACCACC
AGGACCGACG GCTCCGGCGG TGTGGCCAGC GGCAACCAGA TCATCGTCCC CGTGGACGCC
GCCATCAACG TCTGCGGCAA CTCGGTGGCC GTCCTGGGCG GCTCCAGCGC CAAGTGCACC
ACCATCATCA ACATCATCCA GGCCTCCCCC GAGAACGAGG GCGCCCCCGA CGCCGCCACC
AGCGGCGCGG GCGGGATCGG CAGCGGCAAC CAGGTCGTGG TCCCGGTGGA CGCGGCCGTC
GACATCTGCG GCAACGCCGT GTCCGTGCTC GGCCTGGCCG AGGGCTCCTG CATGGAGATC
ATCTCCGAGG AGGAGCGGCC GGAGGAGCCC GGCGAGGAGA AGCCCGAGGA GCCCGGCCAG
CCCGAGGAGG AGAAGCCGGA GGAGGAGCAG CCCGAGGAGC CGGGTGAGGA GAAGCCCGAG
GAGCCCCGCG AGGAGGACAA GGGCGAGGAC GACTCCAGCA CCGGCGAGGA GCCGACCGAG
CCCCAGGCCG ACGAGCAGCT CCCCGTGACC GGTGGCGCCC TGGCCGGTCT GGTCGCCGCG
GGCGTCGCCG CGGTCGGCGC GGGCGGTGCC GGGCTGTACT TCGCCCGCAA GCGCAAGGCC
GCCGCCGTGA CCGGCGACGA CGAGTAG
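
The gene sequence is laid out in space-separated blocks of 10 bases, so the whitespace has to be stripped before computing anything from it, such as the GC content field. The following is a minimal sketch of one way to do that; the helper names are hypothetical, and only the first row of the sequence is used here as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a sequence, ignoring whitespace."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First row of the gene sequence above (six blocks of 10 bases).
first_row = "ATGCGCAACA CACTCCGTTA CTCCTTCGCC ACCGTCATGA CCGCCGGGCT GGTCGTGGCG"
print(len(clean_sequence(first_row)))  # 60
print(f"{gc_fraction(first_row):.0%}")
```

Passing the full 1047 bp sequence to gc_fraction would give a figure to compare against the GC content field of this record.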
 
Protein sequence
MRNTLRYSFA TVMTAGLVVA PVAAAFADTT TSGSGGIGSG NQVVVPVDVE AELCGNSIAI 
LGISSATCTQ VSEVLYEASG QGGASTDGSG GVASGNQIIV PVDAAIDACG NAAAVGGISQ
AECVEVVEVL EEESADAPTT RTDGSGGVAS GNQIIVPVDA AINVCGNSVA VLGGSSAKCT
TIINIIQASP ENEGAPDAAT SGAGGIGSGN QVVVPVDAAV DICGNAVSVL GLAEGSCMEI
ISEEERPEEP GEEKPEEPGQ PEEEKPEEEQ PEEPGEEKPE EPREEDKGED DSSTGEEPTE
PQADEQLPVT GGALAGLVAA GVAAVGAGGA GLYFARKRKA AAVTGDDE
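
The protein sequence can be checked against the gene sequence by translating it, and the LPXTG motif named in the product field can be located with a pattern search. The following is a minimal sketch under stated assumptions: translation table 11 assigns the same amino acids to codons as the standard genetic code and differs in which codons may act as start codons, which this sketch does not handle (an alternative start codon would not be rendered as M). The names translate and find_lpxtg are hypothetical helpers.

```python
import re

# Codon table in TCAG order; amino-acid assignments shared by tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(raw_dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(raw_dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE.get(dna[pos:pos + 3], "X")  # X marks unknown codons
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def find_lpxtg(raw_protein: str) -> list[tuple[int, str]]:
    """Return (0-based offset, match) for each L-P-any-T-G occurrence."""
    protein = "".join(raw_protein.split()).upper()
    return [(m.start(), m.group()) for m in re.finditer(r"LP.TG", protein)]


# First 30 bases of the gene and part of the last protein row of this record.
print(translate("ATGCGCAACA CACTCCGTTA CTCCTTCGCC"))  # MRNTLRYSFA
print(find_lpxtg("PQADEQLPVT GGALAGLVAA"))            # [(6, 'LPVTG')]
```

The translated prefix agrees with the start of the protein sequence above, and the match LPVTG sits near the C-terminus in the final row of the protein.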