Gene Ndas_0344 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_0344 
Symbol 
ID9244179 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp419270 
End bp420874 
Gene Length1605 bp 
Protein Length534 aa 
Translation table11 
GC content73% 
IMG OID 
ProductAminopeptidase Y 
Protein accessionYP_003678298 
Protein GI297559324 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.138174 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.69761 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCGCACCA GTTCCGAAGG GAAGCCGCCG CGTGGAGCGG CCGCCCCCGC CTCCCGGCGC 
CGCGCCGCCA CGGCCGGCGC CGCGGCCGTG CTGATGGCCG TCGCCGCCCT GGCGTCCTCC
CCCGCCCACG CCGAACCCGG GCCGCCCGGG CACGCCGGGC CCCCGGACCA CTCCGGAGCC
CCCGGGCACG CGGGGCCGCC CGCGCACGCG GGCAGGCCGG GCGAGGACCC GCCGCTGTCG
GAGCGGGTGA GTGCGGAGGC CATCCGCGAG CACCTGGAGA ACCTCGACAC CATCGCCGAG
TACAACGGCG GCAACCGGGC CACGAACACC CCCGGCTACG ACGTCGCCGC CCTCTACATC
GAGGACCAGC TGGAGCGCGC CGGGTACGAG CCCTACCGGC ACGAGTACGA GTACGAGCTG
TGGCGGGAGA ACTCCCCGGC CGTCCTCGCC CAGACCGCGC CGGAGCAGGT CGCCTACGAG
GCCGAGACCG ACTACGCGAC CATGAGCTAC TCCGGCTCGG GCGACGTCAC CGCCCCGGCG
GTGGCCGTCA ACGCCGACAG CGCCGCGAGC GGCTGCTCCC CCGACGACTT CGCCGACTTC
CCCGAGGGCG CGGTCGCGAT CACCGTGCGC GGCACCTGCC CGTTCTCGGA CAAGGTGGAC
AACGCCGCCG CCGCGGGGGC CGCGGCCGCC CTGGTGGTCA ACAACGAGGA CGAGGTCTTC
CTGGGCACCG TGGGCGAGCA CTCGGCCATC CCCGCCCTGG GCGTGTCCGG AACGCTGGGC
GCGGAGCTGC TGGCCGCCGA GGGCCTGGAG CTGCGGGTGA GCGTGGACGC CGAGGTCAGC
AACGAGACCT CCTACAGCGT CCTGGCCGAG ACCCCGGGCG GCCGGGACGA CAACGTGGTG
GTCGTGGGCG GCCACCTGGA CAGCGTCGAG GACGGCCCCG GCATCAACGA CAACGGCAGC
GGCGCGGCCT TCCTGCTGGA GACCGCCATC CAGCTGGCCG AGCAGGAGGA GCCCGACAAC
AAGGTGCGGT TCGCCTTCTG GGGCACCGAG GAGGAGGGCC TGGTCGGTTC CACCCGGTAC
GTGGAGGACC TGACCGCGCA GGAGGTCGAG GACATCGCCC TCTACCTCAA CTTCGACATG
ATCGGCTCGC ACAACTACGG CCGGTTCGTG CTGGACGGCC GGATGGAGCT GCCCGGGTCC
GTGGCCGCCC CGTCGGGCTC CGGCGCGATC GCGAAGGTCT TCGAGGACTA CTTCGCCGCC
CAGGACCAGG TCAGCGAGCC CGGCGTGCTG AGCGGGCGCA GCGACTACCA GGCCTTCATG
ACCGCGGGCA TCCCGTCGGG CGGCCTGTTC AGCGGCGCCG ACGGCGTCAA GACCGAGGAG
CAGGTCGAGT GGTACGGCGG TACGGCGGGC GAGCAGTTCG ACCCGTACTA CCACACCGCC
GACGACACCA TGGAGCACAT CAACTGGGAC TCGGTGGCCG AGCTGTCGGC GGCCGGTGCG
CACGGCGTGG AGTTCTTCGC CGAGAGCACC CTGCCGGTGA ACGGTGTGCT GCGCACGACG
GCCGCGCCCG ACTTCCCGCG CCTGGGCGAC GGCTGGCTCA GGTAG
 
Protein sequence
MRTSSEGKPP RGAAAPASRR RAATAGAAAV LMAVAALASS PAHAEPGPPG HAGPPDHSGA 
PGHAGPPAHA GRPGEDPPLS ERVSAEAIRE HLENLDTIAE YNGGNRATNT PGYDVAALYI
EDQLERAGYE PYRHEYEYEL WRENSPAVLA QTAPEQVAYE AETDYATMSY SGSGDVTAPA
VAVNADSAAS GCSPDDFADF PEGAVAITVR GTCPFSDKVD NAAAAGAAAA LVVNNEDEVF
LGTVGEHSAI PALGVSGTLG AELLAAEGLE LRVSVDAEVS NETSYSVLAE TPGGRDDNVV
VVGGHLDSVE DGPGINDNGS GAAFLLETAI QLAEQEEPDN KVRFAFWGTE EEGLVGSTRY
VEDLTAQEVE DIALYLNFDM IGSHNYGRFV LDGRMELPGS VAAPSGSGAI AKVFEDYFAA
QDQVSEPGVL SGRSDYQAFM TAGIPSGGLF SGADGVKTEE QVEWYGGTAG EQFDPYYHTA
DDTMEHINWD SVAELSAAGA HGVEFFAEST LPVNGVLRTT AAPDFPRLGD GWLR