Gene Ndas_3894 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3894 
Symbol 
ID9247765 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4666260 
End bp4667777 
Gene Length1518 bp 
Protein Length505 aa 
Translation table11 
GC content74% 
IMG OID 
ProductLeucyl aminopeptidase 
Protein accessionYP_003681797 
Protein GI297562823 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAGCGA ACGATTGTGA GCGGCCTGTG CCATTCGCCA CGGAGATCCA GCCCCGTTCG 
GGGAGCCTCG CCGAATCCAC CGCGGACCTG CTGGTCCTTC CCGTCCTCTC GGGAGACGGC
GGACCCGCGG AGGTCGAAGG GGTCGCCGAG GGCGGCCTGA CACAGGTGCT TCCGGCTCCG
CTCGGCGATC TGTTCGCGCA CTACTCGCTC ACCGGGAAAC CCGGTGAGCT GGCCCAGTTC
CCGGTGCCCC GGGACCAGGG GCTGGTACGG CTGGCCCTTC TGGGGGTCGG TTCGGGTTCC
CCCGACGACC TCCGGAAGGC CGGCGCAGCC CTGTCCCGGG CCGCCCGGGG AAGAGAGCGG
GCCGCCCTGG CCTGGCCGGG TGTCGGCGGC GACGCCGCCA CCGCCTTCGC CGAGGGCGCC
CTCCTGGCCT CCTACACCTT CAGCCTCAAG ACCGCCGAAC CCCCGGCGGA CAAGCGTCCC
GCCCCGGTGG TCGAGGTGGT GGACCCGGGC GCCGGGGACC TCGCCGACGC GCTGGAGCGC
GGCACCCGGC TGGCCGCGGC CACCGCCCTG GCCCGCGACC TCATCAACAC CCCGTCCATG
GTCAAGGACC CCGCGTGGAT GGCCGCGCGC GCCAGCGAGG TCGCCGCGGA CTCCGGCCTC
CAGGTGCGGG TCTGGGACGA GGCGGAGCTG GAGCGCGACG GCTTCGGCGC GATCCTCGCC
GTCGGCCGGG GCTCCTCACG CCCCTCCCGG CTGGTGCAGC TCTCCTACAC CCCCGAGAAC
CCCACCGCGC ACGTGGTGCT GGTCGGCAAG GGCATCACCT TCGACACCGG CGGCCTCTCG
CTCAAGCCCA ACGACAACAT GTCGCTGATG AAGACCGACA TGAGCGGCTC GGCGGTCGTG
CTCGCCGTGC TGTCCGCGCT GTCGGCCGTG GGCGCCTCGG TGCGGGTGAC CGGTCTGCTG
GCCCTGGCCG AGAACGCCTT CGGCGGCGAC GCCACACGCA TCGGCGACGT GCTCACCACC
TACGGCGGCA CGACCGTCGA GGTGCTCAAC TCCGACGCCG AGGGCCGCCT GGTCCTGGCC
GACGCCATGG GTTACGCGGT CGCCGAGCTG GCCCCGGACG TGCTCGTGGA CGTGGCCACG
CTCACCGGCG CGGCCAAGGT CGCGCTCGGT ACGGGCACGG GCGCCCTGTA CAGCACCGAC
GACGCGCTGG CCGCCGAGAT CGAGACCGCG GGCCGGGCCT CCGGCGAGCC GCTGTGGCGG
ATGCCGCTGA CCGAGGAGTA CGTCGAGACC ACCGAGTCCC GCGTGGCGGA CCTGGCCAAC
ATCGGCACCC GTCGCGAGTT CGGCCCGGCC GGCGCCACCG ACGCCGCCCT GTTCCTGCGC
GAGTTCACCG GCGGCGTGCC CTGGGCCCAC CTGGACATCG CCGGTCCCGG CCGGTCGACC
AAGGAGAGCG GCCTGCTCAG CAAGGGCGGC ACCGCCTTCG CCACGCGCAC GCTGCTGCGC
TGGCTCGCCG AGCGCTAG
 
Protein sequence
MTANDCERPV PFATEIQPRS GSLAESTADL LVLPVLSGDG GPAEVEGVAE GGLTQVLPAP 
LGDLFAHYSL TGKPGELAQF PVPRDQGLVR LALLGVGSGS PDDLRKAGAA LSRAARGRER
AALAWPGVGG DAATAFAEGA LLASYTFSLK TAEPPADKRP APVVEVVDPG AGDLADALER
GTRLAAATAL ARDLINTPSM VKDPAWMAAR ASEVAADSGL QVRVWDEAEL ERDGFGAILA
VGRGSSRPSR LVQLSYTPEN PTAHVVLVGK GITFDTGGLS LKPNDNMSLM KTDMSGSAVV
LAVLSALSAV GASVRVTGLL ALAENAFGGD ATRIGDVLTT YGGTTVEVLN SDAEGRLVLA
DAMGYAVAEL APDVLVDVAT LTGAAKVALG TGTGALYSTD DALAAEIETA GRASGEPLWR
MPLTEEYVET TESRVADLAN IGTRREFGPA GATDAALFLR EFTGGVPWAH LDIAGPGRST
KESGLLSKGG TAFATRTLLR WLAER