Gene Namu_2110 details

Gene Information

Locus tag: Namu_2110
Symbol:
ID: 8447721
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 2327969
End bp: 2329255
Gene Length: 1287 bp
Protein Length: 428 aa
Translation table: 11
GC content: 68%
IMG OID: 645041233
Product: peptidase M24
Protein accession: YP_003201477
Protein GI: 258652321
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0006] Xaa-Pro aminopeptidase
TIGRFAM ID:
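
The coordinates and lengths listed above are mutually consistent, which can be checked with a short sketch. It assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are illustrative and not part of the database.

```python
# Values copied from the Gene Information table above.
START_BP = 2327969
END_BP = 2329255


def gene_length(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp):
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1287
print(protein_length(length_bp))  # 428
```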


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value: 0.0590999
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.141965
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACGATCA CCAGCCCCGC CGCGCCACCC CGGCCGGCCG CCCCGACCGG GGCGCTCACC 
CTGGCCGCCC CTGGGCACAT GGGTGTCGAC TACGAGGCCC GGGTCGACTT CGACCGGTTG
CGCCGGTACC GGATCGATCG GGCCCGGGCG GCACTGGCGG CCAGCGAGTG CGGCGCGTTC
CTGTTGTTCG ACTTCTACAA CATCCGGTAC ACGACCTCCA CCTGGATCGG TGGGGCGTTG
GGCGACAAGA TGATTCGCTA CGCGCTGATC ACCCGGGACA GCGATCCGGT GCTGTGGGAT
TTCGGGTCGG CGGTCAAGCA TCACCAGTTG CACTCGCGGT GGATCCCGGA GCAGAACTAC
CGGGCCGGGT TCCTGGGATT CCGCGGCGCG GTGGCGCCGA CCGTAGGGTT GATGCAGGCC
GCCGTGGCCG AGATCAAGGG CCTGCTGGTC GAGGCCGGGG TGGCCGACCA GCCGGTCGGG
GTGGACATCG TGGAGCCGCC GTTCCTGTTC GAGCTGCAGC GGCAGGGCCT GCGGGTGGTG
GACGCGCAGC AGTCCATGCT GGATGCGCGG TGTATCAAGT CGCCCGACGA GATCGTGCTG
CTCAACCAGG CCGCGGCGAT GGTCGACGGG GTCTATCAGG ACATCGTGGA GGTCCTCAAG
CCGGGGATCC GGGAGAACCA GATCGTGGCG TTGGCCAACC AGCGGCTCTA CGAGATGGGC
TCCGATCAGG TCGAGGCGGT CAACGCGATC TCCGGGGAGC GGTGCAACCC GCATCCGCAC
AACTTCACCG ACCGGTTGAT CCGCCCGGGT GATCAGGCGT TCTTCGACAT CATCCACTCC
TACAACGGGT ATCGGACCTG CTACTACCGC ACGTTCGCGG TCGGCAAGGC CACCGCGGCG
CAGCGGGACG CCTACACCCG GGCCCGAGAA TGGATGGACC GGGGGATCGC CGGGATCAGG
GCCGGGATCG GCACCGACGA GGTCGCCGCA TTGTTGCCGC CCGCGCAGGA GTTCGGGTTC
GGCACCGAGA TGGACGCGTT CGGCCTGCAG TTCGCGCACG GGCTGGGGCT GGGTCTGCAC
GAGCGACCGA TCATCTCGCG GCTCAACTCG TTCAAGGAAC CGGTGGAACT GCAGGTCGGC
ATGGTCTTCG CGTTGGAGAC CTACTGCCCG GCCAGCGACG GCGTCTCGGC CGCCCGCATC
GAGGAGGAGA TCGTCATCAC CGACACCGGC CCGAAGATCT TGACCCTGTT CCCGGCCGAG
GAACTGTTCG TGGCCAACCC CTACTGA
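
The gene sequence above is laid out in space-separated blocks of 10 bases, so it needs to be joined before any composition statistic such as GC content is computed. The following is a minimal sketch of that step; the helper names are hypothetical, and the example call uses only the first two blocks of the sequence rather than the full gene.

```python
def clean_sequence(text):
    """Remove block spacing and line breaks, and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(text):
    """Fraction of G and C bases in a (possibly blocked) nucleotide string."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two 10-base blocks of the gene sequence, used as a small example.
first_blocks = "GTGACGATCA CCAGCCCCGC"
print(gc_fraction(first_blocks))  # 0.7
```

Pasting the full gene sequence into `gc_fraction` gives the value to compare against the GC content field in the table.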
 
Protein sequence
MTITSPAAPP RPAAPTGALT LAAPGHMGVD YEARVDFDRL RRYRIDRARA ALAASECGAF 
LLFDFYNIRY TTSTWIGGAL GDKMIRYALI TRDSDPVLWD FGSAVKHHQL HSRWIPEQNY
RAGFLGFRGA VAPTVGLMQA AVAEIKGLLV EAGVADQPVG VDIVEPPFLF ELQRQGLRVV
DAQQSMLDAR CIKSPDEIVL LNQAAAMVDG VYQDIVEVLK PGIRENQIVA LANQRLYEMG
SDQVEAVNAI SGERCNPHPH NFTDRLIRPG DQAFFDIIHS YNGYRTCYYR TFAVGKATAA
QRDAYTRARE WMDRGIAGIR AGIGTDEVAA LLPPAQEFGF GTEMDAFGLQ FAHGLGLGLH
ERPIISRLNS FKEPVELQVG MVFALETYCP ASDGVSAARI EEEIVITDTG PKILTLFPAE
ELFVANPY
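
The protein sequence begins with M although the gene sequence starts with GTG, which is consistent with translation table 11, where GTG can act as an initiation codon. The sketch below shows one way to reproduce that translation without external libraries. It assumes the amino acid assignments of table 11 match the standard code and that an alternative start codon in the first position is read as methionine; the start codon set is taken from the NCBI definition of table 11 as best recalled, and the function name is illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Assumed initiation codons for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna):
    """Translate a CDS given as a (possibly blocked) DNA string.

    Translation stops at the first stop codon; a recognized start codon
    in the first position is emitted as M.
    """
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        codon = seq[pos:pos + 3]
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        if pos == 0 and codon in START_CODONS:
            residue = "M"
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate_cds("GTGACGATCA CCAGCCCCGC CGCGCCACCC"))  # MTITSPAAPP
```

The output matches the first block of the protein sequence listed above.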