Gene Namu_4916 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_4916 
Symbol 
ID8450547 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp5489097 
End bp5490035 
Gene Length939 bp 
Protein Length312 aa 
Translation table11 
GC content74% 
IMG OID645043955 
Productproline iminopeptidase 
Protein accessionYP_003204179 
Protein GI258655023 
COG category[R] General function prediction only 
COG ID[COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily)  
TIGRFAM ID[TIGR01249] proline iminopeptidase, Neisseria-type subfamily 


Plasmid Coverage information

Num covering plasmid clones64 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCACGC CGATCGAGCC GTACGCGACC GGTCTGCTGC CGGCCGCGCA CGGTGCGCAG 
CTGTACTGGG AGTGCTCGGG GAACCCGCGT GGCCGGCCGG CCCTGTTCCT GCACGGCGGC
CCGGGCGCGC CGATGTCCGG CGGATACCGG CGGCGGTTCG ACCCGGACCG CTGGCGTGTG
GTGTACCTGG ACCAGCGCGG ATGCGGACGT AGCCGGCCGC TGGCCCACCA GGACCTGGCC
TCGCTGGCCG GCAACACCAC CGACCAACTG ATCCAGGACA TCGAAACGTT GCGGGTCCAC
CTGGGCATCG ACCGCTGGCT GGTGGTCGGC GGGTCGTGGG GCGTCACCCT GGCCCTGGCC
TACGCGCAGC GGCACCCCGA CCGGGTGTCC GGGCTGGTCC TGGCCGCGGT CACCACGGGC
GGCCGGGAGT ACCTGGAATG GATCACCGAG TCGATGCGGC ACGTGTTCCC GCGGGAGTGG
GACGAGTTCG CCGCCGCATC CGGGCGACGG CCCGGCCAGC GGGTCCTGGA CGCCTACCGG
GAGCGGATCA CCGACCCCGA CCCGGAGGTG CGGGCGGCCG CGGCGGCGGC CTGGTGCGCC
TGGGAGGACG TGCACGTTTC GCTGGCCCCG GACTGGGCCC CGTCCGCGGC GTTCGCCGAC
CCGCAGTTCC GCGCCCAGTT CGCCACCCTG GTCATCCACT ACTGGGCCAA CGACTGCTTC
CTGCCCCCGG ACGGCGTGCT CGGCGCGATG GCCACGATCA CCGACCTGCC CGGCGTGCTG
ATCCACGGCC GGTACGACGT CAGCGGCCCG CTGTCGGCGG CCTGGGAACT GCACCGGCGC
TGGCCGGCCA GCCGCCTGGT GGTGCTGGCC GACAGCGGGC ACAGCGGGGC GTCGATGACC
GACGAGCTGA CCGCCGCCAT CGCCGGGTTC GACCCGTGA
 
Protein sequence
MSTPIEPYAT GLLPAAHGAQ LYWECSGNPR GRPALFLHGG PGAPMSGGYR RRFDPDRWRV 
VYLDQRGCGR SRPLAHQDLA SLAGNTTDQL IQDIETLRVH LGIDRWLVVG GSWGVTLALA
YAQRHPDRVS GLVLAAVTTG GREYLEWITE SMRHVFPREW DEFAAASGRR PGQRVLDAYR
ERITDPDPEV RAAAAAAWCA WEDVHVSLAP DWAPSAAFAD PQFRAQFATL VIHYWANDCF
LPPDGVLGAM ATITDLPGVL IHGRYDVSGP LSAAWELHRR WPASRLVVLA DSGHSGASMT
DELTAAIAGF DP