Gene Namu_5100 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_5100 
Symbol 
ID8450731 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp5685937 
End bp5687844 
Gene Length1908 bp 
Protein Length635 aa 
Translation table11 
GC content73% 
IMG OID645044135 
Productpeptidase S9 prolyl oligopeptidase active site domain protein 
Protein accessionYP_003204359 
Protein GI258655203 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones48 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCTGGAAC ACGCCCCCAC TCACGACCAG CTCGGCCTGA CCCCGCAGCT GCGGACGGCG 
ATCGATCGCA TCGCCGGCGT CGTCGGCTCC TGGTGCCCGT CGATCTCCCC GGACGGCCGG
CAGATCGCCT ACGTCACCGA CCGTTCCGGG CTGCCCCGGC TGGAGGTCGC CCATCTGGAC
ACGGCCGGCG AATGCGCCGA CCGCACCCCC CGGCAGGTCT CGCTGCCGGA CCAGGAGGTC
ATCTCGGTGG CCTGGTCGCC GGACGGACAG TGGCTGGCCT ACCTGGTCTC GCCGCGCGGG
TTCATCCGGG CCGAGCTGCA CGCGATCCGA CCGGACGGCA CCGATCACCG CGCACTGGCC
GGCCTGGCCG ACCTGGAGAC GGCGTTTGCC GGGACCTGGA CCAGCCTGCC GCACACCTAC
GCGTTCTCAC TGGCCGACGG CCGCAGCCCG GACGCCGACG TCTGTTTCGT CGATGTGCAG
ACCGGCGCGG TTCGGCGGGT CGCGACCGGC GGCTTCCTGC TGGTGACCGG GGTGTCCCCG
GACGCGACCC GGGTGCTGGC CCGACGGGGG CCGCGGGGCC GCCGGCACCT GGTGCTGGCC
GACGTGCCGC CCATCGCCGA CGCGCCGTTG ACGCCACCTC GCCGCCTGCT CGAAGCCGAC
TTCCCCGTCG AGGGGACCGA TGTCGCCGAG GACGGCCGGT TCTCCGCCGA TGGTGGCACC
GTCTACCTGC GGGTCGGCGC CGGGCGCGAG CACCCCGCCC TGGGCGCCGT CGACCTGGAC
CCAGACGGCC GCCCGGGACG GCTGCGGATC CTGGCCCAGC GCGACGACTC GGACCTGGAC
GCCTACGCGG TGCTGGACCG GGGCGCTCGA GCGATGTTGG TGTGGAACCT GTCCGGGCAC
AGCGCGATCG AGATGCGGGA CCTGCACACG GGGGCCGGGC ACCGGGTGGA CATCGGGCAG
CGGGTGATGC CCGGCTGGTC GGTGTTGCCG GACGGTCATT CCGGCATCCT GGAACTCACC
GAGCCGGTCG CCCCGCGCAG CGTGTATCAC GTCGATCTGG TCGACCACCC GGACGATGCC
CCACCGGCGG CCTCCCGGAT CACCGGCCTG CCGGTGCCCC GGATACCGGC GGCCGAGCTG
TGCGTGCCCG AGCGGGTCAC CTTCGCCGCC GACGACGGTG TGCGGCTGCG CGGCCTGCTG
TACCGGCCGG CCGGCGAGCC CCCGTGGCCG ACCGTGATCC TGCTGCACGG CGGGCCGGAG
GCCGAGGAGC GACCGGCGTT CTCCATCCTG ATCCAGTCGC TGATCGCGGC CGGCTTCGCC
GTCTTCGCCC CCAACGTGCG CGGCTCCACC GGGTACGGCG CCAGCTTCAC CGCCCTGGAC
GACCTGGACC GGCGCGAGTC CTCGTTCGCC GACGTCAAGG CCGCGGTCGA CTACCTCCTC
GGCCGGGGCC TGGCCGCGCC GGGCCACATC GGCGTGCACG GCTGGTCTTA CGGCGGCTAC
CTGGCGATGG TCGCCGTGAC CCGCTTCGGC GAGCTGTTCG CCTCCGGTTC CAGCCACGCC
GGCATGTCCG ACCTGCGCAC CTTCTTCCGC CACACCGAGC CGTGGATGGC GGCGGCCTCG
GTGACCGAGT ACGGCGATCC GATCACGGAC GCGCAGCTGC TGGCCGATCT GTCCCCGTTG
GTTGGTTTTG CCGACCTGCG GGTACCCACG ATGTTTGTGC ACGGAGAAAG CGATACGAAC
GTCCCGGTCA TCGAGTCGGT GCAGGCCGCG GCGGCCCTGA CCGATGCCGG GGTGCCGACC
AGGCTGATGC TGTTACCAGG GGAAGGACAC ACCATCGTCG GACGCGAGGG TCGGATCGCC
TCGACCGAGG CCATCGTCGA CTGGCACCTG CGCTGGGCGG CCCGGTGA
 
Protein sequence
MLEHAPTHDQ LGLTPQLRTA IDRIAGVVGS WCPSISPDGR QIAYVTDRSG LPRLEVAHLD 
TAGECADRTP RQVSLPDQEV ISVAWSPDGQ WLAYLVSPRG FIRAELHAIR PDGTDHRALA
GLADLETAFA GTWTSLPHTY AFSLADGRSP DADVCFVDVQ TGAVRRVATG GFLLVTGVSP
DATRVLARRG PRGRRHLVLA DVPPIADAPL TPPRRLLEAD FPVEGTDVAE DGRFSADGGT
VYLRVGAGRE HPALGAVDLD PDGRPGRLRI LAQRDDSDLD AYAVLDRGAR AMLVWNLSGH
SAIEMRDLHT GAGHRVDIGQ RVMPGWSVLP DGHSGILELT EPVAPRSVYH VDLVDHPDDA
PPAASRITGL PVPRIPAAEL CVPERVTFAA DDGVRLRGLL YRPAGEPPWP TVILLHGGPE
AEERPAFSIL IQSLIAAGFA VFAPNVRGST GYGASFTALD DLDRRESSFA DVKAAVDYLL
GRGLAAPGHI GVHGWSYGGY LAMVAVTRFG ELFASGSSHA GMSDLRTFFR HTEPWMAAAS
VTEYGDPITD AQLLADLSPL VGFADLRVPT MFVHGESDTN VPVIESVQAA AALTDAGVPT
RLMLLPGEGH TIVGREGRIA STEAIVDWHL RWAAR