Gene Namu_0809 details

Gene Information

Locus tag: Namu_0809
Symbol:
ID: 8446401
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 893851
End bp: 895728
Gene Length: 1878 bp
Protein Length: 625 aa
Translation table: 11
GC content: 73%
IMG OID: 645039946
Product: peptidase M28
Protein accession: YP_003200209
Protein GI: 258651053
COG category: [R] General function prediction only
COG ID: [COG2234] Predicted aminopeptidases
TIGRFAM ID: [TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED
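
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive (the record does not say so explicitly) and that the last codon of the CDS is a stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS whose final codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Start bp and End bp fields of this record.
length = gene_length_bp(893851, 895728)
print(length)                     # 1878
print(protein_length_aa(length))  # 625
```

Both results match the Gene Length and Protein Length fields, which is consistent with the gene being unspliced.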


Plasmid Coverage information

Num covering plasmid clones: 38
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAGCCA TGGAACTGCC GGCCCGGGCG GACGCCTCCG ACCTGGATCA GGCCCTGCCG 
CGGCAGCGCA CGGGCGGGCG GGCGCGGCCG GCCCTGCGGC TGATCCTGTT CGATCTGGGC
GACACCCTGG AAAGCGGCGA GCAGTTGCGA CCCGGCGCCC TGACAACCCT GCGGGCGATC
GAGAAGCTGG GCGACGTCGC GCACGTGGCC CTGCTTTCCG ATGTCGAGCA GCCGGCCTCG
GCGCGGGACG AATCGCGGAT CCGCCACGAA TACGAGGAGC TGCTGGGCCG GCTGGGCATC
CGGGCGTTCT TCGAGCCACT GGCCCGCTGG ATCACCCTGT CCAGCGAGGT CGGCGTGCGC
AAGCCGGCGC CGGCCACCTT TCGGCGGGCG ATGCGCAAGG CGGGCGCCGA CCTCGGCTTC
GGCGACGTCA TGTTCATCAC CGAGAACGAG GGTCATGTCC GGCGGGCCCG GGAGCTCGGC
ATGCGGGCCG TCCAGGTCCC CGGGCCCGGC GGTCCCGCGG CCGGGGCGGA CATCACCGGG
CTGGAGGAGT TGATCGGTGT CGTCGAGCAG TTCGTCAGTG GCCGGGAGGG CGAGGGCCCC
GCGGCGGCCA CCGGTGAACA GGACGGGGCG ACGATCCACC TTGCGGTGCT CGCCCCGGCC
GGCGCTTCGG ACGAGGCCGG TGGCTCGATC GGGCACCGGG TGCGCCTGGG GGCGCTGTCG
GTCCAGATCG ATACCGAACC GATCGGCACC GACTTGATCG ACACCGAACT CACTGGTCCC
CAGGCTTCCT CCGACGCGCG CGGGGCGCCG CGGTCCCTGG ACCGGTTGCA CCTGGTGGTG
CAGAACGGCC GCACCTTCCA GCAGGAGCAC CCGGACGTGC CGGTGCTGGC GGACCAGGGC
CGCTACCTCG TCGTCGACCT CGACCCGGCA ATCGCGCGCG AGCTGGACGG CCCCGATCAG
GTCTGTTTCA GCGTGCTGCC CCTGCCGCTG AACACGACTG TCTTCGCCCG GGCGGTGGCG
GAGCCGGTGG CCGAGCAGCC GTGGATCCGG GAGCTGGTCG ACCGGGTGGC GGCGGGGCGA
TTCCGGATGG ATCTGGACAA GCTGGTCGCC TTCGGCAGCC GGTATTCCAC GAGCAGCGCC
TACCGGGCCG CGGCCACCGC CACCCGCGAC GAACTGGCCG CCCAGGGCTA CGCGGCCGTC
CTGGTCCCGA TCTCGGTGCA GGGTCGGCAG TCGTGGAACG TCGTCGCCGA CCACCCGGGC
AGCGGGCCGC AGCCCCGCCC GGTGGTGCTG GTGACGGCCC ACCTGGACTC GATCAACCTG
GCCGGCGGCC CGCAGGCCAT GGCGCCCGGC GCCGACGACA ACGCGTCCGG ATGTGCCGGC
CTGCTCACCT TCGCCCGGGT GTTCGGCACC CACCCGGGGG CGGCCGACCT GCGGTTGATC
CTGTTCGGCG GGGAGGAGCA GGGCTTGTTC GGCAGCCGCC AGTACGTGGC CGGGCTCGAT
CCGGCCGAGC GTGCCCGGAT CGCGGCCGTG GTCAACATGG ACATGATCGG CACGCTGACC
ACGCAGCGGC CGACCGTGCT GATCGAGGGC GCCGCGGTGT CCCGGCCGGT GATGGACGGC
TTGAGCGCGG CCGCCGCGAC CTACACCAGC CTGATCGTGC AGACCAGCCT GCACCCGTAC
AACAGCGACC ACGTGCCGTT CCTGGACGCC GCGATCCCGG CCGTGCTGAC GATCGAGGGG
GCGGACGGCG CCAACGACCG GGTGCACACC GACCAGGACC TGGCCCGGTT CGTCGACGAC
GAGCTGGCCG TGCAGATCCT GCGGATGAAC GTGGCCTTCG TCGCCGAGCA GCTGGGCCGG
GCCGGTGACC CCGGCTAA
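
The listing above is broken into blocks of 10 bases, so it has to be stripped of whitespace before its length or GC content can be compared with the Gene Information fields. A minimal Python sketch of that step follows; the helper names are hypothetical, and only the first line of the listing is used to keep the example short.

```python
def clean_sequence(text: str) -> str:
    """Drop spaces and line breaks from a block-formatted sequence; upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence listing only; paste the full listing
# here to compare against the Gene Length and GC content fields.
excerpt = "ATGGCAGCCA TGGAACTGCC GGCCCGGGCG GACGCCTCCG ACCTGGATCA GGCCCTGCCG"
seq = clean_sequence(excerpt)
print(len(seq))                    # 60
print(round(gc_fraction(seq), 2))  # GC fraction of the excerpt only
```

The full listing has 31 lines of 60 bases plus a final line of 18, which adds up to the 1878 bp given in the Gene Length field.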
 
Protein sequence
MAAMELPARA DASDLDQALP RQRTGGRARP ALRLILFDLG DTLESGEQLR PGALTTLRAI 
EKLGDVAHVA LLSDVEQPAS ARDESRIRHE YEELLGRLGI RAFFEPLARW ITLSSEVGVR
KPAPATFRRA MRKAGADLGF GDVMFITENE GHVRRARELG MRAVQVPGPG GPAAGADITG
LEELIGVVEQ FVSGREGEGP AAATGEQDGA TIHLAVLAPA GASDEAGGSI GHRVRLGALS
VQIDTEPIGT DLIDTELTGP QASSDARGAP RSLDRLHLVV QNGRTFQQEH PDVPVLADQG
RYLVVDLDPA IARELDGPDQ VCFSVLPLPL NTTVFARAVA EPVAEQPWIR ELVDRVAAGR
FRMDLDKLVA FGSRYSTSSA YRAAATATRD ELAAQGYAAV LVPISVQGRQ SWNVVADHPG
SGPQPRPVVL VTAHLDSINL AGGPQAMAPG ADDNASGCAG LLTFARVFGT HPGAADLRLI
LFGGEEQGLF GSRQYVAGLD PAERARIAAV VNMDMIGTLT TQRPTVLIEG AAVSRPVMDG
LSAAAATYTS LIVQTSLHPY NSDHVPFLDA AIPAVLTIEG ADGANDRVHT DQDLARFVDD
ELAVQILRMN VAFVAEQLGR AGDPG
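
The protein sequence can be related back to the gene sequence by translating it codon by codon. The sketch below is a plain-Python illustration, not the method used to produce this record. It relies on the fact that translation table 11 makes the same codon-to-amino-acid assignments as the standard table, and it does not handle alternative start codons specially (this gene starts with ATG, so that does not matter here). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Amino acids in NCBI order (each codon position iterates T, C, A, G).
# '*' marks a stop codon. Table 11 shares these assignments with table 1.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 21 bases and last 12 bases of the gene sequence in this record.
print(translate("ATGGCAGCCA TGGAACTGCC G"))  # MAAMELP
print(translate("GACCCCGGCTAA"))             # DPG
```

The two outputs agree with the first seven residues (MAAMELP) and the last three residues (DPG) of the protein sequence listed above.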