Gene Namu_4133 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_4133 
Symbol 
ID8449759 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp4568653 
End bp4569786 
Gene Length1134 bp 
Protein Length377 aa 
Translation table11 
GC content75% 
IMG OID645043182 
Producttype I phosphodiesterase/nucleotide pyrophosphatase 
Protein accessionYP_003203411 
Protein GI258654255 
COG category[R] General function prediction only 
COG ID[COG1524] Uncharacterized proteins of the AP superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.0770784 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000385329 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGACCGCCG ACTGGCTGGT ACCCGGCTAC GACCAGCGCG CGCTGGGGGC CCTGCTACCC 
GGTGCGGCCG CCGCGCTCGG CCACGACCTG GGCCGGCCCG CCGTGTCGCT GCCGGCGGCC
GAACGGATCT GCGTGGTCGT CGTCGACGGG CTCGGTCACC GGATGCTGCT CGAACGCCCG
CGGGCCGCCC CCTTCCTGAG CACGCTGATG GACCCCGAGC AATGCCTGGT GGCCGGCGCG
CCCAGCACCA CCGCCACCTC GATGGCCTCC TTCGGCACCG GCCTGCCGCC CGGCCGGCAC
GGGCTGGTCG GGTACGAGGT GATGGACCCG GACCGCGGCG AGCTGCTCAA CGAGCTGCGC
TGGCATCCGG ACACCGATCC GCTGCGCTGG CAGCCGCACC CGACGGTGTT CCAGGAGCTG
GCCGCTCGCG GCGTCCCGGT CACCCAGATC GGCAACCCGG AGTTCTACGG GTCGGGGCTG
ACCGAGGCGG CCCTGCGCGG GGCGACGTTC GTCGGGCTCA CCCGGCTGCG CGACCGGGTC
GACGCCGCGG TCGACCGGTT GCGCGAACCG GGCCTGGTCT ACCTCTACTG GGCCGACGTG
GATTCGGTGG GCCACGTGCA CGGCTGGCGG TCCGCGCAGT GGCGCCGCAC GGTCCGGGCG
CTGGACCGGG AGCTGGCCCG GCTGTCCCGG TGCCTGCCGT CGGGCACGCT GCTGGTGATC
ACCGCCGACC ACGGCATGGT CGACGTCCCG CATGCCGAGC GGCTCGACCT GGCCGCCCAG
CCCGGCCTGT GGTCCCGGTT CCGGGTGCTG GGCGGCGAGG GCCGCTTCGC CCAGCTCTAC
TGCGAACCCG GCACCCCGGC CGATCGGGTG GCGGACCTGG CCCGGCAGCT GGCCGACTGG
ATCGGCGAGC GAGCCCATGT CTGCACCCGG GTCGCGGCGA TCGACGCCGG CTGGTTCGGC
CCGGTCGAGG AACGGGTCCG GCCGCGGCTG GGCGAGGTCA TCGTGGCCGG CCGGGAGCCG
TTCACCCTGA TCGATTCGCG CACGGCCCGG CCGCACACGC TGTCGCTGAT CGGGCAGCAC
GGCTCGCTGA CCCCCGACGA GCAACTGGTG CCTTTCCTGC GATCCGTCAG CTGA
 
Protein sequence
MTADWLVPGY DQRALGALLP GAAAALGHDL GRPAVSLPAA ERICVVVVDG LGHRMLLERP 
RAAPFLSTLM DPEQCLVAGA PSTTATSMAS FGTGLPPGRH GLVGYEVMDP DRGELLNELR
WHPDTDPLRW QPHPTVFQEL AARGVPVTQI GNPEFYGSGL TEAALRGATF VGLTRLRDRV
DAAVDRLREP GLVYLYWADV DSVGHVHGWR SAQWRRTVRA LDRELARLSR CLPSGTLLVI
TADHGMVDVP HAERLDLAAQ PGLWSRFRVL GGEGRFAQLY CEPGTPADRV ADLARQLADW
IGERAHVCTR VAAIDAGWFG PVEERVRPRL GEVIVAGREP FTLIDSRTAR PHTLSLIGQH
GSLTPDEQLV PFLRSVS