Gene Namu_0235 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_0235 
Symbol 
ID8445815 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp264475 
End bp265803 
Gene Length1329 bp 
Protein Length442 aa 
Translation table11 
GC content73% 
IMG OID645039380 
Productallantoinase 
Protein accessionYP_003199655 
Protein GI258650499 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0044] Dihydroorotase and related cyclic amidohydrolases 
TIGRFAM ID[TIGR00857] dihydroorotase, multifunctional complex type
[TIGR03178] allantoinase 


Plasmid Coverage information

Num covering plasmid clones65 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGAACTGG TGGTGCACGC CCCGCGGGCG CTCGTCGACG GGCGGGAACA GGCGGTCAGC 
GTGGCCGTGC GGTTGGGGCG CATCGTCGGC CTGGCGCCGC TGGGCACGGC CCCACCGAGC
CGGCTGGTCG CCACCCTGAC CGACGACGAG GTGCTGATCC CCGGTCTGGT GGACACCCAC
GTGCACATCA ACGAGCCCGG GCGCACCGAG TGGGAGGGCT TCGACACGGC CACCGCCGCG
GCCGCCGCGG CCGGGGTCAC CACGTTGATC GACATGCCGC TCAACTCGCT GCCGCCGACG
CTGAACGCGG CCGCGCTGGC CACCAAGCGG GCCGCCGCCC GCGGCCGGTG CCGGGTCGAC
GTGGGTTTCT GGGGCGGCTG CGTGCCGACC AACCTGGCCG ACCTGCCGGA GCTGTTCGCG
GCCGGCGTCT TCGGCGTCAA ATGCTTCCTG CAGGACTCCG GCGTCCCCGA ATTCCCCCCG
GTGACCACCG CCGAGATGCG GGCGGCCATG CGGACCATCG CCGAGGTCGG CGGGCTGCTG
CTGGTGCACG CGGAGGATCC CGGGGTGCTG CACACCTCGC CGACCCCGCA GGGCCGGGAT
TACCAGGCAT TCGTGGCCTC CCGGCCCGAG GCGGCCGAGA CCGCCGCCGT CCAAGCGTGT
ATCGAGGCGG CCGCGATCAC CCGCTGCCGC ACCCACATCG TGCACCTGTC CAGCGCCGGC
GGCGTCGCCC TGATCCGCCG GGCCAAGGCC GACGGCGTGC CGATCACCGC CGAAACATGT
CCGCATTACC TGGTTTTCGA TGCGGAGGAC ATCCCCGACG GGTCCCCGCA GTACAAGTGC
TGCCCGCCGA TCCGGGGCCG GGCCGATCAG GAGGCGCTCT GGGCCGGGCT GGCCGACGGC
ACGATCGACA TCGTGGTCAG CGACCATTCC CCCAGCACGG CCGAGCTCAA GCTGCTGGAA
GTCGGTGACC TGGGCCTGGC CTGGGGCGGG ATCGGTGGCC TGCAGTTCGG TTTCGCCGCG
GTGTGGGCGG AGGCGAACCG GCGCGGGGTG CCGTTGGCCG ACGTGGTGCG CTGGATGTCC
ACCGGCCCGG CGACGCTGGC CGGCCTGGAC CACAAGGGGC GGCTGGAGGT CGGCGCCGAC
GCCGATCTGG TGATCCTGGC CGAGGCCGAG ACCTTCACCG TCACCGACGG GATGATCCGG
CACCGCAACC GGATCACCCC GTACCTGGGC CGGGAGCTGC GCGGGGTGGT GCGGGCGCGC
TGGCTGCGCG GCACGGAGCT GTCCGACGAT CTGGTCGCCG GCCGGTTGAT CAGTCACGAA
CCCGGCTGA
 
Protein sequence
MELVVHAPRA LVDGREQAVS VAVRLGRIVG LAPLGTAPPS RLVATLTDDE VLIPGLVDTH 
VHINEPGRTE WEGFDTATAA AAAAGVTTLI DMPLNSLPPT LNAAALATKR AAARGRCRVD
VGFWGGCVPT NLADLPELFA AGVFGVKCFL QDSGVPEFPP VTTAEMRAAM RTIAEVGGLL
LVHAEDPGVL HTSPTPQGRD YQAFVASRPE AAETAAVQAC IEAAAITRCR THIVHLSSAG
GVALIRRAKA DGVPITAETC PHYLVFDAED IPDGSPQYKC CPPIRGRADQ EALWAGLADG
TIDIVVSDHS PSTAELKLLE VGDLGLAWGG IGGLQFGFAA VWAEANRRGV PLADVVRWMS
TGPATLAGLD HKGRLEVGAD ADLVILAEAE TFTVTDGMIR HRNRITPYLG RELRGVVRAR
WLRGTELSDD LVAGRLISHE PG