Gene Namu_0506 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_0506 
Symbol 
ID8446089 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp560572 
End bp561969 
Gene Length1398 bp 
Protein Length465 aa 
Translation table11 
GC content69% 
IMG OID645039642 
Productguanine deaminase 
Protein accessionYP_003199914 
Protein GI258650758 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID[TIGR02967] guanine deaminase 


Plasmid Coverage information

Num covering plasmid clones44 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCACCA CGGCGGTCCG CGGCACCTTC CTCGACTTCG TCGACGACCC CTGGCGGCAT 
GTGGGCGCCG AACAGGAGTC GGCCCGGTTC CACGCCGACG GGCTGCTGGT CATCAGCGAC
GGTCGGGTCG CCGACTTCGG TCCCTACGCC GAGGTCCGGG CCCGGCACCC GCAGGTCCCG
GTCACCGAGA TCGCCGACCG GCTGATCCTG CCCGGCTTCG TGGACGGGCA CATCCACTTC
CCGCAGACCC GCGTGCTCGG CGCCTACGGC GAGCAGCTGC TGCCGTGGCT GCAGAAGTCG
GTGTTCCCCG AGGAACGCAA GTACGCCGAT CGCGAGTACG CCCAGCTGGG CGGGCAACAC
TTCTTCGACA ACCTGCTCGC CTCGGGCACC ACGACCATCC AGGCGTTCAC CTCCAGCGCG
CCGGTCTGTA CCGAGGAGCT GTTCCTGGAG GCCACCCGCC GCAACCTGCG GGTGATCTCC
GGCATCACCG CCATCGACCA GAACGCACCC GACTGGTTCA CCATCTCGCC GGCCGACTTC
GAGGCCGCGG CCAAGGACCA AATCGCCCGC TTCCACCGGC AGGGCCGCAA CCTGTACGCG
ATCACTCCGC GGTTCGCCTT CGGCGCGACC GAGGAGCTGT TCCGGACCTG CCAGCGGCTC
AAGCACGAGT ACCCGGACCT GTGGGTGCAC ACCCACATCT CGGAGAACCC GGCCGAGGTC
CGCGGCGTCC CGCCGCTGCA CCCCGGCTGC ACCGACTACC TCTCGGTCTA CGAGAAGTTC
GACCTGGTCG GCCCGAAATT CACCGGCGGG CACGGCGTCT GGCTGACCAA CGACGAGTTC
CGGCGGTTGT CCGCGAGCGG CGGCGCGGTC ACCTTCTGTC CCTGCTCGAA CCTGTACCTG
GGCAGCGGCC TGTTCCGGCT GGGCCGGGCG ACCGACCCGG AACACCGGGT GAAGCTCACC
TTCGGCACCG ACATGGGCGG CGGCAACCGC TTCAGCATGC TCAATGTGCT CGAGGACGCC
TACAAGGTCG GCATGCTCAA CAACACCCTG CTCGACGGCA GCGTCGTTCC CAGCGAGCAG
GACCTGGCCG AGTCCGAGCG CAACAAGCTC TCCCCGTACC GGGCGTTCTA CTCGATCACC
CTGGGTGGCG CCCAGGCGCT GGAGATCGAC GACCTGGTCG GCAATTTCGA CGTCGGCAAG
GAGGCCGACT TCGTCGTGCT CGACTGGAAC GGCGGCCCGC CGGCCACCGC CTGGCACATG
AGCCTGCTGC TGCCCGACGG GGCCCCGCGA ACGATGCAGG ACGCCGCCGA GGTGTTGTTC
GGGATCATGA TGGTCGGTGA CGAGCGGGCC GTCGAGCAGA CCTGGCTGAT GGGCGAGCGC
GCGTACCGGA AGCCCTGA
 
Protein sequence
MATTAVRGTF LDFVDDPWRH VGAEQESARF HADGLLVISD GRVADFGPYA EVRARHPQVP 
VTEIADRLIL PGFVDGHIHF PQTRVLGAYG EQLLPWLQKS VFPEERKYAD REYAQLGGQH
FFDNLLASGT TTIQAFTSSA PVCTEELFLE ATRRNLRVIS GITAIDQNAP DWFTISPADF
EAAAKDQIAR FHRQGRNLYA ITPRFAFGAT EELFRTCQRL KHEYPDLWVH THISENPAEV
RGVPPLHPGC TDYLSVYEKF DLVGPKFTGG HGVWLTNDEF RRLSASGGAV TFCPCSNLYL
GSGLFRLGRA TDPEHRVKLT FGTDMGGGNR FSMLNVLEDA YKVGMLNNTL LDGSVVPSEQ
DLAESERNKL SPYRAFYSIT LGGAQALEID DLVGNFDVGK EADFVVLDWN GGPPATAWHM
SLLLPDGAPR TMQDAAEVLF GIMMVGDERA VEQTWLMGER AYRKP