Gene Namu_0039 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_0039 
Symbol 
ID8445618 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp44734 
End bp46053 
Gene Length1320 bp 
Protein Length439 aa 
Translation table11 
GC content76% 
IMG OID645039190 
Productformiminoglutamate deiminase 
Protein accessionYP_003199466 
Protein GI258650310 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID[TIGR02022] formiminoglutamate deiminase 


Plasmid Coverage information

Num covering plasmid clones64 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones55 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCCCGG CCGCCTACTT CGCCGGCCAC GCGCTGCTGC CGGACGGCCC GGCCGCCGGC 
GTGACGTTCG AGGTGGCCGC GGGTCGGTTC ACCGCGATCC GGACCGGCAG CCGCCCACCG
GCCGGCGCCA CCCGGCTGTC CGGCGTGGTC CTGCCCGGCC TGGCCAACGC GCACAGCCAC
GCCTTCCACC GGGCGCTCCG CGGTCGCACC CACCGCGGCG GCGGCACGTT CTGGACGTGG
CGGGAGGGTA TGTACCGGGT CGCCGCGGCC CTGGATCCCG ATTCCTACCT GGCCCTGGCC
CAGGCCACCT ACGCCGAGAT GGCCCTGGCC GGGATCACCA CCGTGGGCGA GTTCCACTAC
CTGCACCACG CCCCCGGCGG CCGCCGGTAC GCCGACCCGA ACGCGATGGG CGCGGCGTTG
ATCCAGGCCG CCGCGGAGGC CGGAATCCGG CTGACCCTGC TCGACACCTG TTACCTGGCC
GGCGGTCTGG GGCCCCCGGG GCACCTGGAG CTAGGGCCGG AGCAGCTGCG GTTCACCGAC
GGCGACGCCG ACGGGTGGGC CGCCCGGGTG GCCCGGCTGG CCGACGCCGA GTCGGTCCGG
ATCGGCGCGG CGGTGCACTC GGTGCGGGCC GTGCCGCGGG CGGCGCTGCC GGTGGTGGCC
GCGGCCGCAC ACGGACGGCC GCTGCACGTG CACCTGTCCG AGCAGCCGGC CGAGAACCAG
GCCGCCCTGG CGTTCTACGG CCGCACACCG ACCGAGTTGC TCGACGAGGC CGGGGTTCTC
GGCCCACTGA CCTCGGCGGT GCACGCCACC CACCTGACGG CGTCCGACAT CGCCGCGCTC
GGCCGGACCG GGACCACCTG TTGCCTGTGC CCGACCACCG AACGGGACCT GGCCGACGGG
ATCGGCCCGG CCCGCGCCCT GCTCGATGCC GGCGCACCGC TCAGTCTCGG GTCGGACCAG
CACGCGGTGA TCGACCTGAT CGAGGAGGCC CGCGCGCTGG AGATGCACGA GCGGCTGGCC
ACCCTTCATC GCGGCCGGTT CAGTCCGGAG CAGTTGCTGA CCGCGGCCAC TCGCCACGAC
AGCCTGGGCT GGACCGACGC CGGAAGCCTG GCCGTCGGCG GGCGGGCCGA CCTGGTCGCC
GTCCGGACCG ACACCGCCCG CACTGCCGGC GCCGACCCCG CGCAGATCCT GCTGGCCGCG
ACCGCCGCCG ACGTGGACAC CGTGGTGGTG GACGGACAAC CGGTGGTGAC CGGCGGCCGG
CATCGGCTCG GCGATGTCGG CGCGCTGCTG GGCGCCGCCA TCGAGCCCCT CTGGCGGTGA
 
Protein sequence
MSPAAYFAGH ALLPDGPAAG VTFEVAAGRF TAIRTGSRPP AGATRLSGVV LPGLANAHSH 
AFHRALRGRT HRGGGTFWTW REGMYRVAAA LDPDSYLALA QATYAEMALA GITTVGEFHY
LHHAPGGRRY ADPNAMGAAL IQAAAEAGIR LTLLDTCYLA GGLGPPGHLE LGPEQLRFTD
GDADGWAARV ARLADAESVR IGAAVHSVRA VPRAALPVVA AAAHGRPLHV HLSEQPAENQ
AALAFYGRTP TELLDEAGVL GPLTSAVHAT HLTASDIAAL GRTGTTCCLC PTTERDLADG
IGPARALLDA GAPLSLGSDQ HAVIDLIEEA RALEMHERLA TLHRGRFSPE QLLTAATRHD
SLGWTDAGSL AVGGRADLVA VRTDTARTAG ADPAQILLAA TAADVDTVVV DGQPVVTGGR
HRLGDVGALL GAAIEPLWR