Gene Namu_0236 details


Gene Information

Locus tag: Namu_0236
Symbol:
ID: 8445816
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 265887
End bp: 266969
Gene Length: 1083 bp
Protein Length: 360 aa
Translation table: 11
GC content: 71%
IMG OID: 645039381
Product: allantoicase
Protein accession: YP_003199656
Protein GI: 258650500
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG4266] Allantoicase
TIGRFAM ID: [TIGR02961] allantoicase
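
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon (neither is stated in the record). The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(265887, 266969)
print(length, protein_length(length))
```

With the Start bp and End bp values of this record, the sketch gives 1083 bp and 360 aa, which agrees with the Gene Length and Protein Length fields.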


Plasmid Coverage information

Num covering plasmid clones: 63
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 45
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGCGGA CCGATCTGAA CCTGGCCAGC CGGGCCCTGG GCGCATCGGT GCGCTACGTC 
AGCGACGAGT TCTTCGCCCC GTGCGAGGCG TTGCTGATGC CGGGCGCACC CGTGCACGAC
GTGAGCACCT TCGGCCCGCA CGGGAAGATC TACGACGGGT GGGAGACCCG GCGCCGGCGC
ACCCCCGGTC ACGACTGGGC CGTCGTGGCC CTCGGCGTTC CCGGCGTGCT GCACGAGATC
GTCGTCGACA CGGCGTTTTT CCGCGGCAAC TACCCGCCCG AGGTGTCGAT GGAGGCGACC
TGGCTGGACG GTGCGCCCGA CCGGGCCGCG CTGGATGCCG CGGAATGGAC GACGATCGTG
CCGATCTCCG CGGCCCGCGG CGACACCGCC AACACCTACC GCGTCTACAA CAGCCAGGCC
TTCACCCATG TCCGGCTCAA CATCTATCCC GACGGTGGGG TGGCCCGGCT GCGGGTCCTG
GGCACCGCGG TGCCCGACCC GCGGGTGCTG GGCGATCGGA TCGATCTGGC CGCGATTCAC
CACGGCGGGG ACATCGCCGA ATGCTCGGAC ATGTTCTATT CCGACGCCCG CCACGTGCTC
TATCCCGGCA TCGCCGAGTC GATGGCCGAC GGCTGGGAGA CCGCGCGGCG GCGGACCGCC
GGCAACGACT ACCTGGTGGT CACCCTGGCC GGCCCGGCCG AGCTGGAGTT CGTCACCATC
GACACCGGCT ACTTCCTGGG CAATGCCCCT GGGCGGGTGC GTCTTTCGGC CCGGCGAACC
GACACCTCGG CCTGGCGGGA GATCGTGCCC GAGCGTGCGA TCTCGCCGGA TGCGCGCAAC
CGGTTCCGGG TGCTGGCCGA CCGGCAGGTC ACCGCGGTCC GGGTGGACGT CTACCCCGAC
GGCGGGTTCT CCCGGCTGCA CCTGATGGGC AAGCTGCTGC CCGAGGCGCT GTCCCGGGCG
ATCGCCCACT GGCTAGAGCG GCTGCCGAAA TCGGCCTCGG CCACGGTGCT GGCCGAGGCC
GGCCTGGGCG GCATCCCGCT CGGTGAACTG CGCGAGGACC AGCTGCTGAC ACTGGCCTGG
TGA
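
The GC content reported in the Gene Information section can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted as plain text with the 10-base blocks separated by whitespace. The function name `gc_content` is hypothetical.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases, ignoring whitespace between blocks."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First line of the gene sequence above, copied verbatim.
first_line = "ATGAAGCGGA CCGATCTGAA CCTGGCCAGC CGGGCCCTGG GCGCATCGGT GCGCTACGTC"
print(f"{gc_content(first_line):.0%}")
```

Passing the full gene sequence instead of `first_line` gives a value to compare with the GC content field.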
 
Protein sequence
MKRTDLNLAS RALGASVRYV SDEFFAPCEA LLMPGAPVHD VSTFGPHGKI YDGWETRRRR 
TPGHDWAVVA LGVPGVLHEI VVDTAFFRGN YPPEVSMEAT WLDGAPDRAA LDAAEWTTIV
PISAARGDTA NTYRVYNSQA FTHVRLNIYP DGGVARLRVL GTAVPDPRVL GDRIDLAAIH
HGGDIAECSD MFYSDARHVL YPGIAESMAD GWETARRRTA GNDYLVVTLA GPAELEFVTI
DTGYFLGNAP GRVRLSARRT DTSAWREIVP ERAISPDARN RFRVLADRQV TAVRVDVYPD
GGFSRLHLMG KLLPEALSRA IAHWLERLPK SASATVLAEA GLGGIPLGEL REDQLLTLAW
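
The protein sequence can be related back to the gene sequence by translating it codon by codon. The following is a minimal sketch that builds the codon table by hand instead of using a sequence library. It relies on the fact that translation table 11 assigns the same amino acids as the standard code and differs only in the permitted start codons. The sketch does not handle alternative start codons, and the function name `translate` is hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order: first base varies slowest, each in T, C, A, G order.
# '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGAAGCGGA CCGATCTGAA CCTGGCCAGC"))  # MKRTDLNLAS
```

The ten residues produced from the first 30 bases match the start of the protein sequence listed above.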