Gene Namu_1783 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_1783 
Symbol 
ID8447387 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp1958413 
End bp1960080 
Gene Length1668 bp 
Protein Length555 aa 
Translation table11 
GC content71% 
IMG OID645040911 
Productprotein of unknown function DUF853 NPT hydrolase putative 
Protein accessionYP_003201162 
Protein GI258652006 
COG category[R] General function prediction only 
COG ID[COG0433] Predicted ATPase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.000734669 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.0773892 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACGACAC CAGCAGCGGC GGCGGAGCAG GCCGCCCAGC GCATCGCACA CGGATATGAC 
ACGACAGCGC CGGCCGTCGT GGTCGGGTCG GTGGTCATCG ACCAGACGGC CGAACCCACC
GCCCTGGTCC GGATCCCGCT GTCCATGTTC AACCGGCACG GCCTGGTCGC CGGCGCGACC
GGCACCGGCA AGACCAAGAC CCTGCAGGTG CTCGCCGAGC AGCTGTCCGC GGCCGGCGTC
CCGGTCTTCA TGCCTGACAT CAAAGGCGAC CTGACCGGGT TGGCCGTTTC CGGCGAACCC
AACGCCAAGA TCCAGCAGCG GGCCCGGGAC ACCGGCGACG ACTGGGCGCC GGCCACCGTT
CCGGTCGAGT TCCTCAGCCT GAACGGTCAG GGCACGGCCA TCCCGGTGCG GGCCACGATC
GACTCGTTCG GACCCATCCT GCTGTCCAAG GTGCTGGGCC TGAACGAGAC CCAGGAGTCC
ACCCTCGGGT TGATCGTGCA CTGGGCCGAC CAACGCAACC TGCCGCTGCT GGACCTGAAG
GACCTGCGCG CGGTGATCAT GCACCTGACC AGCGACGAGG GGAAGGAGGA CCTGCGCGGG
CTGGGTGGGG TGTCCAAGGC CACCGCCGGA GTCATCCTGC GGGCGGTCAC CAACCTCGAC
GCCCAGGGCG GCGACCGCTT CTTCGGTGAA CCCGAGCTGC AGGTCGAGGA CCTGCTGCGG
GTGGGCGCCG ACGGCCGCGG GATCGTCACC CTGTTCGAGG CCTCCGGGCT GCAGACCAAC
CCGGCCCTGT TCTCCACCTT CCTGATGTGG CTGCTGGCCG AGTTGTTCGA GCAGCTGCCC
GAGGTCGGCG ACGTCGACAA GCCCAAGCTG GTGTTCCTGT TCGACGAGGC GCATCTGCTG
TTCGCCGACG CATCCAAGGC GTTCCTGCAG GCCGTGCAGC AGACGGTCAA ACTCATCCGG
TCCAAGGGTG TCGGGGTGTT CTTCTGCACC CAGCTGCCCA CCGACGTGCC CGCCGCCGTG
CTCTCCCAAC TGGGCGCGCG GATCCAGCAC GCCCTGCGCG CATTCACCCC GGACGACCAG
AAGGCGCTCA AGGCCACGGT CAAGACGTAC CCGATCACCC AGGACTACGA CCTGTCGGCG
GCGTTGACCT CGCTGGGCAC CGGCGAGGCG ATCGTCACCG TGCTGTCCGA AAAAGGCTCG
CCCACCCCGG TCGCCTGGAC CCGGGTGCGC GCCCCGCGGA CATTGATGGC GCCGGCGCCG
CCGGCGACCG TCGCCGCCGC GGTGGCCGCC TCACCACTGT TCGCCCGGTA CGGATCGGAC
GTCGACCGCG AGTCGGCCTA CGAGATGCTC ACCGCGCGGC TCGCCCCGCC CGCCCCGACC
CCGAGCGCGC CGGCCCCGAG CGCGCCGGCA CCCTCGGGGT CCGCCGCGCC GCCGGCGCCC
TCGCTCGGGC CCGACGGCTG GCCGAGCCTG ACCCCGGAGC CGTACAACCC CTACCTGGAC
GAGCCGGCGC CGCCAACCAC CCGGTCGCAG CCCTCGCCCG GATCGGGCAG CGGTCTGGGG
GAGGTGCTGA GCAACCCGGC GGTCACCTCC TTCATGAAAT CGCTGGGCAG TTCGCTCGGC
GGGGCACTGG GCCGGTCGGT CTTCGGCACC CGCAAACGGC GCCGCTGA
 
Protein sequence
MTTPAAAAEQ AAQRIAHGYD TTAPAVVVGS VVIDQTAEPT ALVRIPLSMF NRHGLVAGAT 
GTGKTKTLQV LAEQLSAAGV PVFMPDIKGD LTGLAVSGEP NAKIQQRARD TGDDWAPATV
PVEFLSLNGQ GTAIPVRATI DSFGPILLSK VLGLNETQES TLGLIVHWAD QRNLPLLDLK
DLRAVIMHLT SDEGKEDLRG LGGVSKATAG VILRAVTNLD AQGGDRFFGE PELQVEDLLR
VGADGRGIVT LFEASGLQTN PALFSTFLMW LLAELFEQLP EVGDVDKPKL VFLFDEAHLL
FADASKAFLQ AVQQTVKLIR SKGVGVFFCT QLPTDVPAAV LSQLGARIQH ALRAFTPDDQ
KALKATVKTY PITQDYDLSA ALTSLGTGEA IVTVLSEKGS PTPVAWTRVR APRTLMAPAP
PATVAAAVAA SPLFARYGSD VDRESAYEML TARLAPPAPT PSAPAPSAPA PSGSAAPPAP
SLGPDGWPSL TPEPYNPYLD EPAPPTTRSQ PSPGSGSGLG EVLSNPAVTS FMKSLGSSLG
GALGRSVFGT RKRRR