Gene Namu_5003 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_5003 
Symbol 
ID8450634 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp5581393 
End bp5582853 
Gene Length1461 bp 
Protein Length486 aa 
Translation table11 
GC content74% 
IMG OID645044041 
Productprotein of unknown function UPF0089 
Protein accessionYP_003204265 
Protein GI258655109 
COG category[R] General function prediction only 
COG ID[COG4908] Uncharacterized protein containing a NRPS condensation (elongation) domain 
TIGRFAM ID[TIGR02946] acyltransferase, WS/DGAT/MGAT 


Plasmid Coverage information

Num covering plasmid clones58 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGACCG ACCGCGCCGC GGGCCGGCCC GCCGTGCCCA GTCCCGACGA GCTGAACCGC 
AAGGCCTGGG CGGCCGCCGG GCAATGGGGC CTCGACGGGT CGATGAGCGA GCTGGAGACG
CTCATGTGGC GCTCCGAGCG CCACCCCGAG CTGTCCTCGA CGATCGTGGC GATGATGCTG
CTGGACACCG AGCCGGAGTG GGACCGGTTA CGCGGCGCCC ACGAGTGGGC GGTCGAGCTG
ATCCGGCGGA CCCGGGAACG GGTGCAGGAG CCGATCCTGC CGGTGGGCCC GCCGGTCTGG
GTGCAGGACA CCTCGTTCGA CCTGGACAAC CACCTGCGCC GGGTCCGGCT GGATGCCCCC
GGCGACCAGG CCACCCTGCT GGCCTACGTC GAGCGGTTCG CGCTGGCCCC GCTCAACCGG
CGGCGCCCGC TGTGGGAGGC GGTGCTGGTC GAGGGCCTGG CCGACGGCCG GGCCGCCTAC
CTGCTCAAGC TGCACCATTC GGTGACCGAC GGGCTCGGCG GCATCCAGCT GCTCTCCCTG
GTGCAGAGCC GCACCCGCGA GCACACCGCC GACAAGCCGT CGTCGGCGGT CGGCACCTCG
ACCGGCCCGG TCAACCCCGC GGTGCTGGCC ACCCGGCAGC TGGCTGCCCG CCTCGGGCAG
ACTCCCGACC TGCTGATGCG CTCGCTCAAG TTCGGCGGCA CCGTGCTGTC CGACCCGCTG
CAGGCCGGCG AGGAGGCCGT CCGGTTCTCC GCGTCGCTGC GCCGGATGCT GCCGCCGCCG
CCCGCCGCCC CCTCGCCGCT GTTCCGCGGG CGCACCGGCC GCAACTGGCG CTTCGCCACC
CTGGAATGCC GGTTCAAGGA CCTGCGCGCG GCCGCCAAGC AGGCCAGCGG ATCGGTCAAC
GACGCCTACA TCGCGGCCCT GCTCGGCGGG CTGCGCCGCT ATCACGAGCG GCACGGCACC
AGCGTGGAGA GCCTGCCGAT GGCCATGCCG GTCTCCCTGC GCCGCGGGGA CGACCCGATG
GGCGGCAACA AGTTCGCCGG GGCACTGCTG GCCGGGCCGG TCGGGGTCAG CGACCCGGTG
GAACGCATCG CCGTCATCCG CGGGCTGGTG CTGACCCTGC GCACCGAACC CGCGCTGGAC
TCGTTCGGCC TGTTCGCGCC GCTGGTGAAC CTGCTGCCGT CCTCGGTCGG CGCCGCCGCC
TGGCGGCTCG GGTCGTCGGC CGACATGTCC GCGTCCAACG TGCCCGGGCT GCCGTTCGAG
TCCTACCTGG CCGGCGCCCA GGTGCAGCGC ATCTTCGCGT TCGGTCCCCT GCCCGGGGTG
GCCATCATGG TGGCGATGAC GACCCACGCC GGCACCTGCT GTCTGGGGTT CAACGTCGAC
GGCGACGCGG TCGAGGACCT GTCGGTGCTG ATGGAGTGCT TCCAGCAGGG CCTGGACGAG
GTGCTCGCGA TCGCCCGCTG A
 
Protein sequence
MTTDRAAGRP AVPSPDELNR KAWAAAGQWG LDGSMSELET LMWRSERHPE LSSTIVAMML 
LDTEPEWDRL RGAHEWAVEL IRRTRERVQE PILPVGPPVW VQDTSFDLDN HLRRVRLDAP
GDQATLLAYV ERFALAPLNR RRPLWEAVLV EGLADGRAAY LLKLHHSVTD GLGGIQLLSL
VQSRTREHTA DKPSSAVGTS TGPVNPAVLA TRQLAARLGQ TPDLLMRSLK FGGTVLSDPL
QAGEEAVRFS ASLRRMLPPP PAAPSPLFRG RTGRNWRFAT LECRFKDLRA AAKQASGSVN
DAYIAALLGG LRRYHERHGT SVESLPMAMP VSLRRGDDPM GGNKFAGALL AGPVGVSDPV
ERIAVIRGLV LTLRTEPALD SFGLFAPLVN LLPSSVGAAA WRLGSSADMS ASNVPGLPFE
SYLAGAQVQR IFAFGPLPGV AIMVAMTTHA GTCCLGFNVD GDAVEDLSVL MECFQQGLDE
VLAIAR