Gene Namu_3884 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_3884 
Symbol 
ID8449503 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp4278096 
End bp4279589 
Gene Length1494 bp 
Protein Length497 aa 
Translation table11 
GC content73% 
IMG OID645042932 
ProductAMP-dependent synthetase and ligase 
Protein accessionYP_003203168 
Protein GI258654012 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II 
TIGRFAM ID[TIGR01733] amino acid adenylation domain 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value0.445629 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.108787 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAACCCA CCCTGCTGAG CGACCTGCTG GACGCCGCCG CGGCCCGGCA GCCGCAGGCA 
CCCGCGCTGA CCGCCGGCGA CCGAACCGAC AGCTACCTCG ACCTGGCCAC CGAGAGCCGG
CGGGCGGCGG GCTGGCTGGC GGCGCGCGGC ATCGGCCGGC GGGACCGGGT GGTGGTGCTG
GCCCAGGACC ACCGGCTGCA GGTGCCGGTC CTGTTCGGCT GCGCCCGGAT CGGCGCGATC
TTCGTCCTGC TGCACGAGGA TCTGGCCGAC CCCACCGTCC GGCACGTCCT GGCGGACACC
GCACCCACCC TGTTGATCAC CGACCGCCCG GACGCGGCGG CCCAGGCGCA GCGGCTCGGC
GTGCCCGTGG TGTCGGCGGC GCAGACCCGG GACGGCATTG CCGCCGCGCC CCCCGCCGAT
CCGCCGCGGC CGCTGAGCGT CGATCCCGCG TGCCTGATCT ACACCTCGGG CAGCACCGGG
ATGCCGAACG CGGTGGTGTC CACGCACGCG CAGATGGTGT TCGCGGCCCG GGCGATCCAG
TCCCAGCTGG GCTACCGGCC CGACGACGTG GTGTTCTGCC CGCTACCGTT GTCGTTCGAC
TACGGTTTGT ACCAGGTGTT TCTCGCCGCC CTGGGGGGCG CGCAGCTGCA CCTGGGCTCG
GCCCAGGACG CCGGCCTGGG GTTGCTGCGC CGGCTGCGCA CGGTCGGTGC GACGGTGATG
CCGGCCCTGC CGTCCCTGGC CGCCATCCTG GCCCGGCTGC TCGAACGCTA TGGCGGCACC
GTGGACCTGC GCCTGATCAC CAACACCGGC GCCGCCATGC CCGCCAGCAC CATCGCCCGA
CTCCGCCGGC TGTTGCCGGA CGTGCGGATC CAGTTGATGT TCGGGCTGAC CGAGTGCAAG
CGGGTGAGCA TCATGCCCCC GGACGGCGAC CTGGACCGGC CCGGGGCGTG CGGCCGCCCG
CTGCCGGGCA CCGAGGTCGT CGTGGTCGAC GACGACGGCG CGGCCCTGCC CGCGGGCGAG
GTCGGCGAGT TCGTGGTCCG GGGACCGCAT GTGATGGCCG GGTACTGGCG ACGACCCGAA
CTGACCGCCC GGCGCTTCCA CCGGGTCGAC GACCTGTTCG TCGAACTGCG CTCCGGCGAC
TACGGATACC TCGACGAGGA CGGGTACCTG TACTTCGTCG GGCGGCGGGA CGACATCTAC
AAGTCGCGCG GGTTCCGGGT CAGTGCCACC GAGGTCGAGG CGGCCGCGCT GCGGGTGCCC
GGAGTGACCG CGGCGGCCGT CCTCGCGCCC ACCGCCCAGC ACCCCGAACC GGTGCTGTTC
GCGGTGACCG ACCTGGATCA GCCCACCTTC CACGCCCGGC TCCGGGAGCA GCTCGAGCAG
TACAAGATCC CGCGGCACTG CGAGCTCGTG GACGCGTTGC CGCTCACCCA GAACGGAAAG
ACCGACAAGA AGGCCCTGGC CTCCCGCGCG GAGGCCGGCC GGCTGATCGC CTGA
 
Protein sequence
MEPTLLSDLL DAAAARQPQA PALTAGDRTD SYLDLATESR RAAGWLAARG IGRRDRVVVL 
AQDHRLQVPV LFGCARIGAI FVLLHEDLAD PTVRHVLADT APTLLITDRP DAAAQAQRLG
VPVVSAAQTR DGIAAAPPAD PPRPLSVDPA CLIYTSGSTG MPNAVVSTHA QMVFAARAIQ
SQLGYRPDDV VFCPLPLSFD YGLYQVFLAA LGGAQLHLGS AQDAGLGLLR RLRTVGATVM
PALPSLAAIL ARLLERYGGT VDLRLITNTG AAMPASTIAR LRRLLPDVRI QLMFGLTECK
RVSIMPPDGD LDRPGACGRP LPGTEVVVVD DDGAALPAGE VGEFVVRGPH VMAGYWRRPE
LTARRFHRVD DLFVELRSGD YGYLDEDGYL YFVGRRDDIY KSRGFRVSAT EVEAAALRVP
GVTAAAVLAP TAQHPEPVLF AVTDLDQPTF HARLREQLEQ YKIPRHCELV DALPLTQNGK
TDKKALASRA EAGRLIA