Gene Namu_4552 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_4552 
Symbol 
ID8450180 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp5064496 
End bp5066187 
Gene Length1692 bp 
Protein Length563 aa 
Translation table11 
GC content73% 
IMG OID645043593 
Productlong-chain-fatty-acid--CoA ligase 
Protein accessionYP_003203820 
Protein GI258654664 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGCGT TCTCCGACGC CCTGTGCACA GCGGCGCAGG GCCCCACCGG CATGACCACT 
GGCGAGCCGC ATGAGCCGGT TCGCACGTCC TGGGCCGATG TCCACGCCAA GGCCTGCGCC
GGGGCGCGGG TGCTGGCCGC CCACGGGATC GGGCCGGGTG ACGCGGTCGC GGTGCTGGCC
GCCAAGCCGT TCGAGGTGGC CCCGATCGCC CAGGCGGCCT GGTTGGCCGG GGCCTCGGTG
ACGATGCTGC ACCAGCCCAC CGCCCGGACC AATCTGATGA CCTACGCGCA GGACACGGCC
GCGGTGCTGT CCCTGGTCGG GGCCAAGGCA GCCGTGCTGG GCGACCCGTT CACCGAGTTC
GCGGAACTGT TGGACGGTTC GGGCGTGCTC GCGCTGACCG TCGACGATCT GCTGGCCGAG
CCCGGCGGTC CGGCTCCGGA CGTCGAGATC GGCGAGGACC TGCCGGCCCT GCTGCAGCTG
ACCTCCGGGT CGACCTCGAC CCCCAAGGCC GTGCGGATCA CCCACCGCAA CCTGTGGGCC
AACATCGAGG CGATGTGCCA GGCGGCCCAG ATCCGGCCGG GCGAGGTGAT GGTGTCCTGG
TTGCCGCTGT TCCACGACAT GGGCATGGTC GGCTTCCTGA CCCTGCCGAT GTGCCGGGGC
ATCGAGCTGG TCACCGTCAC CCCCACCGAT TTCCTGGCCT CGCCGTTGAT CTGGCCGACG
TTGATCAGCA AGTACCGCGG CACCATCACC GCCGCCCCCA ACTTCGCCTT CGCGCTGACT
GCGCGGGTGC TGGCCCGGCC GACCACCCGG GAGCTGGGCC TGGACCTGTC CTGCATGCGG
TTCGCGCTGA ACGGGGCCGA ACCGATCGAC GTCGCCGCGG TCCGGGCCTT CCTGGCCGCG
GGGGCACCGT TCGGGCTCCC GGAGACCGCG GTGGTCTGCG CCTACGGCAT GGCCGAGGCG
TCGTTGGCCG TGTCCTTCCA CCCCTGGGGC ACCCCGCTCA AGGTGGACAC CGTCGACGCG
CAGGCGTTGG AGATCGCCCG GCGGGCGGTG CCGGCCGAAT CGGGCCGCTC GTTCCCGGTG
CTGGGCCCGC CGCTGGACGG GATCGAGGTC GCCGTGCGGG GCCGCGACGG CGCGGTGCTC
GGCGACCGCG AGGTCGGGGT GCTGCACCTG CGCGGCGAGT GCATCACCGA GCAGTACCTG
ACCGTGGACG GGCCGGTGGC CACCCAGGAT GCCGACAAGT GGCTGGACAC TGGGGATCTC
GGCTACCTGG TGGACGGCGA GGTGGTGGTC TGCGGCCGGG TCAAGGACGT GATCATCATG
GGTGGGCGCA ACATCTACCC GACCGACATC GAGCGGGTGG CCCAGAGCAT CGACGGCGTT
CGAGCGGGTA ACGCGGTCGC GGTGCGGTGG ACGACGCCCA GCGGTCGCGA ATCGTTCGCG
GTGGCCGTCG AGTCCCGCGA GGCCGGTGAC CAGGACGCGG CCGAGCGCAT CCGGCAGGCC
GTGCGGTCGG CGGTGACCGC CGAGATCGGC GCCCGCCCGG CGACGGTGTC GGTGCTCCCG
GTCGGGAGTC TGCCCAAGAC TCCGTCCGGC AAGCTGCAGC GCTCCGCCGC GGCCCGGTTG
ATCACGCCAC CGCCGGCCGA GGCGCTGCCG GTTCGGCCCC CGGCGCCCGA GGGTCTGCCG
CTGCCCGGCT GA
 
Protein sequence
MSAFSDALCT AAQGPTGMTT GEPHEPVRTS WADVHAKACA GARVLAAHGI GPGDAVAVLA 
AKPFEVAPIA QAAWLAGASV TMLHQPTART NLMTYAQDTA AVLSLVGAKA AVLGDPFTEF
AELLDGSGVL ALTVDDLLAE PGGPAPDVEI GEDLPALLQL TSGSTSTPKA VRITHRNLWA
NIEAMCQAAQ IRPGEVMVSW LPLFHDMGMV GFLTLPMCRG IELVTVTPTD FLASPLIWPT
LISKYRGTIT AAPNFAFALT ARVLARPTTR ELGLDLSCMR FALNGAEPID VAAVRAFLAA
GAPFGLPETA VVCAYGMAEA SLAVSFHPWG TPLKVDTVDA QALEIARRAV PAESGRSFPV
LGPPLDGIEV AVRGRDGAVL GDREVGVLHL RGECITEQYL TVDGPVATQD ADKWLDTGDL
GYLVDGEVVV CGRVKDVIIM GGRNIYPTDI ERVAQSIDGV RAGNAVAVRW TTPSGRESFA
VAVESREAGD QDAAERIRQA VRSAVTAEIG ARPATVSVLP VGSLPKTPSG KLQRSAAARL
ITPPPAEALP VRPPAPEGLP LPG