Gene Namu_4574 details


Gene Information

Locus tag: Namu_4574
Symbol:
ID: 8450202
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 5094077
End bp: 5096026
Gene Length: 1950 bp
Protein Length: 649 aa
Translation table: 11
GC content: 68%
IMG OID: 645043615
Product: AMP-dependent synthetase and ligase
Protein accession: YP_003203842
Protein GI: 258654686
COG category: [I] Lipid transport and metabolism
COG ID: [COG1022] Long-chain acyl-CoA synthetases (AMP-forming)
TIGRFAM ID:
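
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that contributes no residue; the record does not state either convention, but both are consistent with the listed values. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 5094077
end_bp = 5096026

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: one codon (3 bp) per residue, with the final stop codon
# encoding no amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under those assumptions the computed values agree with the listed Gene Length (1950 bp) and Protein Length (649 aa).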


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGCCCGG CGTCCGTTTT CGCCCGGTCC CGCAGGGGCG CAGGTGACGC ACGCCACGTA 
AAGTCGGACA TCAGACAGTT GCGGGGGCAG CTGTCCGTTT CCCGCAGCCG GCCCGTCGAG
GAGGACGTCC ACTTCATGAC TGCAGCCCTG ACCGAACGAG CGCCATCGTT CGGGCACCTT
TTCGTCGATC GGATCGAGAA GACGCCGAAC CGGGAAGCGT TCCGCTACCG CGTCGGCGAC
GCCTGGAAGT CGATGTCCTG GGCGCAGACC AAGGTGCGCG TGTTCAACAT CGCGGCCGGG
CTGATCAGCC TGGGGGTGCA GCCGCAGCAG CGGGTCGCCA TCGCCGGCAC CACCCGCATC
GAATGGCTGC TGTCCGACCT GGGCATCCTG TGCGGGGGGA TGGCGACCAC CACCGTCTAC
CCGACCACCG CCCGCGAGGA CGTCGCCTAC ATCCTGAGCG ACTCCGAGTC GGTCGTGCTC
ATCGCCGAGA ACGCCGAGCA GGCCAACAAG GCCCTGGACT CGGACCTGCC CGACCTCAAG
GCCGTCGTGC TGTTCGACGA CACCCCGGCC GACGTGCACC GGCACGAGGG GGTCGAGGTC
ATCACGCTGG CCGACCTGGA GCAGCGCGGC GCCGCCCTGC TGGCCGAGCA GCCGGCCGCG
GTCGACGACC GGATCGCCGC CTGCGGGCCG GAGGACCTGG CCACCCTCGT CTACACCTCG
GGCACCACCG GCAAGCCCAA GGGCGTGCGG CTGGTGCACG ACAACTGGGT GTACGAGGGC
AAGGGGGTCG CCGCCCTCAA CATCCTGGGC CCGGACGACG TGCAGTACCT GTGGTTGCCC
CTGTCGCACG TGCTGGGCAA GGTGCTCTCC GCGGTCCAGC TCGAATTCGG CTTCAGCACC
GCAGTCGACG GCGACCTGAC CCGCATCGTG GAGAACCTGG GCGTCATCAA GCCCACCTTC
ATGGGCGGGG CGCCGCGCAT CTTCGAGAAG GTTCGGGCCA AGGTCACCCT GACCGCGCAG
GGCGAGGGCG GCCTCAAGGC CAAGATCTTC GACTGGGCGA TCGGCGTCGG GGTCAAGGCC
TCCCGGATCC GCCAGCAGGG CGGGCAGCCC GGCTTCCTGC TGCGGCTGCA GCTGGCGATC
GCCGACAAGC TGGTGTTCAG CAAGGTCCGG GCCCGGATGG GCGGCCGGAT CCGGTTCTTC
GTGTCCGGCT CGGCGGCGCT GTCCGCCGAG GTGTGGGAAT GGTTCGACGC CGTCGGCATG
ACCATCCTGG AGGGCTACGG GCTCACCGAG ACCAGTGCCG CCGCCGCGGT CAACCTGCCC
GGCGACTCCC GCATCGGCAC CGTCGGCCCG CCGCTGCCCG GCACCCAGTT CAAGATCGCT
GAGGACGGCG AGGTGCTGAT CAAGGGACCG GGTGTGATGC GCGGTTATCA CAACCGGCCT
GATGCGACCG CGGAGGTGTT CTCCGACGGC TGGTTCCATT CCGGCGACAT CGGCGAGCTG
ACCGACGGGT ATCTCAAGAT CACCGACCGG AAGAAGGACC TGATCAAGAC TTCGGGCGGC
AAGTACGTCG CCCCACAGAA GATCGAGGTC ATCTTCAGCG CCGAATGCCC GTGGGCCGGA
CACATCGTGG TGCACGGCGA CGGGCGCCAT TTCGCCTCGG CGTTGATCAC CCTGGACGAC
GAGATGATCC ACGAGTGGGC CGAGAAGAAC GGCCTCGGCG GCAAGACCAC CGAGGAGCTG
GCCCGCGATC CCCAGGTCTA CGCCCTGATC GACGAGCACG TGCAGAAGCT GAACAGCCAG
CTGGAGCGCT GGGAGACGAT CAAGAAGTTC ATCATCCTGC CGCGCGACCT GACCATCGAG
GACGGCGAGC TGACCCCGTC GATGAAGGTG CGCCGCAAGC TGGTCGAGCA GAAGTACATG
AGCGAGCTGG ACTCGCTGTA CAAGGGCTGA
 
Protein sequence
MGPASVFARS RRGAGDARHV KSDIRQLRGQ LSVSRSRPVE EDVHFMTAAL TERAPSFGHL 
FVDRIEKTPN REAFRYRVGD AWKSMSWAQT KVRVFNIAAG LISLGVQPQQ RVAIAGTTRI
EWLLSDLGIL CGGMATTTVY PTTAREDVAY ILSDSESVVL IAENAEQANK ALDSDLPDLK
AVVLFDDTPA DVHRHEGVEV ITLADLEQRG AALLAEQPAA VDDRIAACGP EDLATLVYTS
GTTGKPKGVR LVHDNWVYEG KGVAALNILG PDDVQYLWLP LSHVLGKVLS AVQLEFGFST
AVDGDLTRIV ENLGVIKPTF MGGAPRIFEK VRAKVTLTAQ GEGGLKAKIF DWAIGVGVKA
SRIRQQGGQP GFLLRLQLAI ADKLVFSKVR ARMGGRIRFF VSGSAALSAE VWEWFDAVGM
TILEGYGLTE TSAAAAVNLP GDSRIGTVGP PLPGTQFKIA EDGEVLIKGP GVMRGYHNRP
DATAEVFSDG WFHSGDIGEL TDGYLKITDR KKDLIKTSGG KYVAPQKIEV IFSAECPWAG
HIVVHGDGRH FASALITLDD EMIHEWAEKN GLGGKTTEEL ARDPQVYALI DEHVQKLNSQ
LERWETIKKF IILPRDLTIE DGELTPSMKV RRKLVEQKYM SELDSLYKG
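
The relationship between the gene sequence and the protein sequence can be reproduced with a short translation routine. This is a minimal sketch, not the tool that produced the record: the helper name `translate` is hypothetical, and the codon table is built by hand in the usual TCAG ordering. It relies on translation table 11 assigning codons to amino acids in the same way as the standard code (the two differ in which codons may serve as start codons).

```python
from itertools import product

# Codon-to-amino-acid assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate("ATGGGCCCGG CGTCCGTTTT CGCCCGGTCC"))
# Last 30 bases of the gene sequence above, ending in the TGA stop codon.
print(translate("AGCGAGCTGG ACTCGCTGTA CAAGGGCTGA"))
```

The two fragments correspond to the first ten and last nine residues of the protein sequence shown above. Passing the full gene sequence to the same function would translate the whole coding region.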