Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Namu_4574 |
Symbol | |
ID | 8450202 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | + |
Start bp | 5094077 |
End bp | 5096026 |
Gene Length | 1950 bp |
Protein Length | 649 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 645043615 |
Product | AMP-dependent synthetase and ligase |
Protein accession | YP_003203842 |
Protein GI | 258654686 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1022] Long-chain acyl-CoA synthetases (AMP-forming) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCCCGG CGTCCGTTTT CGCCCGGTCC CGCAGGGGCG CAGGTGACGC ACGCCACGTA AAGTCGGACA TCAGACAGTT GCGGGGGCAG CTGTCCGTTT CCCGCAGCCG GCCCGTCGAG GAGGACGTCC ACTTCATGAC TGCAGCCCTG ACCGAACGAG CGCCATCGTT CGGGCACCTT TTCGTCGATC GGATCGAGAA GACGCCGAAC CGGGAAGCGT TCCGCTACCG CGTCGGCGAC GCCTGGAAGT CGATGTCCTG GGCGCAGACC AAGGTGCGCG TGTTCAACAT CGCGGCCGGG CTGATCAGCC TGGGGGTGCA GCCGCAGCAG CGGGTCGCCA TCGCCGGCAC CACCCGCATC GAATGGCTGC TGTCCGACCT GGGCATCCTG TGCGGGGGGA TGGCGACCAC CACCGTCTAC CCGACCACCG CCCGCGAGGA CGTCGCCTAC ATCCTGAGCG ACTCCGAGTC GGTCGTGCTC ATCGCCGAGA ACGCCGAGCA GGCCAACAAG GCCCTGGACT CGGACCTGCC CGACCTCAAG GCCGTCGTGC TGTTCGACGA CACCCCGGCC GACGTGCACC GGCACGAGGG GGTCGAGGTC ATCACGCTGG CCGACCTGGA GCAGCGCGGC GCCGCCCTGC TGGCCGAGCA GCCGGCCGCG GTCGACGACC GGATCGCCGC CTGCGGGCCG GAGGACCTGG CCACCCTCGT CTACACCTCG GGCACCACCG GCAAGCCCAA GGGCGTGCGG CTGGTGCACG ACAACTGGGT GTACGAGGGC AAGGGGGTCG CCGCCCTCAA CATCCTGGGC CCGGACGACG TGCAGTACCT GTGGTTGCCC CTGTCGCACG TGCTGGGCAA GGTGCTCTCC GCGGTCCAGC TCGAATTCGG CTTCAGCACC GCAGTCGACG GCGACCTGAC CCGCATCGTG GAGAACCTGG GCGTCATCAA GCCCACCTTC ATGGGCGGGG CGCCGCGCAT CTTCGAGAAG GTTCGGGCCA AGGTCACCCT GACCGCGCAG GGCGAGGGCG GCCTCAAGGC CAAGATCTTC GACTGGGCGA TCGGCGTCGG GGTCAAGGCC TCCCGGATCC GCCAGCAGGG CGGGCAGCCC GGCTTCCTGC TGCGGCTGCA GCTGGCGATC GCCGACAAGC TGGTGTTCAG CAAGGTCCGG GCCCGGATGG GCGGCCGGAT CCGGTTCTTC GTGTCCGGCT CGGCGGCGCT GTCCGCCGAG GTGTGGGAAT GGTTCGACGC CGTCGGCATG ACCATCCTGG AGGGCTACGG GCTCACCGAG ACCAGTGCCG CCGCCGCGGT CAACCTGCCC GGCGACTCCC GCATCGGCAC CGTCGGCCCG CCGCTGCCCG GCACCCAGTT CAAGATCGCT GAGGACGGCG AGGTGCTGAT CAAGGGACCG GGTGTGATGC GCGGTTATCA CAACCGGCCT GATGCGACCG CGGAGGTGTT CTCCGACGGC TGGTTCCATT CCGGCGACAT CGGCGAGCTG ACCGACGGGT ATCTCAAGAT CACCGACCGG AAGAAGGACC TGATCAAGAC TTCGGGCGGC AAGTACGTCG CCCCACAGAA GATCGAGGTC ATCTTCAGCG CCGAATGCCC GTGGGCCGGA CACATCGTGG TGCACGGCGA CGGGCGCCAT TTCGCCTCGG CGTTGATCAC CCTGGACGAC GAGATGATCC ACGAGTGGGC CGAGAAGAAC GGCCTCGGCG GCAAGACCAC CGAGGAGCTG GCCCGCGATC CCCAGGTCTA CGCCCTGATC GACGAGCACG TGCAGAAGCT GAACAGCCAG CTGGAGCGCT GGGAGACGAT CAAGAAGTTC ATCATCCTGC CGCGCGACCT GACCATCGAG GACGGCGAGC TGACCCCGTC GATGAAGGTG CGCCGCAAGC TGGTCGAGCA GAAGTACATG AGCGAGCTGG ACTCGCTGTA CAAGGGCTGA
|
Protein sequence | MGPASVFARS RRGAGDARHV KSDIRQLRGQ LSVSRSRPVE EDVHFMTAAL TERAPSFGHL FVDRIEKTPN REAFRYRVGD AWKSMSWAQT KVRVFNIAAG LISLGVQPQQ RVAIAGTTRI EWLLSDLGIL CGGMATTTVY PTTAREDVAY ILSDSESVVL IAENAEQANK ALDSDLPDLK AVVLFDDTPA DVHRHEGVEV ITLADLEQRG AALLAEQPAA VDDRIAACGP EDLATLVYTS GTTGKPKGVR LVHDNWVYEG KGVAALNILG PDDVQYLWLP LSHVLGKVLS AVQLEFGFST AVDGDLTRIV ENLGVIKPTF MGGAPRIFEK VRAKVTLTAQ GEGGLKAKIF DWAIGVGVKA SRIRQQGGQP GFLLRLQLAI ADKLVFSKVR ARMGGRIRFF VSGSAALSAE VWEWFDAVGM TILEGYGLTE TSAAAAVNLP GDSRIGTVGP PLPGTQFKIA EDGEVLIKGP GVMRGYHNRP DATAEVFSDG WFHSGDIGEL TDGYLKITDR KKDLIKTSGG KYVAPQKIEV IFSAECPWAG HIVVHGDGRH FASALITLDD EMIHEWAEKN GLGGKTTEEL ARDPQVYALI DEHVQKLNSQ LERWETIKKF IILPRDLTIE DGELTPSMKV RRKLVEQKYM SELDSLYKG
|
| |