Gene Information |
Locus tag | Namu_2453 |
Symbol | |
ID | 8448064 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | + |
Start bp | 2704492 |
End bp | 2707272 |
Gene Length | 2781 bp |
Protein Length | 926 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 645041567 |
Product | GCN5-related N-acetyltransferase |
Protein accession | YP_003201811 |
Protein GI | 258652655 |
COG category | [C] Energy production and conversion |
COG ID | [COG1042] Acyl-CoA synthetase (NDP forming) |
TIGRFAM ID | |
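
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is not part of the database record. It assumes that `Start bp` and `End bp` are 1-based inclusive coordinates and that the gene length includes the terminal stop codon, which is not translated. The helper name `cds_lengths` is hypothetical and used only for illustration.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a gene span that
    includes the stop codon (assumptions, not stated in the record).
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per amino acid, minus the untranslated stop codon.
    protein_length = gene_length // 3 - 1
    return gene_length, protein_length


# Values copied from the Namu_2453 record above.
gene_bp, protein_aa = cds_lengths(2704492, 2707272)
print(gene_bp, protein_aa)  # 2781 926
```

Under these assumptions the result agrees with the listed Gene Length (2781 bp) and Protein Length (926 aa).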
| |
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00000141408 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0047804 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCTCGCC AGACGGGTTC GGGGCTGTCC CCGGCCAGGG GCCCGGACGC CGTCAGCGCA GACGCCCGCA CCGCGGACAC CTTCACCGCC GACACCTTCA CCGCGGACAC CTTCACCGCG GACACCTTCA CCGCGGACGC CCTCACCGTG GACGGTCGCA TCGTGCGGCT ACGCCCGGTG GGCCCCGGGG ATGCGGACGC GCTGCGTGAT CTGCATCGGG CCATCTCGGA CGACTCGCTC TACCTGCGCT TTTTCGGCCT GAGCCGGAGC GCGGCGATGG ACTACGTCGA CCGCCTGGTC ACACCGCAGG ACGGCCGATT GACGGTGGCC GCCTGGCTGG CCGATCGGCT GGTCGCCGTG GCCTCGTGCG AACGCACCGA TGCCAGCACG GCCGAGGTCG CGTTGCTGGT GGCCGACGAC TGCCACCGGC TGGGCATCGG CACGCTGATG CTGGAACACC TGGCGGCGCG GGCCCGGGCG CACGGCCTGC GCCGGTTCAC CGCGGAGATT CTCGCGCAGA ACGAGTTGGC CCTGCGGACC CTGCGAGATC TCGGTCTGGG CATGTCCACG ACCTGGGAGA GCGGGACGGC GCTGGTGGAA ATGGATCTGC GACCCGACGA CGACACGATC CGGTCGGTCC ACCGGCGGGG TTGGTCGGCC GAGCGAGCCA GTGTGCGGCA CCTATTGGCT CCGGCGTCGG TGGCCGTGAT CGGGGCCGGC CAGGACCCGT CCGCGGTCGG ACACCAGGTG CTGCGCAACC TCATCGACGG CGGCTTCACC GGATCGCTGG TGGCGGTCAA TCCGCATCAC GACGCAGTGC TCGGCGTACC GTGCGTGCCC TCGCCGGCCC AGCTGCCGTG CGGGATCGAC CTGGCCATCG TGGCGATCCC GGCCGCGGGC GTCCTGGACG TGGTGCGGGG CTGCGGCGCC CGCCACACGC GGGCGATGGT GATCCTCACG GCCGGATTCG GCGAGGCCGG CGCGGCCGGA CGGAGCCGTC AGGACGAGAT CCTGGCGGCC GCCCGGCAGG ACGGGATGCG CCTGGTCGGC CCGAATTGCC TCGGGTTGAT CAACACCGAC CCGGCCGTGC GGCTGAACGC GACCTTCACC GACCTGCCGG TGCCGGCGGG ACCGCTGGGC CTGGTGTCGC AGTCCGGCGC CCTGGGCATC GGCGTACTGG ACGCCGCCGG CCGGGCCGGA CCGGGGGTGG CGCAATTCGT CTCGATCGGC AATCGCGCCG ATGTGAGCAG CAACGACCTG CTGGCGGCCT GGGCGGACGA GGACCGGATC CGGGTGGTCG CCCTCTACCT GGAGTCGGTG GGCAACCCGC GCACGTTCGC CCGGGTCGCT CGCCGGGTGG CCGACCGCAA GCCGGTGATC GCGATCAAGT CGGGACGGTC GGCGGCCGGG CGGCGGGCCG GCCGCTCGCA CACCGCGGCC GCCGCGACGG GCGATGTCGT GATCGACGCA TTGTTCCGGC AGGCCGGCGT CCTGCGGGTG GACACGATGG AGCAGATGCT CGATGCCGCG CGGGTGCTGT GCGACCAGCC GGTCCCGAGC GGCTGCCGCC TGGCCGTCGT CGGCAATTCC GGCGGCCCCC AGATCCTGGC CGCGGATGCT GCGTCCGCGG CCGGGCTGGA GGTCGTCGAG TTGGCCCCGG GGACCCGGCG GGCCCTGCGC CAAGTGGTGC CCGACGCGGC ATCGGCGGAC AACCCGGTCG ACCTCGGGTC GGCAGCCACC CCGGTCCAGG TCGGCGATGC CCTGTCCGTG CTGCTCGCCG CGGACGAGGT CGACGCCATC 
CTGGCCGTCG TCACCCGGAC CGCGGTGACC GATCTGCCGG CCGTGCTGGA CCGCATCGCG GCCGCGGCCG GCGGCGACAA ACCCGTCGTG GCATGCTGTG TCGGCGAAAC GGCGGAATCC GTTCGCGTCC CCGGTGCGGC GGACCGGCGC CTACCGGTGT TCGGCTTCCC CGAACCGGCC GCGGCCGCCC TGGCCGTGGC GGCGCGGTAC GGGCGGATTC GGTCCGCCGA CGGGCCCGGG CACCCGGCCC GCCCCGCCGG GATCGATCGG GAGCAGGCGG CCGCGATCGT CACCGGAGCG CTGGCCGCCG GCGCCGGGTG GCTGACCGCC GCCGAGGTCG AGCGGCTGCT GGCCGCCTAT GGCTTGCCCA CCTGCCCGCA GCGGCCGGCC ACCGGCGTCG CGCAGGCCCT CATCGCGGCC GGGGAACTCG GCTACCCGGT CGTGGTCAAG CTCGCCGACC CGGGGCTGCA CAAGACCGAC GTCGGCGGGG TCCGGCTGGG CCTGGTCGAC GAGCGGGCCC TGCGCGCCGC CGTCGCCGAC CTGACCGGCG GCCGACCCCG GGCCCTGCTC CTGCAGCCGA TGGTGCCGCC GGGGTTGGAG TTCATCGTCG GGGCGGTGCA GCACGACCAC TTCGGGGCGG TGCTGATGGT GGGCGCCGGG GGCGTGTTCA CCGACCTGGT GGCCGACCGC GCCTTCCGGC TGGCTCCGGT CGGCCCGGGC GACGCCGCCG CGATGCTGGA CGAGCTGCGC ATGGCGCCGA TGCTCGACGG TTATCGCGGC GCCCCGGCCG TCTCCCGAGA ACGGTTGGCC GACCTGCTGA TCCGGGTTGG GTCGGTGGTC GAGGACCTCG GCGAGGTCGC CGAACTGGAC CTGAACCCGG TCATCGGCCG AGGGACCGAG CTGATGATCG TCGATGCGCG GATCCGGGTC GCCGCGGTCC CACCCCGCCC CGACCCGCTG GTCCGGCGGC TGCCACTCTG A
|
Protein sequence | MARQTGSGLS PARGPDAVSA DARTADTFTA DTFTADTFTA DTFTADALTV DGRIVRLRPV GPGDADALRD LHRAISDDSL YLRFFGLSRS AAMDYVDRLV TPQDGRLTVA AWLADRLVAV ASCERTDAST AEVALLVADD CHRLGIGTLM LEHLAARARA HGLRRFTAEI LAQNELALRT LRDLGLGMST TWESGTALVE MDLRPDDDTI RSVHRRGWSA ERASVRHLLA PASVAVIGAG QDPSAVGHQV LRNLIDGGFT GSLVAVNPHH DAVLGVPCVP SPAQLPCGID LAIVAIPAAG VLDVVRGCGA RHTRAMVILT AGFGEAGAAG RSRQDEILAA ARQDGMRLVG PNCLGLINTD PAVRLNATFT DLPVPAGPLG LVSQSGALGI GVLDAAGRAG PGVAQFVSIG NRADVSSNDL LAAWADEDRI RVVALYLESV GNPRTFARVA RRVADRKPVI AIKSGRSAAG RRAGRSHTAA AATGDVVIDA LFRQAGVLRV DTMEQMLDAA RVLCDQPVPS GCRLAVVGNS GGPQILAADA ASAAGLEVVE LAPGTRRALR QVVPDAASAD NPVDLGSAAT PVQVGDALSV LLAADEVDAI LAVVTRTAVT DLPAVLDRIA AAAGGDKPVV ACCVGETAES VRVPGAADRR LPVFGFPEPA AAALAVAARY GRIRSADGPG HPARPAGIDR EQAAAIVTGA LAAGAGWLTA AEVERLLAAY GLPTCPQRPA TGVAQALIAA GELGYPVVVK LADPGLHKTD VGGVRLGLVD ERALRAAVAD LTGGRPRALL LQPMVPPGLE FIVGAVQHDH FGAVLMVGAG GVFTDLVADR AFRLAPVGPG DAAAMLDELR MAPMLDGYRG APAVSRERLA DLLIRVGSVV EDLGEVAELD LNPVIGRGTE LMIVDARIRV AAVPPRPDPL VRRLPL