Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_0252 |
Symbol | |
ID | 4711132 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | - |
Start bp | 287595 |
End bp | 290243 |
Gene Length | 2649 bp |
Protein Length | 882 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 639854712 |
Product | GCN5-related N-acetyltransferase |
Protein accession | YP_001001848 |
Protein GI | 121997061 |
COG category | [C] Energy production and conversion [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG1042] Acyl-CoA synthetase (NDP forming) [COG1670] Acetyltransferases, including N-acetylases of ribosomal proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0975681 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGTAC GCAATCTCGA AAGCCTCTTC CAGCCCCGCT CCATCGCCCT GATCGGCGAG GCCAGCGATA CGGACCGACG CATCCTGCGC AACCTGCTCA GCGAGCCCTT CCAGGGGCCG GTGATGCCGG TCATGGCGGG GATCGACTCG ATGGGCGGGG TGCCCGTCTT CCCCGGCATC GACGATCTGC CTGCGGTACC GGATCTGGCG GTCATTACCC GGCCGCTCGA GGAGGTGCCG GCGCTGATCA CCGCCCTCGG TGAGCGTGGC ACCCGGGGCG TGATCATCAC CCGCGCCGTA CCTCGGGATT ACACCCGGGA GCAGCGCGAG GCGCTGGAGC AGACCATCCT CGAGGCGTCG CGCCCCCATC TGGTGCGCGT GGCCGGGCCG GGGAGCAGCT GCATCAGCGT CCCCCACCGG GGGCTGCACG CCTCGTCGTT GCCGATCACC CTGGCCGCCG GCAAGGCCGC CCTGGTCACC AAGTCCTCGG CCATGGCCGG CGCCGCCCTG CGCTGGTGCC GCGAGAACGA CTCGGGGCTG TCGCACATCA TCCACATCGG CGCCGCCACC GACGTGGATC TGGGTGACAG CTTCGACTAC CTGGCCACCG ACCATCGGGC CCGGGCGGTG ATCGTCTACA TCGAGCGAGT GCGACGGGCG CGCAAGTTCA TGTCGGCGCT GCGCCGGCTG GCGCGCATGA AGCCGGTGAT CATCCTCAAG CCGCCCCAGG TCGGCGAGGA GGAGATCGAC GACTTCGTCT ACGATGCCGC CTTCCGCCGT GCCGGCGTGG TCCGGGTGGA CGACCTGGAG GAGCTGTTCA GCGCCGTGGA GATCCTCGAC AGCAGCCGGG TCCCGGGCCG CGACGGGCCG CTGGCCATCT TCGGTAACAG CCGCAGTCTG GGGCTGCTGG CCAGCAACGC CCTGCGCCGC CACGACAGCA GCCCGCAGCA GCTGGCGCCG GAGAGCAACG AGGCGCTGCA GGAGCTCGCC CGGCGCAACG CCAGCAGCCA CAACCCGCTG GATATGGGGG TGGACGCCGG CGCCGACGCC TACGACCGGG CGCTGGAGAT CCTCGGCAAG GACCGGCGCA TCAGCGGCAT CCTGGCCCTC AAGGCGCCCG GTGCCAGTGA CGACGGCGAT GCGGTGGCCG AGGTGGTGGC CCGCCACGCC AAGCGGCTGC GCAAGCCGGT GCTCGCCGCC TGGGACCCCT CCGCCGGGGA GGCGGGGCTG CGCCGGCTCG CGGCGGCGGT GCCGGCCTTC TCCTACCCCG AGGAGGCGGT GCGCGCTTAC AGCCGCCTCC TGCAGTACCG GCGCAGCCAG ACCCTGCTGA TGCAGACGCC GCCCTCGCTC CCCGAGGACT TCACGCCGGA CTACGAACAG GCGCGGCTGA TCCTCTCGGC GGCGCTGACC GCCGGGCGCG ACCATCTCAA CGAGTACCAG ACGGGGCGGC TGCTCAGCGC CTACGGCATC CCCTGCGTGG ATAGCCGCCG GGCCAATGAC CCGGAGGAGG CCGCGGCGGT GGCCGCCGAG CTGAACCAGC CGGTGGCCCT GAAGCTGATG TCCCCGGAGG TGCAGCTCAA GTCCCAGGCC GGCAGTGTGG CCCTGGATCT GACCGGCCCC GAGGCGGTCC GCGCCGAGGC CGAGGCGATG CTGGCCCGCC TGCACGAGCT CCGCGGTGAG GACGTGGCCG TGGACGGCTT CGCCGTGCAG CCGATGACCC GCCGCGACGG GGCCTTCGAG CTGACCCTCG GGGTGCGCCC CGGCGGGCCC TTCGGGCCGG TGCTCTACTT CGGCCACGGC GGCACTGAAA CCGAGGCCAT CGCCGACTGG GCCTGCGGCC TGCCGCCGCT GAACATGCAC CTGGCCCGGG AGATCATGCA GCGCACCCGG ATCTACGGCC TGCTGGTCGA CAGCGGGCTG CGCAAGCCGG ACCTGGACGC CGTGGCCCTG AGCCTGGTCA AGCTCTCGCA GCTGGTGATC GACTTCGGCG CCATCGAGTC GCTGGATATC AATCCCCTGT GGGCCACCGG CCACGGCGTG CTGGCCCTGG ACGCCGGGGT GGGGATCCAG CCCCAGCGCG GCGATCCCGC CGAATCATTG TGTATCCGGC CGTATCCCAC CGAACTCAAC GAGCGGATCG AGCTCCCCGA CGGCCAGTGC CTGAACCTGC GCCCGGTGCT GCCCGAGGAC GGGCCGCAGC TCAACGCCAT GGTCGAGCGC ACGCCGCCGG AGCAGGTGCG CATGCGCTTC TTCCAGGCGC TCAAGAGCCT GCCCCAGGAG CTGGCGGCGC GGCTGACGCA GATCGACTAC GACCGCGAGA TGGCCCTGGT CATCACCCGC GACGGCATCC CCGGCCGGGC GCCGCTGCTG GGGATGGTGC ACATCAGCGC CGACCCCGAT CTGGAGCAGG CCGAGTACGA CATCATGCTC GACCCCTCGG TGGCCGGCAT GGGGCTGGGG CCGATGCTCA TGCACCGGAT CGTCGACTAC GCCCGTCAGC GCGGCATCCG CGAGATCTAC GGCGAGGTGC TGCGCGAGAA CGAGCCGATG CTCAAGATCA ACGAGGCCAT GGGCTTTCGC ATCGAGCCGA GCAGCGACGA CCCGGGGCTG ATGCACGTGG CCCTGCGTCT GGATGGCAGC GGCGCGTGA
|
Protein sequence | MTVRNLESLF QPRSIALIGE ASDTDRRILR NLLSEPFQGP VMPVMAGIDS MGGVPVFPGI DDLPAVPDLA VITRPLEEVP ALITALGERG TRGVIITRAV PRDYTREQRE ALEQTILEAS RPHLVRVAGP GSSCISVPHR GLHASSLPIT LAAGKAALVT KSSAMAGAAL RWCRENDSGL SHIIHIGAAT DVDLGDSFDY LATDHRARAV IVYIERVRRA RKFMSALRRL ARMKPVIILK PPQVGEEEID DFVYDAAFRR AGVVRVDDLE ELFSAVEILD SSRVPGRDGP LAIFGNSRSL GLLASNALRR HDSSPQQLAP ESNEALQELA RRNASSHNPL DMGVDAGADA YDRALEILGK DRRISGILAL KAPGASDDGD AVAEVVARHA KRLRKPVLAA WDPSAGEAGL RRLAAAVPAF SYPEEAVRAY SRLLQYRRSQ TLLMQTPPSL PEDFTPDYEQ ARLILSAALT AGRDHLNEYQ TGRLLSAYGI PCVDSRRAND PEEAAAVAAE LNQPVALKLM SPEVQLKSQA GSVALDLTGP EAVRAEAEAM LARLHELRGE DVAVDGFAVQ PMTRRDGAFE LTLGVRPGGP FGPVLYFGHG GTETEAIADW ACGLPPLNMH LAREIMQRTR IYGLLVDSGL RKPDLDAVAL SLVKLSQLVI DFGAIESLDI NPLWATGHGV LALDAGVGIQ PQRGDPAESL CIRPYPTELN ERIELPDGQC LNLRPVLPED GPQLNAMVER TPPEQVRMRF FQALKSLPQE LAARLTQIDY DREMALVITR DGIPGRAPLL GMVHISADPD LEQAEYDIML DPSVAGMGLG PMLMHRIVDY ARQRGIREIY GEVLRENEPM LKINEAMGFR IEPSSDDPGL MHVALRLDGS GA
|
| |