Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Spro_3748 |
Symbol | |
ID | 5603605 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Serratia proteamaculans 568 |
Kingdom | Bacteria |
Replicon accession | NC_009832 |
Strand | + |
Start bp | 4137616 |
End bp | 4140264 |
Gene Length | 2649 bp |
Protein Length | 882 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640939300 |
Product | GCN5-related N-acetyltransferase |
Protein accession | YP_001479972 |
Protein GI | 157371983 |
COG category | [C] Energy production and conversion [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG1042] Acyl-CoA synthetase (NDP forming) [COG1670] Acetyltransferases, including N-acetylases of ribosomal proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0223555 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.236384 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCAGC GAGGGTTAGA AGCGCTTTTA CGTCCTAAAT CGATTGCCGT TCTCGGCGCT TCACAACAGC CCGGCCGCGC CGGTAACCTG ATGATGAGCA ACCTGCTGGC CGGCGGTTTC AACGGGCCGG TGATGCCGGT CACACCGCGC TACAAGGCAG TGTGTGGCGT CATGGCCTAC CCGGACGTCG CCAGCCTGCC CATCAGCCCC GATTTGGCGA TACTTTGTAC CCATGCGCAG CGCAACCTGG CATTGATTGA AGAGCTGGGC AAGCGCGGCT GCAAAACTGC CATTGTGCTC TCCTCCCCGC CGGAACAGTT CGCCGAGCTG AAAGCCTGTG CGCAGCGTTA TTCGATGCGC CTGCTGGGGC CAAACAGCCT GGGCCTGCTG GCCCCCTGGC AAGGCATTAA CGCCAGTTTC TCACCGGTCC CCATTCTCAA AGGCAAGCTG GCGTTTATTT CCCAGTCCGC CGCCGTGGCC AACACCATTC TCGACTGGGC GCAACAGCGT GAGGTTGGCT TTTCCTATTT TATCGCGCTG GGCGATAGCC TGGATATTGA CGTCGACGAC CTGCTGGACT TTCTGGCGCG CGACAGTAAA ACCAGCGCCA TTTTGCTGTA TCTGGAAAAC ATCAGCGATG CACGGCGCTT TTTATCCGCC TCGCGCAGCG CCTCACGCAA CAAACCGATT CTGGTGGTCA AAAGCGGCCG CAGCCAGCAG GCCCAGCTGT TGCTTAACAG CCAACAGGGG CTGGATGCCG CCTACGATGC CGCTATCCAG CGCGCCGGTT TGTTGCGCGT GCAGGATACC CATGAGCTGT TCTCCGCAGT AGAGACCCTG AGCCATATGC ACCCGCTGCG CGGCGAGCGG CTGATGATCG TCAGCAACGG TGCGGCACCG GCCGCTATGG CTCTGGACGA GCTGTTGGGC CGTAATGGCA AGCTGGCGCA GTTGGGCGAC GAGATCCTGG CGCAGCTTGG CGAGGCTTTG CCCGAATTTA TCCAAGCCGG CAACCCGATA GACCTGCGTG ATGACGCGAC ACCGCAACGT TATCTGGCGG CGGTTAAAAC GCTGCTCGAC AGCCATGACT ACGACGCCTT GCTGCTGATC CACGCTCCAA GCGCGGCGGC CCCGGGCACC ATTACTGCCG AACGTATTAT CGAAGCGGTA CGTCAGCACC CGCGGGGCAA ACGCATTACG CTACTCACCA ATTGGTGCGG TGAATATTCC TCGCAGGAAG CCCGACGGCT GTTTACCGAG GCCGGCATAC CCACCTATCG CACCCCGGAA GGCGCGGTGA CCGCGTTTAT GCATATGGTG GAATACCGCC GTAATCAGAA ACAGCTAAAA GAAACCCCGG CGCTCCCGGT CGGCCTGACC GCCAACACCG CCGATGCCCA CCGATTAATT CAACAGGCGC TGGCCGAAGG CGCTACGCAG CTCGATACTC ACGAGGTGCA ATCGATTCTG CAGGCCTATG ATCTGACCAC GCTGCCAACC TGGATTGCCG AAGACAGCGC CGAAGCGGTG CATATCGCCG AGCAAATTGG CTACCCGGTA GCCCTGAAAT TGCGTTCGCC GGATATCCCC CACAAGTCTG AAGTCCAGGG CGTTATGCTC TACTTGCGCA CCGCGACCGA AGTGCAGCGG GCAGCAGATG CGATCCTTGA CCGGGTGAAA CGCACCTATC CGCAGGCACG TATCCACGGC TTGCTGGTGC AGAGCATGGC CAACCGCGCC GGGGCACAAG AGCTGCGTAT TGCGGTGGAA CAAGATGCCA TTTTTGGCCC ATTGATCATG CTGGGCGAAG GGGGTATCGA ATGGCGGCAG GAAAATCAGG TCGCGGTGGC GCTGCCGCCG CTGAATATGG CGCTGGCGCG CTATCTGGTA TTGCAGGCGG TAAAAGGAGG AAAAATCCGC GGGCGCAGTG CCTTGAGACC GCTGGATATT CCCGGGCTGA GCCGCCTGTT GGTACAAGTC TCCAATCTGA TCCTCGACTG TCCGGAAATC ACCCGGCTGG ACATCCACCC GGTACTGGCC TCGGGCAGCG AGTTCACCCT GCTGGATGTT TCCATGCAGC TTGCCCCCTT TGTGGGTGAC CCGCAGGCAA GGCTGGCGAT CCGTCCTTAT CCCCATGAGC TGGAAGAAAC CATTGGGTTG AAAGACGGTT CGCAATGCCT GTTCCGGCCG ATCCTGCCGG AAGATGAACC TGCGTTGAAA CACTTTATCG ATCGCGTGAC CAAGGAAGAC CTCTACTATC GCTACTTCAG TGAGATCAAC GAGTTTACCC ATGACGATTT GGCTAACATG ACGCAGATCG ACTACGATCG AGAAATGGCT TTTGTAGCAG TGCGTGATGA ACAGATTATA GGCGTAACCC GCGCGCTATC CGACCCGGAC AATACCGATG CAGAATTTGC CGTGCTGGTG CGTTCCGATC TGAAAGGACT CGGTCTGGGA CGCCAACTGC TGGAAAAGAT GATCGCCTAT GCCCGAGCCC ATGGGTTGAC CCGTCTGACC GGTATCACCA TGCCGAACAA CCGTGGAATG ATCGGACTGG CACAGCGGTT GGGTTTTGGC ATTGATGTGC AGATCGAGGA CGGTATCGTC AATTTGACCC TGCCGCTGCA GGCAGAGGAA GCGCAGTGA
|
Protein sequence | MSQRGLEALL RPKSIAVLGA SQQPGRAGNL MMSNLLAGGF NGPVMPVTPR YKAVCGVMAY PDVASLPISP DLAILCTHAQ RNLALIEELG KRGCKTAIVL SSPPEQFAEL KACAQRYSMR LLGPNSLGLL APWQGINASF SPVPILKGKL AFISQSAAVA NTILDWAQQR EVGFSYFIAL GDSLDIDVDD LLDFLARDSK TSAILLYLEN ISDARRFLSA SRSASRNKPI LVVKSGRSQQ AQLLLNSQQG LDAAYDAAIQ RAGLLRVQDT HELFSAVETL SHMHPLRGER LMIVSNGAAP AAMALDELLG RNGKLAQLGD EILAQLGEAL PEFIQAGNPI DLRDDATPQR YLAAVKTLLD SHDYDALLLI HAPSAAAPGT ITAERIIEAV RQHPRGKRIT LLTNWCGEYS SQEARRLFTE AGIPTYRTPE GAVTAFMHMV EYRRNQKQLK ETPALPVGLT ANTADAHRLI QQALAEGATQ LDTHEVQSIL QAYDLTTLPT WIAEDSAEAV HIAEQIGYPV ALKLRSPDIP HKSEVQGVML YLRTATEVQR AADAILDRVK RTYPQARIHG LLVQSMANRA GAQELRIAVE QDAIFGPLIM LGEGGIEWRQ ENQVAVALPP LNMALARYLV LQAVKGGKIR GRSALRPLDI PGLSRLLVQV SNLILDCPEI TRLDIHPVLA SGSEFTLLDV SMQLAPFVGD PQARLAIRPY PHELEETIGL KDGSQCLFRP ILPEDEPALK HFIDRVTKED LYYRYFSEIN EFTHDDLANM TQIDYDREMA FVAVRDEQII GVTRALSDPD NTDAEFAVLV RSDLKGLGLG RQLLEKMIAY ARAHGLTRLT GITMPNNRGM IGLAQRLGFG IDVQIEDGIV NLTLPLQAEE AQ
|
| |