Gene Information |
Locus tag | EcSMS35_2737 |
Symbol | |
ID | 6143910 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 2814997 |
End bp | 2817657 |
Gene Length | 2661 bp |
Protein Length | 886 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641617608 |
Product | CoA-binding domain/acetyltransferase domain-containing protein |
Protein accession | YP_001744773 |
Protein GI | 170681301 |
COG category | [C] Energy production and conversion; [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG1042] Acyl-CoA synthetase (NDP forming); [COG1670] Acetyltransferases, including N-acetylases of ribosomal proteins |
TIGRFAM ID | |
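The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the protein length excludes a single terminal stop codon; neither convention is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 2814997
end_bp = 2817657
reported_gene_length_bp = 2661
reported_protein_length_aa = 886

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS is unspliced (as the record says) and ends in one
# stop codon that is not counted in the protein length.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print("gene length matches:", gene_length_bp == reported_gene_length_bp)
print("protein length matches:", protein_length_aa == reported_protein_length_aa)
```

Under these assumptions the reported gene length and protein length agree with the coordinates.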
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.028573 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 51 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTCAGC GAGGACTGGA AGCACTACTG CGACCAAAAT CGATTGCGGT AATTGGCGCA TCGATGAAAC CCAATCGCGC AGGTTACCTG ATGATGCGTA ACCTGCTGGC GGGAGGCTTT AACGGACCGG TACTCCCGGT GACGCCTGCC TGGAAAGCAG TGTTGGGTGT GTTGGCCTGG CCGGATATTG CCAGCTTGCC CTTTACACCA GACCTTGCGG TTTTATGTAC CAATGCCAGC CGTAATCTTG CTCTTCTGGA AGAGCTCGGC GAGAAAGGCT GTAAAACCTG CATTATTCTT TCCGCCCCGT CATCGCAACA CGAAGATCTC CGCGCCTGCG CCCTGCGCCA TAACATGCGC CTGCTTGGAC CAAACAGTCT GGGTTTACTG GCTCCCTGGC AAGGTCTGAA TGCCAGCTTT TCGCCTGTGC CGATTAAACG CGGCAAGCTG GCGTTTATTT CGCAATCGGC TGCTGTCTCC AACACCATCC TCGACTGGGC GCAACAGCGT GAGATGGGCT TTTCCTACTT TATTGCGCTC GGCGACAGCC TGGATATCGA CGTTGATGAA TTGCTCGACT ATCTGGCGCG CGACAGTAAA ACCAGCGCCA TCCTGCTCTA TCTCGAACAG TTAAGCGACG CGCGACGCTT TGTTTCGGCG GCCCGTAGTG CCTCGCGCAA TAAACCGATT CTGGTGATTA AAAGCGGACG TAGCCCGGCG GCTCAACGTC TGTTGAACAC CACGGCAGGA ATGGATCCAG CGTGGGATGC GGCTATTCAG CGTGCCGGTT TGTTGCGGGT ACAGGATACC CACGAACTGT TTTCGGCGGT GGAAACCCTT AGCCATATGC GCCCGCTGCG TGGTGACCGG TTGATGATTA TCAGCAACGG TGCTGCGCCT GCCGCGCTGG CGCTGGATGC CTTATGGTCA CGCAATGGCA AGCTGGCAAC GCTAAGCGAA GAAACCTGCC AGAAACTGCG CGATGCACTG CCAGAACATG TGGCAATCTC TAACCCGCTC GATCTACGCG ATGACGCCAG CAGTGAGCAC TATATTAAAA CGCTGGATAT CCTGCTCCAC AGCCAGGATT TTGACGCGCT GATGGTTATT CATTCGCCCA GCGCCGCTGC TCCCGCAACA GAAAGCGCGC AAGCATTAAT TGAAGCGGTA AAGCATCATC CCCGCAGCAA GTATGTTTCT CTGCTGACGA ACTGGTGTGG CGAGCACTCC TCGCAAGAGG CACGACGATT ATTCAGTGAA GCCGGGCTGC CGACCTATCG CACGCCGGAA GGAACCATCA CTGCTTTTAT GCATATGGTG GAGTACCGGC GTAATCAGAA GCAACTACGC GAAACGCCGG CGTTGCCCAG CAATCTGACC TCCAATACCG CAGAAGCGCA TCTTCTGTTG CAACAGGCGA TTGCCGAAGG AGCAACGTCA CTCGATACCC ATGAAGTTCA GCCCATCCTG CAAGCATATG GCATGAACAC GCTCCCCACC TGGATTGCCA GCGATAGCAC CGAAGCGGTG CATATTGCCG AACAGATTGG TTATCCGGTG GCGCTGAAAT TGCGTTCGCC GGATATTCCA CATAAATCGG AAGTTCAGGG CGTCATGCTC TACCTGCGTA CAGCTAATGA AGTCCAGCAA GCGGCAAACG CTATTTTCGA TCGCGTAAAA ATGGCCTGGC CGCAGGCGCG GATCCACGGC CTGTTGGTGC AAAGTATGGC TAACCGTGCT GGCGCTCAGG AGTTGCGGGT TGTGGTTGAG CACGATCCGG TTTTCGGGCC GTTGATCATG 
CTGGGCGAAG GCGGTGTGGA GTGGCGTCCT GAAGATCAAG CCGTCGTCGC GTTACCACCG CTGAACATGA ACCTGGCCCG CTATCTGGTT ATTCAGGGGA TCAAAAGTAA AAAGATTCGT GCGCGCAGTG CGCTACGCCC ATTGGATGTT GCAGGCTTGA GCCAGCTTCT GGTACAGGTT TCCAACTTGA TTGTCGATTG TCCGGAAATT CAACGTCTGG ATATTCATCC TTTGCTGGCT TCTGGCAGTG AATTTACCGC GCTGGATGTC ACGCTGGATA TCGCGCCGTT TGAAGGCGAT AACGAGAGCC GGCTGGCAGT GCGCCCTTAT CCGCATCAGC TGGAAGAGTG GGTAGAATTG AAAAACGGTG AACGCTGCTT GTTCCGCCCG ATTTTGCCAG AAGATGAGCC ACAACTTCAG CAGTTCATTT CGCGAGTCAC CAAAGAAGAT CTTTATTACC GCTACTTTAG CGAGATCAAC GAATTTACCC ATGAAGATTT AGCCAACATG ACGCAGATCG ACTACGATCG GGAAATGGCA TTTGTAGCGG TACGACGTAT TGATCAAACG GAAGAGATCC TCGGCGTCAC GCGTGCGATC TCCGACCCTG ATAACATCGA TGCCGAATTT GCCGTGCTGG TTCGCTCGGA TCTCAAAGGG TTAGGCTTAG GTCGACGCTT AATGGAAAAG TTGATTACCT ATACGCGAGA TCACGGACTG CAACGTCTGA ATGGTATTAC GATGCCAAAC AATCGTGGCA TGGTGGCGCT GGCCCGCAAG CTCGGGTTTA ACGTTGATAT CCAGCTCGAA GAGGGGATCG TTGGGCTTAC GCTAAATCTT GCCCAGCGCG AGGAATCATG A
|
Protein sequence | MSQRGLEALL RPKSIAVIGA SMKPNRAGYL MMRNLLAGGF NGPVLPVTPA WKAVLGVLAW PDIASLPFTP DLAVLCTNAS RNLALLEELG EKGCKTCIIL SAPSSQHEDL RACALRHNMR LLGPNSLGLL APWQGLNASF SPVPIKRGKL AFISQSAAVS NTILDWAQQR EMGFSYFIAL GDSLDIDVDE LLDYLARDSK TSAILLYLEQ LSDARRFVSA ARSASRNKPI LVIKSGRSPA AQRLLNTTAG MDPAWDAAIQ RAGLLRVQDT HELFSAVETL SHMRPLRGDR LMIISNGAAP AALALDALWS RNGKLATLSE ETCQKLRDAL PEHVAISNPL DLRDDASSEH YIKTLDILLH SQDFDALMVI HSPSAAAPAT ESAQALIEAV KHHPRSKYVS LLTNWCGEHS SQEARRLFSE AGLPTYRTPE GTITAFMHMV EYRRNQKQLR ETPALPSNLT SNTAEAHLLL QQAIAEGATS LDTHEVQPIL QAYGMNTLPT WIASDSTEAV HIAEQIGYPV ALKLRSPDIP HKSEVQGVML YLRTANEVQQ AANAIFDRVK MAWPQARIHG LLVQSMANRA GAQELRVVVE HDPVFGPLIM LGEGGVEWRP EDQAVVALPP LNMNLARYLV IQGIKSKKIR ARSALRPLDV AGLSQLLVQV SNLIVDCPEI QRLDIHPLLA SGSEFTALDV TLDIAPFEGD NESRLAVRPY PHQLEEWVEL KNGERCLFRP ILPEDEPQLQ QFISRVTKED LYYRYFSEIN EFTHEDLANM TQIDYDREMA FVAVRRIDQT EEILGVTRAI SDPDNIDAEF AVLVRSDLKG LGLGRRLMEK LITYTRDHGL QRLNGITMPN NRGMVALARK LGFNVDIQLE EGIVGLTLNL AQREES
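The gene and protein sequences above can be spot-checked against each other. The sketch below translates the first 30 and last 12 bases of the gene sequence. It builds the codon table by hand rather than using a library, and relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in permitted start codons). The names `CODON_TABLE` and `translate` are illustrative, not part of the record.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
# Table 11 uses the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    dna = dna.replace(" ", "").upper()
    return "".join(
        CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - len(dna) % 3, 3)
    )


# First three 10-base groups and the last 12 bases of the gene sequence above.
gene_start = "ATGAGTCAGC GAGGACTGGA AGCACTACTG"
gene_end = "GAGGAATCATGA"

print(translate(gene_start))  # MSQRGLEALL
print(translate(gene_end))    # EES*
```

The first result matches the opening residues of the protein sequence, and the second matches its final three residues followed by the stop codon.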