Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcE24377A_0119 |
Symbol | |
ID | 5589180 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | - |
Start bp | 130981 |
End bp | 132834 |
Gene Length | 1854 bp |
Protein Length | 617 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640923848 |
Product | hypothetical protein |
Protein accession | YP_001461285 |
Protein GI | 157156701 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.000315572 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAATGA CTTTGCCGTT TAAACCCCAT GTGCTGGCGC TAATTTGCAG TGCCGGGCTT TGTGCCGCCT CTACCGGGCT ATATATAAAA AGCCGCACAG TGGAAGCGCC TGTGGAACCG CAATCGACAC AACAGACTGC GCCTGACATC ACCGCAGTTA CGCTTCCTGC AACGGTTTCC GCACCGCCCG TAACGCCCGC CGTCGTCAAA TCCGCATTCA GCACTGCACA AATCGATCAA TGGGTTGCGC CTGTCGCGCT GTACCCCGAT TCTCTGCTTT CACAAGTGTT AATGGCATCA ACCTATCCGG CAAACGTTGC TCAAGCAGTG CAATGGTCGC ACGATAATCC ACTTAAACAA GGCGATGCTG CTATTCAGGC GGTATCTGAC CAGCCGTGGG ACGCCAGCGT TAAATCACTG GTGGCCTTTC CACAATTGAT GGCATTGATG GGCGAAAACC CGCAATGGGT GCAAAACCTG GGCGATGCTT TTCTGGCGCA GCCGCAGGAC GTGATGGACT CGGTACAGCG ATTGCGGCAA CTGGCGCAAC AAACCGGCTC GCTGAAGTCA TCAACCGAAC AGAAAGTTAT TACCACAACG AAGAAAGCTG TACCGGTAAC ACAGACAGTC ACGGCTCCCG TCATACCATC CAATACCGTT TCAACTGCCA GCCCCGTCAT TACAGAGCCT GCAACAACCG TCATTTCCAT TGAGCCCGCC AATCCTGATG TGGTCTATAT TCCCAACTAC AACCCAACCG TGGTTTACGG GAACTGGGCC AATACTGCGT ATCCGCCGGT TTATCTGCCA CCACCAGCCG GAGAACCGTT TATTGACAGC TTTGTGCGCG GATTCGGCTA TAGCATGGGT GTTGCTACCA CGTACGCACT ATTCAGCAGC ATCGACTGGG ACGACGACGA TCATGACCAT CATCATCATG ACGATGATGA TTATCATCAC CACGATGGCG GTCATCGTGA CGGTAATGGC TGGCAACACA ACGGCGACAA CATCAATATC GACGTCAACA ATTTCAACCG TATCACCGGT GAGCATCTTA CTGATAAGAA TATGGCATGG CGGCACAATC CAAACTACCG TAATGGTGTG CCCTATCATG ATCAGGATAT GGCAAAGCGG TTTCATCAAA CCGATGTCAA CGGCGGAATG AGCGCCACGC AGCTACCTGC TCCAACACGC GACAGCCAGC GTCAGGCGGC AGCAAGTCAG TTTCAGCAAC GAACACACGC CGCCCCCGTC ATTACACGAG ATACCCAACG TCAGGCAGCG GCACAGCGGT TTAATGAAGC TGAAAACTAT GGGAGCTATG ACGACTTCCG CGACTTCAGC CGTCGCCAAC CACTGACCCA GCAACAAAAG GACGCCGCTC GTCAGCGTTA TCAGTCGGCC TCGCCTGAGC AGCGCCAGGC AGTCCGCGAG AAAATGCAGA CTAACCCACA GATCCAGCAG CGAAGAGACG CAGCGCGTGA GCGTATTCAG TCCGCCTCGC CTGAGCAGCG CCAGGCAGTC CGCGAGAAAA TGCAGACTAA CCCACAGATC CAGCAGCGAA GAGACGCAGC GCGTGAGCGT ATTCAGTCAG CCTCGCCTGA GCAGCGCCAG GTGTTTAAGG AAAAAGTACA GCAGCGCCCA CTGAACCAAC AGCAACGTGA TAACGCCCGC CAGCGTGTTC AATCAGCATC ACCTGAACAA CGTCAGGTTT TTCGGGAAAA AGTTCAGGAG AGCCGCCCAC AACGTCTAAA CGACAGTAAC CATACTGTCA GGCTGAATAA CGAGCAACGG TCAGCAGTAC GCGAACGTCT CTCTGAGCGC GGAGCAAGGC GACAGGAAAG GTAA
|
Protein sequence | MKMTLPFKPH VLALICSAGL CAASTGLYIK SRTVEAPVEP QSTQQTAPDI TAVTLPATVS APPVTPAVVK SAFSTAQIDQ WVAPVALYPD SLLSQVLMAS TYPANVAQAV QWSHDNPLKQ GDAAIQAVSD QPWDASVKSL VAFPQLMALM GENPQWVQNL GDAFLAQPQD VMDSVQRLRQ LAQQTGSLKS STEQKVITTT KKAVPVTQTV TAPVIPSNTV STASPVITEP ATTVISIEPA NPDVVYIPNY NPTVVYGNWA NTAYPPVYLP PPAGEPFIDS FVRGFGYSMG VATTYALFSS IDWDDDDHDH HHHDDDDYHH HDGGHRDGNG WQHNGDNINI DVNNFNRITG EHLTDKNMAW RHNPNYRNGV PYHDQDMAKR FHQTDVNGGM SATQLPAPTR DSQRQAAASQ FQQRTHAAPV ITRDTQRQAA AQRFNEAENY GSYDDFRDFS RRQPLTQQQK DAARQRYQSA SPEQRQAVRE KMQTNPQIQQ RRDAARERIQ SASPEQRQAV REKMQTNPQI QQRRDAARER IQSASPEQRQ VFKEKVQQRP LNQQQRDNAR QRVQSASPEQ RQVFREKVQE SRPQRLNDSN HTVRLNNEQR SAVRERLSER GARRQER
|
| |