Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcE24377A_4379 |
Symbol | |
ID | 5586857 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | + |
Start bp | 4369412 |
End bp | 4370842 |
Gene Length | 1431 bp |
Protein Length | 476 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640927997 |
Product | hypothetical protein |
Protein accession | YP_001465341 |
Protein GI | 157156538 |
COG category | [S] Function unknown |
COG ID | [COG5339] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00000924151 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATACATA AGTCAGCTAC AGGTGTTATT GTTGCGTTAG CCGTAATCTG GGGTGGTGGC ACATGGTACA CAGGTACGCA AATTCAGCCT GGTGTCGAAA AGTTTATTAA AGATTTTAAC GATGCTAAAA AGAAAGGTGA ACATGCCTAC GATATGACGT TAAGTTATAA AAATTTTGAC AAAGGTTTTT TTAATTCTCG TTTTCAAATG CAAATGACAT TCGATAACGG TGCACCCGAT CTCAATATCA AGCCAGGCCA GAAAGTTGTA TTTGATGTGG ATGTTGAGCA CGGTCCGTTG CCCATCACAA TGTTAATGCA TGGTAATGTC ATCCCAGCAC TGGCAGCGGC AAAAGTGAAC TTAGTGAATA ATGAACTGAC ACAACCGCTA TTTATCGCCG CGAAAAATAA ATCGCTCGTG GAAGCGACAT TGCGATTCGC GTTTGGTGGC TCATTCTCTA CGACATTAGA TGTTGCCCCT GCAGAGTATG GAAAGTTTTC TTTTGGTGAG GGCCAGTTTA CTTTTAATGG TGATGGTAGT TCATTGTCTA ACCTGGATAT TGAAGGCAAA GTCGAAGATA TTGTTCTGCA ATTATCACCA ATGAACAAAG TAACGGCAAA AAGTTTTACC ATTGATTCTC TGGCGCGATT AGAAGAAAAG AAATTTCCGG TTGGTGAAAG CGAGTCGAAA TTTAATCAGA TTAACATTAT CAATCACGGG GAAGACGTTG CCCAAATCGA TGCTTTCGTT GCAAAAACCA TGCTGGATCG CGTTAAAGAC AAAGATTATA TCAATGTCAA TCTGACCTAC GAACTTGATA AGTTAACAAA AGGGAATCAG CAACTCGGTA GTGGTGAGTG GTCATTGATT GCTGAATCTA TTGATCCCTC AGCGGTGCGC CAATTTATCA TCCAGTATAA CATTGCAATG CAGAAGCAGC TTGCTGCACA CCCTGAGTTA GCAAACGATG AAGTTGCTCT GCAAGAAGTG AATGCTGCAT TGTTCAAAGA GTATTTACCG TTATTACAAA AAAGTGAGCC GACCATTAAA CAACCGGTAA AATGGAAGAA CGCACTCGGC GAACTAAATG CCAATCTGGA TATCAGTATT GCCGACCCAG CCAAATCTTC ATCATCCACA AACAAAGATA TTAAATCGCT CAATTTTGAT GTGAAGTTAC CGCTTAATGT CGCCACAGAA ACCGCAAAAC AGCTTAATTT ATCTGAAGGA ATGGATGCGG AAAAAGCGCA AAAGCGGGCT GATAAACAAA TCAGCGGGAT GATGACCTTA GGTCAGATGT TTCAGTTAAT CACGATTGAC AACAATACCG CCTCGCTGCA ACTGCGTTAT ACACCGGGTA AAGTTGTTTT TAACGGACAG GAGATGAGCG AAGAAGAATT TATGTCTCGT GCCGGACGTT TTGTTCATTA A
|
Protein sequence | MIHKSATGVI VALAVIWGGG TWYTGTQIQP GVEKFIKDFN DAKKKGEHAY DMTLSYKNFD KGFFNSRFQM QMTFDNGAPD LNIKPGQKVV FDVDVEHGPL PITMLMHGNV IPALAAAKVN LVNNELTQPL FIAAKNKSLV EATLRFAFGG SFSTTLDVAP AEYGKFSFGE GQFTFNGDGS SLSNLDIEGK VEDIVLQLSP MNKVTAKSFT IDSLARLEEK KFPVGESESK FNQINIINHG EDVAQIDAFV AKTMLDRVKD KDYINVNLTY ELDKLTKGNQ QLGSGEWSLI AESIDPSAVR QFIIQYNIAM QKQLAAHPEL ANDEVALQEV NAALFKEYLP LLQKSEPTIK QPVKWKNALG ELNANLDISI ADPAKSSSST NKDIKSLNFD VKLPLNVATE TAKQLNLSEG MDAEKAQKRA DKQISGMMTL GQMFQLITID NNTASLQLRY TPGKVVFNGQ EMSEEEFMSR AGRFVH
|
| |