Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_2438 |
Symbol | |
ID | 6971631 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 2309100 |
End bp | 2310998 |
Gene Length | 1899 bp |
Protein Length | 632 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 643386308 |
Product | hypothetical protein |
Protein accession | YP_002270790 |
Protein GI | 209398577 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0495046 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 77 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTCAAA ACGACATTAT TATCAGAACT CATTATAAGT CTCCTCATAG AATGCACATC GATAGCGACA TACCAACGCC TTCATCAGAG CCTATTAATC AATTTGCGCC CCAACTCATC ACCCTACTTG ATACCTCTGA CTTAAGTTCG ATGCTGTCAT ACTGTGTTAC TCAGGAATTT ACCGCAAACT GTCGAAAAAT ATCACAAAAT TGTTATTCCA CTGCCCTTTT TACCATTAAC TTTGCCACTT CACCCATCCA TGCAGAAAAT ATATTCATTA CATTACACTA TAAAAAAGAA ATCATTTCCT TATTACTGGA AACCACGCCT ATTAAAGCTA ACCATTTGCG AAGCATACTG GATTATATTG AACAGGAACA GTTAACTGCC GAAAATCGTA ACCATTGTAT GAAACTGTCT AAAAAAATCC ATAGAGAAAA AACTATACAA CCAACAGTAA ATCTCAATGG TAGTGCATTT TTTTCGCAAT CTCCTTCTGA CGCTATTTTT TGTCGCCATC TGTCATTGCA ATACGCACTT GATTCATTGA GAAATGGAAA AGGCAAGGTC AACCTGATTA AACATTACTC CTCCGTTGAA TCCATACAGC AGCATGTCCC CTTAGTCCGG GACGCGGAGT TCAGAGCATT ACTTCGCCAT CCTCCTGCAG GGAGTCGCGT TATCGCGAGT AAGGATTTTG GCTTCGCTTT AGATATTTTC TTCTGTCGAA TGATGGCAAA CAATGTCAGT CATATGTCTG CGATTTTATA TATAGACAAT CATACTTTGT CAGTAAGGCT ACGAATAAAG CAGTCGGCGT ATGGGCAATT AAATTATGTT GTGTCCGTTT ACGACCCGAA CGATACCAAC GTTGCCGTCA GAGGCACCCA CAGGACAGCA CGGGGCTTTC TCTCGCTTGA TAAGTTCATC AGTTCAGGTC CCGATGCTCA GACCTGGGCT GATAGGTATG TTCGCAACTG TGCAATTGCT ATTCTGCCCC TATTACCTGA GGGAGTTCCA GGGGCTATTT TCACGGGTAT CGCGACACGA ATGCCATTTG CCCCTATACA TCCATCGGCA ATGTTGTTAA TAATGGCCAC AGGCCAGACT CAACAGCTTA TTACATTATT CAGACAGTTG CCCATACTCC CTGAAAAAGA AATCATTGAA ATAATAACTG CGCAGAATAG CATTGGTACA CCTGCTTTAT TTTTGGCTAT GATGAACGGA CATACTGACA ACGTGAAAAT ATTTATGCAA GAAATTCAGT CACTGGTAGA TAATCACATC ATTCATGAAG ATAATCTGGT TAAATTGTTG CAAACTAAAA GTGCTAACGA AACACCTGGA CTTTATATCT CCATGTTGTA TGGATTCGAT GAGATAATCG ATATCTTTCT GAATGCATTA ACCACTCCTA TAGCACAAGA GCTTTTAAAC AAAAAACTGG TGATGAGTAT TTTAGCCATG AAAATACATG ATGGTGAGCC AGGATTATAC GCCGCAATGG AAAATAATCA CCCTTTGTGT GTCACACGGT TCCTCTCTAA AATTAATGGC ATCGCCTTTA AATACAAGTT GAGCAAAGCT AACATCATGG ATTTATTAAA AGGCGCTACA GCACAGGGAA CCCCTGCATT ATACATCGCT ATGAGCAAGG GTAATGAAGA CGTCGTGTTA TCTTATATAT CGACGCTGGG TGCTTTTGCA AAAAAACATT CTTTTAGTCA ACATCAGTTA TTTACACTGT TGGCTGCTAA AAATCATGAC AACATGTCAG CTGTTCATAT AGCCATTCAT CATAATCATT ATAAAACTGT AGAAACATAT TATGCTGCTA TAAATGTAAT CAGCCAAAGC ATGAGTTTTA GTGCTGATGA ATTAAAGACG TATTTATAA
|
Protein sequence | MSQNDIIIRT HYKSPHRMHI DSDIPTPSSE PINQFAPQLI TLLDTSDLSS MLSYCVTQEF TANCRKISQN CYSTALFTIN FATSPIHAEN IFITLHYKKE IISLLLETTP IKANHLRSIL DYIEQEQLTA ENRNHCMKLS KKIHREKTIQ PTVNLNGSAF FSQSPSDAIF CRHLSLQYAL DSLRNGKGKV NLIKHYSSVE SIQQHVPLVR DAEFRALLRH PPAGSRVIAS KDFGFALDIF FCRMMANNVS HMSAILYIDN HTLSVRLRIK QSAYGQLNYV VSVYDPNDTN VAVRGTHRTA RGFLSLDKFI SSGPDAQTWA DRYVRNCAIA ILPLLPEGVP GAIFTGIATR MPFAPIHPSA MLLIMATGQT QQLITLFRQL PILPEKEIIE IITAQNSIGT PALFLAMMNG HTDNVKIFMQ EIQSLVDNHI IHEDNLVKLL QTKSANETPG LYISMLYGFD EIIDIFLNAL TTPIAQELLN KKLVMSILAM KIHDGEPGLY AAMENNHPLC VTRFLSKING IAFKYKLSKA NIMDLLKGAT AQGTPALYIA MSKGNEDVVL SYISTLGAFA KKHSFSQHQL FTLLAAKNHD NMSAVHIAIH HNHYKTVETY YAAINVISQS MSFSADELKT YL
|
| |