Gene Information |
Locus tag | ECH74115_2108 |
Symbol | |
ID | 6969872 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 2011092 |
End bp | 2013464 |
Gene Length | 2373 bp |
Protein Length | 790 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 643386007 |
Product | TonB-dependent receptor |
Protein accession | YP_002270496 |
Protein GI | 209397189 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1629] Outer membrane receptor proteins, mostly Fe transport |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
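The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. Neither convention is stated in the record, but both are consistent with its numbers. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 2011092
END_BP = 2013464
GENE_LENGTH_BP = 2373
PROTEIN_LENGTH_AA = 790


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a coordinate span, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming a single terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both checks should agree with the table if the assumptions hold.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 2013464 - 2011092 + 1 gives 2373 bp, and 2373 / 3 - 1 gives 790 aa, matching the Gene Length and Protein Length fields.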
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.347716 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 0.376393 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGCGAG TTCTTATTCC TGGCGTCATT TTATGTGGCG CTGATGTGGC GCAGGCCGTC GATGACAAAA ACATGTACAT GCATTTTTTT GAAGAGATGA CGGTCTATGC TCCTGTCCCT GTACCCGTAA ACGGCAACAC GCATTACACC AGTGAAAGCA TCGAGCGTTT ACCGACCGGG AATGGCAATA TCAGCGATCT GCTGAGAACC AACCCTGCGG TACGCATGGA TTCAACGCAA AGTACCTCGT TGAACCAGGG AGATATTCGC CCGGAGAAAA TCTCTATTCA CGGTGCGTCG CCCTATCAGA ATGCCTATTT GATTGACGGT ATTAGTGCCA CTAATAACCT GAACCCAGCG AATGAGTCCG ATGCCAGTAG TGCAACCAAT ATTAGCGGGA TGTCACAGGG GTATTATCTT GATGTCAGCT TACTGGACAA TGTGACGCTT TATGACAGTT TTGTGCCGGT TGAATTCGGT CGCTTCAATG GCGGGGTAAT TGATGCAAAG TTCAAACGCT TCAACGCTGA TGATAGCAGC GTGAAACTGG GTTATCGCAC TACGCGTTCG GACTGGTTAA CATCGCATAT CAATGAGAAT AACAAGAGCG CATTTAATCA AGGCTCTTCA GGAAGTACTT ATTACTCCCC AGATTTTAAA AAGAACTTTT ATACCTTGTC GTTTAATCAG GAACTCGCTG ATAACTTTGG CGTTACCGCC GGTTTATCGC GCCGCCAGTC TGATATCACC CGCGCGGATT ATGTTTCGAA TGACGGCATT GTCGCCGGTC GGGCACAGTA TAAAAACGTT ATCGATACTG CATTGAGCAA ATTTACCTGG TTTGCCAGCG ACCGCTTTAC CCACGATTTA ACCTTAAAAT ATACCGGCTC CAGCCGTGAT TATAATACCA GCACCTTCCC GCAGTCTGAT CGCGAAATGG GTAATAAATC CTATGGTATG GCATGGGATA TGGATACGCA GCTCGCATGG GCCAAACTAC GTACCACCGT TGGTTGGGAT CATATTAGTG ATTATACCCG TCACGATCAT GACATCTGGT ACACCGAACT TTCATGTACA TATGGTGATA TTACAGGGCG TTGCACCCGT GGCGGATTAG GACACATTTC CCAGGCTGTA GATAATTACA CCTTCAAAAC ACGCCTGGAC TGGCAAAAAT TCGCCGTGGG TAATGTTTCG CATCAACCCT ACTTCGGCGC GGAATACATC TATTCCGATG CGTGGACTGA ACGCCATAAC CAGTCTGAAT CCTATGTGAT TAATGCTGCC GGAAAGAAAA CTAACCATAC CATTTACCAT AAAGGTAAAG GCAGCCTGGG AATTGACAAC TACACACTGT ATATGGCGGA TCGCATTAGC TGGCGTAATG TGTCATTAAT GCCCGGCGTG CGGTATGACT ATGACAACTA TCTGTCAAAC CACAATATCT CCCCGCGCTT TATGACGGAA TGGGATATTT TTGCTGATCA AACCTCAATG ATTACAGCAG GTTATAACCG TTACTATGGC GGGAATATTC TTGATATGGG ATTACGTGAT ATCCGCAATA GCTGGACGGA ATCGGTATCA GGTAATAAAA CCCTGACGCG TTATCAGGAT TTGAAAACGC CTTATAACGA TGAACTGGCA ATGGGATTGC AGCAAAAAAT CGGTAAGAAC GTTATTGCGC GCGCAAACTA TGTTTACCGT GAAGCGCATG ATCAAATCAG CAAAAGCAGT CGTACCGACA GCGCGACTAA AACCACCATT ACTGAATATA ACAACGACGG CAAAACCAAA 
ACGCATTCGT TCAACCTCAG TTTTGAACTG GCCGAACCCC TGCATATCAG CCAGGTAGAT ATTAACCCGC AAATTGTCTT TAGCTATATC AAGAGCAAGG GCAACTTGTC GTTAAACAAT GGTTATGAGG AGAGCAATAC CGGTGATAAC CAGGTGGTTT ATAACGGTAA TCTTGTCTCT TACGATAGCG TTCCAGTGGC AGATTTTAAT AACCCATTAA AGATCTCCTT AAACATGGAT TTCACGCATC AACCGAGCGG GTTGGTGTGG GCGAATACGC TGGCCTGGCA AGAAGCGCGT AAAGCTCGCA TTATCCTGGG TAAGACAAAT GCGCAATACA TCAGCGAATA TTCAGATTAC AAGCAGTATG TTGACGAAAA ACTGGATAGC AGCCTGACCT GGGACACCCG CTTGTCCTGG ACGCCACAAT TTCTGAAACA ACAAAACCTG ACGATCAGTG CCGATATTCT CAATGTACTG GATAGCAAAA CCGCGGTTGA TACAACGAAT ACCGGTGTGG CGACCTACGC CAGTGGCCGT ACTTTCTGGC TTGATGTCAG CATGAAATTT TAA
|
Protein sequence | MKRVLIPGVI LCGADVAQAV DDKNMYMHFF EEMTVYAPVP VPVNGNTHYT SESIERLPTG NGNISDLLRT NPAVRMDSTQ STSLNQGDIR PEKISIHGAS PYQNAYLIDG ISATNNLNPA NESDASSATN ISGMSQGYYL DVSLLDNVTL YDSFVPVEFG RFNGGVIDAK FKRFNADDSS VKLGYRTTRS DWLTSHINEN NKSAFNQGSS GSTYYSPDFK KNFYTLSFNQ ELADNFGVTA GLSRRQSDIT RADYVSNDGI VAGRAQYKNV IDTALSKFTW FASDRFTHDL TLKYTGSSRD YNTSTFPQSD REMGNKSYGM AWDMDTQLAW AKLRTTVGWD HISDYTRHDH DIWYTELSCT YGDITGRCTR GGLGHISQAV DNYTFKTRLD WQKFAVGNVS HQPYFGAEYI YSDAWTERHN QSESYVINAA GKKTNHTIYH KGKGSLGIDN YTLYMADRIS WRNVSLMPGV RYDYDNYLSN HNISPRFMTE WDIFADQTSM ITAGYNRYYG GNILDMGLRD IRNSWTESVS GNKTLTRYQD LKTPYNDELA MGLQQKIGKN VIARANYVYR EAHDQISKSS RTDSATKTTI TEYNNDGKTK THSFNLSFEL AEPLHISQVD INPQIVFSYI KSKGNLSLNN GYEESNTGDN QVVYNGNLVS YDSVPVADFN NPLKISLNMD FTHQPSGLVW ANTLAWQEAR KARIILGKTN AQYISEYSDY KQYVDEKLDS SLTWDTRLSW TPQFLKQQNL TISADILNVL DSKTAVDTTN TGVATYASGR TFWLDVSMKF
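The gene and protein sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how the fields relate, the following removes the spacing, computes GC content, and translates the coding sequence. It assumes the gene sequence is already given in the coding (5' to 3') orientation, which is consistent with it beginning with ATG and ending with TAA. The codon lookup uses the standard amino acid assignments, which translation table 11 shares. Table 11 differs in its permitted start codons, which this sketch does not handle. All function names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean_sequence(raw: str) -> str:
    """Remove the block spacing and line breaks used in the record."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(raw: str) -> str:
    """Translate complete codons, stopping at the first stop codon."""
    seq = clean_sequence(raw)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        first, second, third = (BASES.index(base) for base in seq[i:i + 3])
        amino_acid = AMINO_ACIDS[16 * first + 4 * second + third]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First two blocks of the gene sequence; the trailing incomplete codon is ignored.
    print(translate("ATGAAGCGAG TTCTTATTCC"))
```

Applied to the first two blocks of the gene sequence, `translate` yields `MKRVLI`, which agrees with the start of the protein sequence. Running `gc_percent` on the full gene sequence would be one way to compare against the GC content field.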
|
| |