Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A1580 |
Symbol | |
ID | 5593894 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 1592070 |
End bp | 1594442 |
Gene Length | 2373 bp |
Protein Length | 790 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 640920733 |
Product | TonB-dependent receptor |
Protein accession | YP_001458289 |
Protein GI | 157160971 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1629] Outer membrane receptor proteins, mostly Fe transport |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 40 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGCGAG TTCTTATTCC TGGCGTCATT TTATGTGGCG CTGATGTGGC GCAGGCCGTC GATGACAAAA ACATGTACAT GCATTTTTTT GAAGAGATGA CGGTCTATGC TCCTGTCCCT GTACCCGTAA ACGGCAACAC GCATTACACC AGTGAAAGCA TCGAGCGTTT ACCGACCGGG AATGGCAATA TCAGCGATCT GCTGAGAACC AACCCTGCGG TACGCATGGA TTCAACGCAA AGTACCTCGT TGAACCAGGG AGATATTCGC CCGGAGAAAA TCTCTATTCA CGGTGCGTCG CCCTACCAGA ATGCCTATTT GATTGATGGT ATTAGTGCCA CTAATAACCT GAACCCAGCG AATGAGTCCG ATGCCAGTAG TGCAACCAAT ATTAGCGGGA TGTCACAGGG GTATTATCTT GATGTCAGCT TACTGGACTA TGTGACGCTT TATGACAGTT TTGTGCCGGT TGAATTCGGT CGTTTCAATG GCGGGGTAAT TGATGCAAAG ATCAAACGCT TCAACGCTGA TGATAGCAGC GTGAAACTGG GTTATCGCAC TACGCGTTCG GACTGGTTAA CATCGCATAT CGATGAGAAT AACAAGAGCG CATTTAATCA AGGTTCTTCA GGAAGTACTT ATTACTCTCC AGATTTTAAA AAGAACTTTT ATACCTTGTC GTTTAATCAG GAACTCGCTG ATAACTTTGG CGTTACCGCC GGTTTATCGC GCCGCCAGTC TGATATCACC CGTGCGGATT ATGTTTCGAA TGACGGCATT GTCGCCGGTC GGGCACAGTA TAAAAACGTT ATCGATACTG CATTGAGCAA ATTTACCTGG TTTGCCAGCG ACCGCTTTAC CCACGATTTA ACCTTAAAAT ATACCGGCTC CAGCCGTGAT TATAATACCA GCACCTTCCC GCAGTCTGAT CGCGAAATGG GTAATAAATC CTATGGTCTG GCATGGGATA TGGATACGCA GCTCGCATGG GCCAAACTAC GTACCACCGT TGGTTGGGAT CATATTAGTG ATTATACCCG TCACGATCAT GACATCTGGT ACACCGAACT TTCATGTACA TATGGTGATA TTACAGGGCG TTGCACCCGT GGCGGATTAG GACACATTTC CCAGGCTGTA GATAATTACA CCTTCAAAAC ACGCCTGGAC TGGCAAAAAT TCGCCGTGGG TAATGTTTCG CATCAAACCT ACTTCGGCGC GGAATACATC TATTCCGATG CATGGACTGA ACGCCATAAC CAGTCTGAAT CCTATGTGAT TAATGCTGCC GGAAAGAAAA CTAACCATAC CATTTACCAT AAAGGTAAAG GCAGCCTGGG AATTGACAAC TACACACTGT ATATGGCGGA TCGCATTAGC TGGCGTAATG TGTCATTAAT GCCCGGCGTG CGGTATGACT ATGACAACTA TCTGTCAAAC CACAATATCT CCCCGCGCTT TATGACGGAA TGGGATATTT TTGCTGATCA AACCTCAATG ATTACCGCAG GTTATAACCG TTACTATGGT GGGAATATTC TTGATATGGG ATTACGTGAT ATCCGCAATA GCTGGACGGA ATCGGTATCA GGCAATAAAA CCCTGACGCG TTATCAGGAT TTGAAAACGC CTTATAACGA TGAACTGGCA ATGGGATTGC AGCAAAAAAT CGGTAAGAAC GTTATTGCGC GCGCAAACTA TGTTTACCGT GAAGCGCATG ATCAAATCAG CAAAAGCAGT CGTACCGACA GCGCGACTAA AACCACCATT ACTGAATATA ACAACGATGG CAAAACCAAA ACGCATTCAT TCAACCTCAG TTTTGAGCTG GCCGAACCCC TGCATATCCG CCAGGTAGAT ATTAACCCAC AAATTGTCTT TAGCTATATC AAGAGCAAGG GCAACTTGTC GTTAAACAAT GGTTATGAGG AGAGCAATAC CGGTGATAAC CAGGTGGTTT ATAACGGTAA TCTGGTCTCT TACGATAGCG TTCCAGTGGC AGATTTTAAT AACCCATTAA AGATCTCCTT AAACATGGAT TTCACGCATC AACCGAGCGG GTTAGTGTGG GCGAATACGC TGGCCTGGCA AGAAGCGCGT AAAGCTCGCA TTATCCTGGG TAAGGCGAAT GCGCAATACA TCAGCGAATA TTCAGATTAC AAGCAGTATG TTGACGAAAA ACTGGATAGC AGCCTGACCT GGGACACCCG CTTGTCCTGG ACGCCACAAT TTCTGCAACA ACAAAACCTG ACGATCAGTG CCGATATTCT CAATGTACTG GATAGCAAAA CCGCTGTTGA TACAACGAAT ACCGGTGTGG CGACCTACGC CAGTGGCCGT ACTTTCTGGC TTGATGTCAG CATGAAATTT TAA
|
Protein sequence | MKRVLIPGVI LCGADVAQAV DDKNMYMHFF EEMTVYAPVP VPVNGNTHYT SESIERLPTG NGNISDLLRT NPAVRMDSTQ STSLNQGDIR PEKISIHGAS PYQNAYLIDG ISATNNLNPA NESDASSATN ISGMSQGYYL DVSLLDYVTL YDSFVPVEFG RFNGGVIDAK IKRFNADDSS VKLGYRTTRS DWLTSHIDEN NKSAFNQGSS GSTYYSPDFK KNFYTLSFNQ ELADNFGVTA GLSRRQSDIT RADYVSNDGI VAGRAQYKNV IDTALSKFTW FASDRFTHDL TLKYTGSSRD YNTSTFPQSD REMGNKSYGL AWDMDTQLAW AKLRTTVGWD HISDYTRHDH DIWYTELSCT YGDITGRCTR GGLGHISQAV DNYTFKTRLD WQKFAVGNVS HQTYFGAEYI YSDAWTERHN QSESYVINAA GKKTNHTIYH KGKGSLGIDN YTLYMADRIS WRNVSLMPGV RYDYDNYLSN HNISPRFMTE WDIFADQTSM ITAGYNRYYG GNILDMGLRD IRNSWTESVS GNKTLTRYQD LKTPYNDELA MGLQQKIGKN VIARANYVYR EAHDQISKSS RTDSATKTTI TEYNNDGKTK THSFNLSFEL AEPLHIRQVD INPQIVFSYI KSKGNLSLNN GYEESNTGDN QVVYNGNLVS YDSVPVADFN NPLKISLNMD FTHQPSGLVW ANTLAWQEAR KARIILGKAN AQYISEYSDY KQYVDEKLDS SLTWDTRLSW TPQFLQQQNL TISADILNVL DSKTAVDTTN TGVATYASGR TFWLDVSMKF
|
| |