Gene ECH74115_2108 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_2108 
Symbol 
ID6969872 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp2011092 
End bp2013464 
Gene Length2373 bp 
Protein Length790 aa 
Translation table11 
GC content46% 
IMG OID643386007 
ProductTonB-dependent receptor 
Protein accessionYP_002270496 
Protein GI209397189 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.347716 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones46 
Fosmid unclonability p-value0.376393 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCGAG TTCTTATTCC TGGCGTCATT TTATGTGGCG CTGATGTGGC GCAGGCCGTC 
GATGACAAAA ACATGTACAT GCATTTTTTT GAAGAGATGA CGGTCTATGC TCCTGTCCCT
GTACCCGTAA ACGGCAACAC GCATTACACC AGTGAAAGCA TCGAGCGTTT ACCGACCGGG
AATGGCAATA TCAGCGATCT GCTGAGAACC AACCCTGCGG TACGCATGGA TTCAACGCAA
AGTACCTCGT TGAACCAGGG AGATATTCGC CCGGAGAAAA TCTCTATTCA CGGTGCGTCG
CCCTATCAGA ATGCCTATTT GATTGACGGT ATTAGTGCCA CTAATAACCT GAACCCAGCG
AATGAGTCCG ATGCCAGTAG TGCAACCAAT ATTAGCGGGA TGTCACAGGG GTATTATCTT
GATGTCAGCT TACTGGACAA TGTGACGCTT TATGACAGTT TTGTGCCGGT TGAATTCGGT
CGCTTCAATG GCGGGGTAAT TGATGCAAAG TTCAAACGCT TCAACGCTGA TGATAGCAGC
GTGAAACTGG GTTATCGCAC TACGCGTTCG GACTGGTTAA CATCGCATAT CAATGAGAAT
AACAAGAGCG CATTTAATCA AGGCTCTTCA GGAAGTACTT ATTACTCCCC AGATTTTAAA
AAGAACTTTT ATACCTTGTC GTTTAATCAG GAACTCGCTG ATAACTTTGG CGTTACCGCC
GGTTTATCGC GCCGCCAGTC TGATATCACC CGCGCGGATT ATGTTTCGAA TGACGGCATT
GTCGCCGGTC GGGCACAGTA TAAAAACGTT ATCGATACTG CATTGAGCAA ATTTACCTGG
TTTGCCAGCG ACCGCTTTAC CCACGATTTA ACCTTAAAAT ATACCGGCTC CAGCCGTGAT
TATAATACCA GCACCTTCCC GCAGTCTGAT CGCGAAATGG GTAATAAATC CTATGGTATG
GCATGGGATA TGGATACGCA GCTCGCATGG GCCAAACTAC GTACCACCGT TGGTTGGGAT
CATATTAGTG ATTATACCCG TCACGATCAT GACATCTGGT ACACCGAACT TTCATGTACA
TATGGTGATA TTACAGGGCG TTGCACCCGT GGCGGATTAG GACACATTTC CCAGGCTGTA
GATAATTACA CCTTCAAAAC ACGCCTGGAC TGGCAAAAAT TCGCCGTGGG TAATGTTTCG
CATCAACCCT ACTTCGGCGC GGAATACATC TATTCCGATG CGTGGACTGA ACGCCATAAC
CAGTCTGAAT CCTATGTGAT TAATGCTGCC GGAAAGAAAA CTAACCATAC CATTTACCAT
AAAGGTAAAG GCAGCCTGGG AATTGACAAC TACACACTGT ATATGGCGGA TCGCATTAGC
TGGCGTAATG TGTCATTAAT GCCCGGCGTG CGGTATGACT ATGACAACTA TCTGTCAAAC
CACAATATCT CCCCGCGCTT TATGACGGAA TGGGATATTT TTGCTGATCA AACCTCAATG
ATTACAGCAG GTTATAACCG TTACTATGGC GGGAATATTC TTGATATGGG ATTACGTGAT
ATCCGCAATA GCTGGACGGA ATCGGTATCA GGTAATAAAA CCCTGACGCG TTATCAGGAT
TTGAAAACGC CTTATAACGA TGAACTGGCA ATGGGATTGC AGCAAAAAAT CGGTAAGAAC
GTTATTGCGC GCGCAAACTA TGTTTACCGT GAAGCGCATG ATCAAATCAG CAAAAGCAGT
CGTACCGACA GCGCGACTAA AACCACCATT ACTGAATATA ACAACGACGG CAAAACCAAA
ACGCATTCGT TCAACCTCAG TTTTGAACTG GCCGAACCCC TGCATATCAG CCAGGTAGAT
ATTAACCCGC AAATTGTCTT TAGCTATATC AAGAGCAAGG GCAACTTGTC GTTAAACAAT
GGTTATGAGG AGAGCAATAC CGGTGATAAC CAGGTGGTTT ATAACGGTAA TCTTGTCTCT
TACGATAGCG TTCCAGTGGC AGATTTTAAT AACCCATTAA AGATCTCCTT AAACATGGAT
TTCACGCATC AACCGAGCGG GTTGGTGTGG GCGAATACGC TGGCCTGGCA AGAAGCGCGT
AAAGCTCGCA TTATCCTGGG TAAGACAAAT GCGCAATACA TCAGCGAATA TTCAGATTAC
AAGCAGTATG TTGACGAAAA ACTGGATAGC AGCCTGACCT GGGACACCCG CTTGTCCTGG
ACGCCACAAT TTCTGAAACA ACAAAACCTG ACGATCAGTG CCGATATTCT CAATGTACTG
GATAGCAAAA CCGCGGTTGA TACAACGAAT ACCGGTGTGG CGACCTACGC CAGTGGCCGT
ACTTTCTGGC TTGATGTCAG CATGAAATTT TAA
 
Protein sequence
MKRVLIPGVI LCGADVAQAV DDKNMYMHFF EEMTVYAPVP VPVNGNTHYT SESIERLPTG 
NGNISDLLRT NPAVRMDSTQ STSLNQGDIR PEKISIHGAS PYQNAYLIDG ISATNNLNPA
NESDASSATN ISGMSQGYYL DVSLLDNVTL YDSFVPVEFG RFNGGVIDAK FKRFNADDSS
VKLGYRTTRS DWLTSHINEN NKSAFNQGSS GSTYYSPDFK KNFYTLSFNQ ELADNFGVTA
GLSRRQSDIT RADYVSNDGI VAGRAQYKNV IDTALSKFTW FASDRFTHDL TLKYTGSSRD
YNTSTFPQSD REMGNKSYGM AWDMDTQLAW AKLRTTVGWD HISDYTRHDH DIWYTELSCT
YGDITGRCTR GGLGHISQAV DNYTFKTRLD WQKFAVGNVS HQPYFGAEYI YSDAWTERHN
QSESYVINAA GKKTNHTIYH KGKGSLGIDN YTLYMADRIS WRNVSLMPGV RYDYDNYLSN
HNISPRFMTE WDIFADQTSM ITAGYNRYYG GNILDMGLRD IRNSWTESVS GNKTLTRYQD
LKTPYNDELA MGLQQKIGKN VIARANYVYR EAHDQISKSS RTDSATKTTI TEYNNDGKTK
THSFNLSFEL AEPLHISQVD INPQIVFSYI KSKGNLSLNN GYEESNTGDN QVVYNGNLVS
YDSVPVADFN NPLKISLNMD FTHQPSGLVW ANTLAWQEAR KARIILGKTN AQYISEYSDY
KQYVDEKLDS SLTWDTRLSW TPQFLKQQNL TISADILNVL DSKTAVDTTN TGVATYASGR
TFWLDVSMKF