Gene ECH74115_0160 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_0160 
SymbolfhuA 
ID6971919 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp171787 
End bp174030 
Gene Length2244 bp 
Protein Length747 aa 
Translation table11 
GC content52% 
IMG OID643384236 
Productferrichrome outer membrane transporter 
Protein accessionYP_002268759 
Protein GI209398761 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID[TIGR01783] TonB-dependent siderophore receptor 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones57 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGCGTT CCAAAACTGC TCAGCCAAAA CACTCACTGC GTAAAATCGC AGTTGTAGTA 
GCCACAGCGG TTAGCGGCAT GTCTGTTTAT GCACAGGCAG CGGTTGAACC GAAAGAAGAC
ACTATCACCG TTACCGCTGC ACCTGCGCCG CAAGAAAGCG CATGGGGGCC TGCTGCAACT
ATTGCGGCGC GACAGTCAGC TACCGGCACT AAAACCGATA CGCCGATTCA AAAAGTGCCA
CAGTCTATTT CTGTTGTGAC CGCCGAAGAG ATGGCGCTGC ATCAGCCGAA GTCGGTAAAA
GAAGCGCTTA GCTACACGCC GGGTGTCTCT GTTGGTACGC GTGGCGCATC CAACACCTAT
GACCACCTGA TCATTCGCGG TTTTGCGGCA GAAGGCCAAA GCCAGAATAA CTATCTGAAT
GGCCTGAAGT TGCAGGGCAA CTTCTATAAC GATGCGGTCA TTGATCCGTA TATGCTGGAA
CGCGCTGAAA TTATGCGTGG CCCGGTTTCC GTGCTTTACG GTAAAAGCAG TCCTGGCGGC
CTGTTGAATA TGGTCAGCAA GCGTCCGACC ACCGAACCGC TGAAAGAAGT TCAGTTTAAA
GCCGGTACTG ACAGCCTGTT CCAGACTGGT TTTGACTTTA GCGATGCGCT GGATGATGAC
GGCGTTTACT CTTATCGCCT GACCGGTCTT GCGCGTTCTG CCAATGCCCA GCAGAAAGGG
TCAGAAGAGC AGCGTTATGC TATTGCACCG GCGTTCACCT GGCGTCCGGA TGATAAAACC
AATTTCACCT TCCTTTCTTA CTTCCAGAAC GAGCCGGAAA CCGGTTATTA CGGCTGGTTG
CCGAAAGAGG GAACCGTTGA GCCGCTGCCG AACGGTAAGC GTCTGCCGAC AGACTTTAAC
GAAGGGGCGA AGAACAACAC CTATTCTCGT AATGAGAAGA TGGTGGGCTA CAGCTTCGAT
CACGAATTTA ACGACACCTT TACTGTGCGT CAGAACCTGC GCTTTGCTGA AAACAAAACC
TCGCAAAACA GCGTTTATGG TTACGGCGTC TGCTCCGATC CGGCGAATGC TTACAGCAAA
CAGTGTGCGG CATTAGCGCC AGCGGATAAA GGCCATTATC TGGCACGTAA ATACGTCGTT
GATGATGAGA AGCTGCAAAA CTTCTCCGTT GATACCCAGT TGCAGAGCAA GTTTGCCACT
GGCGATATCG ACCACACCCT GCTGACCGGT GTCGACTTTA TGCGTATGCG TAATGACATC
AACGCCTGGT TTGGTTACGA CGACTCCGTA CCGCTGCTCG ATCTGTACAA TCCGGTGAAT
ACCGATTTCG ACTTCAATGC CAAAGATCCG GCAAACTCCG GCCCTTACCG CATTCTGAAT
AAGCAGAAAC AAACGGGCGT TTATGTTCAG GATCAGGCGC AGTGGGATAA AGTGCTGGTC
ACCCTGGGCG GTCGTTATGA CTGGGCAGAT CAAGAATCTC TTAACCGCGT TGCCGGGACG
ACCGATAAAC GTGATGACAA ACAGTTTACC TGGCGTGGTG GTGTTAACTA CCTGTTTGAT
AATGGCGTAA CACCTTACTT TAGCTATAGC GAATCGTTTG AACCTTCTTC GCAAGTTGGG
AAGGATGGTA ATATTTTCGC ACCGTCTAAA GGTAAGCAGT ATGAAGTCGG CGTGAAATAT
GTACCGGAAG ATCGTCCGAT TGTAGTTACT GGTGCCGTGT ATAATCTCAC TAAAACCAAC
AACCTGATGG CGGACCCTGA GGGTTCCTTC TTCTCGGTTG AAGGTGGCGA GATCCGCGCA
CGTGGCGTAG AAATCGAAGC GAAAGCGGCG CTGTCGGCGA GTGTTAACGT AGTCGGTTCT
TATACTTACA CCGATGCGGA ATACACCACC GATACTACCT ATAAAGGCAA TACGCCTGCA
CAGGTGCCAA AACACATGGC TTCGCTGTGG GCTGACTATA CCTTCTTTGA CGGTCCGCTT
TCAGGTCTGA CGCTGGGCAC CGGTGGTCGT TATACTGGCT CCAGCTATGG TGATCCGGCT
AACTCCTTTA AAGTGGGAAG TTATACGGTC GTGGATGCGT TAGTGCGTTA TGATCTGGCG
CGAGTCGGCA TGGCGGGCTC CAACGTGGCG CTGCATGTTA ACAACCTGTT CGATCGTGAA
TACGTCGCCA GCTGCTTTAA CACTTATGGC TGCTTCTGGG GCGCAGAACG TCAGGTCGTT
GCAACCGCAA CCTTCCGTTT CTAA
 
Protein sequence
MARSKTAQPK HSLRKIAVVV ATAVSGMSVY AQAAVEPKED TITVTAAPAP QESAWGPAAT 
IAARQSATGT KTDTPIQKVP QSISVVTAEE MALHQPKSVK EALSYTPGVS VGTRGASNTY
DHLIIRGFAA EGQSQNNYLN GLKLQGNFYN DAVIDPYMLE RAEIMRGPVS VLYGKSSPGG
LLNMVSKRPT TEPLKEVQFK AGTDSLFQTG FDFSDALDDD GVYSYRLTGL ARSANAQQKG
SEEQRYAIAP AFTWRPDDKT NFTFLSYFQN EPETGYYGWL PKEGTVEPLP NGKRLPTDFN
EGAKNNTYSR NEKMVGYSFD HEFNDTFTVR QNLRFAENKT SQNSVYGYGV CSDPANAYSK
QCAALAPADK GHYLARKYVV DDEKLQNFSV DTQLQSKFAT GDIDHTLLTG VDFMRMRNDI
NAWFGYDDSV PLLDLYNPVN TDFDFNAKDP ANSGPYRILN KQKQTGVYVQ DQAQWDKVLV
TLGGRYDWAD QESLNRVAGT TDKRDDKQFT WRGGVNYLFD NGVTPYFSYS ESFEPSSQVG
KDGNIFAPSK GKQYEVGVKY VPEDRPIVVT GAVYNLTKTN NLMADPEGSF FSVEGGEIRA
RGVEIEAKAA LSASVNVVGS YTYTDAEYTT DTTYKGNTPA QVPKHMASLW ADYTFFDGPL
SGLTLGTGGR YTGSSYGDPA NSFKVGSYTV VDALVRYDLA RVGMAGSNVA LHVNNLFDRE
YVASCFNTYG CFWGAERQVV ATATFRF