Gene ECH74115_4738 details

Gene Information

Locus tag: ECH74115_4738
Symbol:
ID: 6972170
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 4381381
End bp: 4382796
Gene Length: 1416 bp
Protein Length: 471 aa
Translation table: 11
GC content: 48%
IMG OID: 643388439
Product: hypothetical protein
Protein accession: YP_002272867
Protein GI: 209400201
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0226] ABC-type phosphate transport system, periplasmic component
TIGRFAM ID:
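
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The Python sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the last codon of the gene is a stop codon (the gene sequence in this record ends in TAA). The function names are illustrative only and do not belong to any database API.

```python
# Values copied from the Gene Information record.
start_bp = 4381381
end_bp = 4382796
gene_length_bp = 1416
protein_length_aa = 471


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate range (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(start_bp, end_bp)
computed_protein_length = expected_protein_length(computed_gene_length)

print("Gene length consistent:", computed_gene_length == gene_length_bp)
print("Protein length consistent:", computed_protein_length == protein_length_aa)
```

Under these assumptions both checks agree with the record: the span covers 1416 bp, which is 472 codons, giving 471 residues plus one stop codon.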


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 60
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGGGTG TCGGGCTTAC CGGTATTATT GAAGTTTGTA ATATCCTTAT CACGCCAACA 
ATTTATCTTC TACTCAACGT CTTTATGCTG ACGCTGGGGG CGATAATAAT ATTTTTCTCT
GGTCGCGTGT GGGCCGGTGA TAGCGCGCCA GAAAACAGAG AAATAGCCGT CTGGCGGCAA
TGCTTTTTTC TCTTACCCGC GCTATTAACC CTGGTTGGCT GGATAATCAC GCTACATCTG
GCAGATTATC AATTTCGCCA GATGGGAGCT GGTTGGTTGG CAAACCTTAT GCTTCCCTGG
TTGGGCGTTT TTTTAGTCTC ATTAGTGGGT GGTGAGTACT GGTGGATGGT CATTATTCCC
GTTGGGGCGC ATATCAGTTT TTCGCTGGGA TACGCCTGGC CGACCAGATA TCCTTTATCC
GGCACGTCCG GACTACGTTG CCGTAACTTA CTCCTGTTTC TACTTCTCTT ACTTGGTATT
GTCGCCGGGT ATCAGGCCCA TTTATATAAG CAGCAAAATC CTGGTGTCGG TGTACGCGAA
AATATTGATA TCAGGGCCTG GCGACCCGAT AAACTCAATA ATCGACTGAC GCCGCTGCGT
GGCAAACCGC AAATTCAGTT TAGGCAAAAC TGGCCGCGAA TCGATGGCGC CACGGCTGCG
TACCCAATTT ATGCTTCTGC ATTTTATGCA TTAAGTGTAA TACCAGAGGA TTTTCACGTT
TGGGAATATC TGGAGAACTC TCGTACCCCC GATGCATATA ACCGGATTGT TAAAGGTGAT
GCCGATATTA TTTTTGTGGC GCAACCCTCC GGCGGGCAGA AAAAACGCGC TGAGGAATCG
GGCGTCACTT TGCTATACAC GCCATTTGCC CGTGAAGCAT TTGTTTTCAT CGTCAATGCG
GATAATCCGG TTAATTCCCT GACTGAACAA CAGGTGCGTG ACATTTTCAG TGGTGCAATT
ACCAATTGGC GTACTGTTGG CGGTAACGAT CAGGAGATCC AGACCTGGCA GCGCCCGGAA
GACTCTGGCA GCCAGACAGT GATGCAATCA CAGGTCATGA AAAAAGTCCG CATGATCTCG
CCGCAGGAAA CGGAAGTGGC AAGCGTGATG GAGGGAATGA TTAAAGTCGT TGCCGAATAC
CGTAATACAA ACAACGCAAT AGGCTATACC TTCCGCTATT ACGCGACGCA AATGAATGCT
GATAAAAATA TAAGATTGCT AGCGATTAAC GGTATTACAC CGACGGCGGA AAACATTCGC
AACGGCAAAT ATGCGTACAT CGTCGATGCA TTTATGGTGA CGAGAGAAAA TACAACGTCA
GAAACACAAA AACTGGTCGA ATGGTTTTTA ACGCCGCAGG GGCAGAGTCT GGTAGAAGAT
GTGGGATATG TGCCGCTGTA TCTAACAATG GAATAA
 
Protein sequence
MLGVGLTGII EVCNILITPT IYLLLNVFML TLGAIIIFFS GRVWAGDSAP ENREIAVWRQ 
CFFLLPALLT LVGWIITLHL ADYQFRQMGA GWLANLMLPW LGVFLVSLVG GEYWWMVIIP
VGAHISFSLG YAWPTRYPLS GTSGLRCRNL LLFLLLLLGI VAGYQAHLYK QQNPGVGVRE
NIDIRAWRPD KLNNRLTPLR GKPQIQFRQN WPRIDGATAA YPIYASAFYA LSVIPEDFHV
WEYLENSRTP DAYNRIVKGD ADIIFVAQPS GGQKKRAEES GVTLLYTPFA REAFVFIVNA
DNPVNSLTEQ QVRDIFSGAI TNWRTVGGND QEIQTWQRPE DSGSQTVMQS QVMKKVRMIS
PQETEVASVM EGMIKVVAEY RNTNNAIGYT FRYYATQMNA DKNIRLLAIN GITPTAENIR
NGKYAYIVDA FMVTRENTTS ETQKLVEWFL TPQGQSLVED VGYVPLYLTM E