Gene ECH74115_4737 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_4737 
Symbol 
ID6969749 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp4380363 
End bp4381355 
Gene Length993 bp 
Protein Length330 aa 
Translation table11 
GC content47% 
IMG OID643388438 
Producthypothetical protein 
Protein accessionYP_002272866 
Protein GI209400933 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0226] ABC-type phosphate transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones63 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATAAAT TCACCGGTGT TTTACTATTA GGCACGGCGT TACTGGCGGG ATGTGTCGAC 
CGGGAAGGGT ACTATAACAG CGTCAGGGAA GAAGAGAGCC ATGGACTGAC GTCTCTGCGG
GGGCAACCTG CATTACGTTA CAGCGATGAT TGGTCAAGAT GGCCGAGAGT GTACGGCGCT
ACAGCCTTAT ACCCGCTGTA TGCCTCCGCG TATTATAAAT TAGTACCCGA GCCAAAAGAT
AAGGATCGAA CCTCGCTGGC CTGGCAGGCG TATGGTTTGC AGCAAACCCG AACAGCTGAA
GCCTACGATA GTCTGATTAA AGGTTCCGCG ACGGTTATTT TTGTTGCACA ACCGTCGGAA
GGACAGAAAA AACGTGCAGA AGAAGCGGGT GTTAAACTGA AATATACCGC TTTCGCCCGC
GAAGCCTTTG TCTTTATCGT TGATATTAAT AACCCGGTAA ATTCTCTCTC TGAGCACCAG
GTTAAAGATA TTTTTAGCGG CAAAACTAGC CGCTGGAATA AAGTAGGTGG TAGTGACGAA
CATATAAAAG TCTGGCAGCG CCCTGAAGAT TCTGGAAGCC AAACGATTAT GAAGGGGTTG
GTTATGCAAG ACACCCCAAT GCTGCCAGCT AAAAAATCCA CTGTGATTGA TCTTATGGGC
GGTTTAATTA CTGAAGTTGC CGACTATCAA AACACGCCAT CTTCCATTGG GTACACCTTC
CACTATTACG TCACTCGTAT GAATGACAAT ATGCTCAAAA TGCGCAAGCA GATTAAACTT
TTGGCTATAA ATGGCGTTGC GCCTACCGAG GAAAATATCC GCAACGGCAC TTATCCATAC
ATTGTGGATG CCTATATGGT GACGCGTGAA AATCCCACGC CGGAAACGCA GAAATTTGTT
GACTGGTTTA TAAGTCAGCA GGGGCAACAG TTGGTAGAGG ATGTGGGGTA TGTGCCGCTG
TATGAAGCAT CCCCCGAATC ATCAGGACAA TAA
 
Protein sequence
MNKFTGVLLL GTALLAGCVD REGYYNSVRE EESHGLTSLR GQPALRYSDD WSRWPRVYGA 
TALYPLYASA YYKLVPEPKD KDRTSLAWQA YGLQQTRTAE AYDSLIKGSA TVIFVAQPSE
GQKKRAEEAG VKLKYTAFAR EAFVFIVDIN NPVNSLSEHQ VKDIFSGKTS RWNKVGGSDE
HIKVWQRPED SGSQTIMKGL VMQDTPMLPA KKSTVIDLMG GLITEVADYQ NTPSSIGYTF
HYYVTRMNDN MLKMRKQIKL LAINGVAPTE ENIRNGTYPY IVDAYMVTRE NPTPETQKFV
DWFISQQGQQ LVEDVGYVPL YEASPESSGQ