Gene ECH74115_3438 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_3438 
Symbol 
ID6969510 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp3184592 
End bp3186133 
Gene Length1542 bp 
Protein Length513 aa 
Translation table11 
GC content52% 
IMG OID643387244 
Producthypothetical protein 
Protein accessionYP_002271707 
Protein GI209399207 
COG category[S] Function unknown 
COG ID[COG1288] Predicted membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000568736 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones62 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCAAGGGA ATATTTGCGC TATGTCCGCA ATCACTGAAT CCAAACCAAC AAGAAGATGG 
GCAATGCCCG ATACGTTGGT GATTATCTTT TTTGTTGCTA TTTTAACCAG CCTTGCCACC
TGGGTAGTTC CGGTGGGGAT GTTTGACAGT CAGGAAGTGC AGTATCAGGT TGATGGTCAA
ACAAAAACAC GCAAAGTCGT AGATCCACAC TCATTTCGCC TTCTGACTAA CGAAGCAGGC
GAACCTGAGT ATCACCGCGT ACAGCTGTTC ACGACGGGCG ATGAACGCCC GGGCCTGATG
AACTTCCCGT TTGAAGGGTT AACCTCAGGA TCGAAATACG GGACAGCCGT TGGCATCATC
ATGTTTATGC TGGTGATTGG CGGCGCGTTT GGCATTGTGA TGCGTACAGG AACCATTGAT
AACGGTATCC TGGCGCTTAT TCGCCATACC CGTGGGAATG AAATTCTCTT TATTCCTGCG
CTGTTTATTC TGTTTTCACT TGGCGGCGCG GTATTTGGTA TGGGGGAAGA GGCCGTCGCC
TTTGCCATTA TCATCGCACC GCTAATGGTC CGGCTGGGCT ATGACAGTAT TACCACCGTC
CTGGTGACCT ATATTGCCAC GCAAATCGGT TTTGCCAGTT CGTGGATGAA CCCGTTTTGT
GTGGTCGTTG CTCAGGGGAT TGCCGGCGTT CCGGTGCTTT CTGGCTCCGG GTTGCGCATC
GTGGTGTGGG TTATCGCCAC TCTGATTGGC CTGATCTTTA CCATGGTGTA CGCCTCACGA
GTGAAAAAGA ATCCTCTTCT GTCACGCGTG CATGAGTCCG ACCGCTTCTT TCGTGAAAAG
CAGGCGGATG TTGAACAACG TCCGTTTACC TTTGGTGACT GGCTGGTATT GATTGTCCTG
ACCGCCGTAA TGGTCTGGGT GATTTGGGGC GTGATCGTTA ATGCCTGGTT TATTCCAGAA
ATTGCCAGCC AGTTCTTCAC CATGGGTCTG GTGATTGGCA TCATCGGCGT TGTTTTCCGC
CTTAACGGTA TGACGGTTAA TACCATGGCT TCATCCTTCA CCGAAGGGGC GCGAATGATG
ATCGCCCCTG CCCTGCTGGT GGGTTTCGCC AAAGGGATTT TGCTGCTGGT CGGTAATGGT
GAAGCGGGTG ATGCCAGCGT GTTAAATACT ATCCTCAACA GCATTGCCAA TGCCATTAGC
GGTCTGGACA ACGCGGTCGC GGCCTGGTTT ATGTTGCTCT TCCAGGCGGT ATTTAATTTC
TTCGTGACGT CCGGTTCTGG TCAGGCGGCG TTAACCATGC CGTTACTGGC ACCGCTTGGC
GATCTGGTCG GTGTTAACCG TCAGGTTACC GTGCTGGCTT TCCAGTTTGG TGATGGCTTC
AGTCACATCA TTTACCCGAC CTCAGCTTCG TTAATGGCGA CGCTCGGTGT TTGCCGGGTG
GACTTCCGTA ACTGGCTGAA GGTGGGCGCG ACCCTGCTTG GACTGCTGTT TATTATGTCC
AGCGTCGTGG TGATCGGCGC TCAGTTGATG GGCTACCACT AA
 
Protein sequence
MQGNICAMSA ITESKPTRRW AMPDTLVIIF FVAILTSLAT WVVPVGMFDS QEVQYQVDGQ 
TKTRKVVDPH SFRLLTNEAG EPEYHRVQLF TTGDERPGLM NFPFEGLTSG SKYGTAVGII
MFMLVIGGAF GIVMRTGTID NGILALIRHT RGNEILFIPA LFILFSLGGA VFGMGEEAVA
FAIIIAPLMV RLGYDSITTV LVTYIATQIG FASSWMNPFC VVVAQGIAGV PVLSGSGLRI
VVWVIATLIG LIFTMVYASR VKKNPLLSRV HESDRFFREK QADVEQRPFT FGDWLVLIVL
TAVMVWVIWG VIVNAWFIPE IASQFFTMGL VIGIIGVVFR LNGMTVNTMA SSFTEGARMM
IAPALLVGFA KGILLLVGNG EAGDASVLNT ILNSIANAIS GLDNAVAAWF MLLFQAVFNF
FVTSGSGQAA LTMPLLAPLG DLVGVNRQVT VLAFQFGDGF SHIIYPTSAS LMATLGVCRV
DFRNWLKVGA TLLGLLFIMS SVVVIGAQLM GYH