Gene ECH74115_4423 details


Gene Information

Locus tag: ECH74115_4423
Symbol:
ID: 6971477
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 4098053
End bp: 4099363
Gene Length: 1311 bp
Protein Length: 436 aa
Translation table: 11
GC content: 57%
IMG OID: 643388144
Product: hypothetical protein
Protein accession: YP_002272581
Protein GI: 209399521
COG category: [S] Function unknown
COG ID: [COG3681] Uncharacterized conserved protein
TIGRFAM ID:
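
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
# Coordinates copied from the record above.
# Assumption: 1-based, inclusive coordinates on the replicon.
START_BP = 4098053
END_BP = 4099363


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    n = gene_length(START_BP, END_BP)
    print(n, protein_length(n))  # prints: 1311 436
```

Under these assumptions the computed values agree with the Gene Length (1311 bp) and Protein Length (436 aa) fields.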


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 42
Fosmid unclonability p-value: 0.0648812
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTTGATT CGACTTTAAA TCCGTTATGG CAGCGTTACA TCCTCGCCGT TCAGGAGGAA 
GTAAAACCGG CGCTGGGATG TACTGAACCG ATTTCACTGG CGCTGGCGGC GGCGGTTGCT
GCGGCAGAAC TGGAAGGTCC GGTTGAACGT GTAGAAGCCT GGGTTTCGCC AAATCTGATG
AAGAACGGTC TGGGCGTCAC CGTTCCCGGC ACGGGAATGG TGGGGCTGCC GATTGCGGCG
GCGCTGGGGG CGTTAGGTGG AAATGCCAAC GCCGGGCTGG AAGTGCTGAA AGACGCAACT
GCGCAGGCAA TTGCCGATGC CAAAGCACTG CTGGCGGCGG GGAAAGTCTC CGTTAAGATC
CAGGAACCTT GCAATGAAAT CCTCTTCTCA CGCGCCAAAG TCTGGAACGG TGAGAAGTGG
GCGTGTGTCA CTATCGTCGG CGGGCATACC AACATTGTGC ATATTGAGAC GCACGATGGT
GTGGTGTTTA CCCAGCAGGC GTGTGTGGCA GAGGGCGAGC AAGAGTCTCC GCTATCGGTG
CTTTCCAGAA CGACGCTGGC TGAGATCCTG AAGTTCGTCA ATGAAGTCCC GTTTGCGGCG
ATCCGCTTTA TTCTCGATTC CGCGAAGCTA AATTGTGCGT TATCGCAGGA AGGTTTGAGC
GGTAAGTGGG GGCTGCATAT TGGCGCGACG CTGGAAAAAC AGTGCGAGCG CGGTTTGCTG
GCGAAAGATC TCTCTTCATC CATTGTGATT CGTACCAGCG CGGCATCCGA TGCGCGTATG
GGCGGCGCCA CGCTTCCGGC AATGAGTAAC TCCGGCTCGG GTAACCAGGG GATCACCGCA
ACAATGCCCG TGGTGGTGGT AGCAGAACAC TTCGGAGCGG ATGATGAACG GCTGGCGCGT
GCGCTGATGC TTTCTCATTT GAGCGCAATT TACATCCATA ACCAGTTACC GCGTTTGTCT
GCGCTGTGTG CCGCAACGAC CGCAGCAATG GGGGCCGCCG CCGGGATGGC ATGGCTGGTG
GATGGGCGTT ATGAAACCAT CTCGATGGCG ATCAGCAGTA TGATCGGCGA TGTCAGCGGC
ATGATTTGCG ATGGTGCGTC GAACAGCTGC GCGATGAAGG TTTCGACCAG TGCTTCGGCT
GCGTGGAAAG CGGTGTTAAT GGCGCTGGAT GATACCGCCG TGACCGGCAA TGAAGGGATC
GTGGCGCATG ATGTTGAGCA GTCGATTGCC AACCTGTGTG CGTTAGCAAG CCATTCGATG
CAGCAAACGG ATCGGCAGAT TATCGAGATT ATGGCGAGCA AGGCCAGATA A
 
Protein sequence
MFDSTLNPLW QRYILAVQEE VKPALGCTEP ISLALAAAVA AAELEGPVER VEAWVSPNLM 
KNGLGVTVPG TGMVGLPIAA ALGALGGNAN AGLEVLKDAT AQAIADAKAL LAAGKVSVKI
QEPCNEILFS RAKVWNGEKW ACVTIVGGHT NIVHIETHDG VVFTQQACVA EGEQESPLSV
LSRTTLAEIL KFVNEVPFAA IRFILDSAKL NCALSQEGLS GKWGLHIGAT LEKQCERGLL
AKDLSSSIVI RTSAASDARM GGATLPAMSN SGSGNQGITA TMPVVVVAEH FGADDERLAR
ALMLSHLSAI YIHNQLPRLS ALCAATTAAM GAAAGMAWLV DGRYETISMA ISSMIGDVSG
MICDGASNSC AMKVSTSASA AWKAVLMALD DTAVTGNEGI VAHDVEQSIA NLCALASHSM
QQTDRQIIEI MASKAR
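
The gene and protein sequences above are related by translation table 11, and the GC content field can be recomputed from the gene sequence. The sketch below shows one way to do both in plain Python. It assumes the sequence is pasted in the 10-base blocks shown above (whitespace is stripped), and it uses the standard codon assignments, which table 11 shares for elongation codons; table 11's alternative start codons are not handled, which does not matter for this gene because it begins with ATG. The helper names `clean`, `gc_fraction` and `translate` are hypothetical.

```python
# Codon table built from the usual TCAG ordering of the genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; upper-case the rest."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    s = clean(seq)
    if not s:
        return 0.0
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        residue = CODON_TABLE[s[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence above.
    head = "ATGTTTGATT CGACTTTAAA TCCGTTATGG"
    print(translate(head))  # prints: MFDSTLNPLW
```

The first ten residues produced this way match the start of the protein sequence above. Passing the full gene sequence to `gc_fraction` and `translate` would allow the GC content and protein fields to be checked in the same manner.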