Gene ECH74115_3531 details


Gene Information

Locus tag: ECH74115_3531
Symbol:
ID: 6971709
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 3268583
End bp: 3270520
Gene Length: 1938 bp
Protein Length: 645 aa
Translation table: 11
GC content: 55%
IMG OID: 643387332
Product: hypothetical protein
Protein accession: YP_002271795
Protein GI: 209398813
COG category:
COG ID:
TIGRFAM ID:
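
The start, end, gene length, and protein length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a single terminal stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Coordinates taken from the record above.
length = gene_length(3268583, 3270520)
print(length, protein_length(length))  # prints: 1938 645
```

Under those assumptions the computed values agree with the reported gene length (1938 bp) and protein length (645 aa).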


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value: 0.0000115281
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGCATTTA AACACTATGA TGTTGTCAGG GCGGCGTCGC CGTCAGACCT TGCGAAACGA 
ATAACTCAAA AACTGAAGGA AGGGTGGCAG CCTTATGGTA GTGCGCTGAT TTCGACAGCT
GGTTATGGTG CGGAGTTCAT CCAGCCAGTT GTGAGTGAGG GGAGCATCTC ATCACCAGAG
GAGCCAGGCA ACCGTCCGAC GACCTCAGCG CCTTCTGTTG CGCCAGAATA TTACTATGTG
ATCGCGCTTG CTGGTCAGTC CAATGGTATG TCATACGGTG AGGGACTGCC ATTGCCGGAT
ACATTCGACA GCCCTGATCC ACGTATTAAA CAGTTAGCGC GTCGCAGTAC GGTGACACCG
GGCGGTGCAG TATGCAAATA TAACGACATC ATTCCGGCGG ACCATTGTCT GCATGATGTG
CAGGACATGA GCCGTCTTAA CCATCCGAAA GCGGACCTGT CAAAGGGGCA GTACGGAACC
GTGGGGCAGG GGCTGCATAT CGCCAAAAAA CTGCTGCCGT TTATACCGGC GAATGCGGGC
ATTCTGCTGG TTCCGTGCTG TCGTGGTGGT TCAGCGTTCA CCACCGGAGC CGATGGCACA
TACAGTGACG CGAGTGGTGC CTCGGAGAAT TCAACCCGCT GGGGTGTGGA CAAGCCGCTG
TATAAGGACC TTATCGGTCG AACAAAAGCA GCACTGAAGA AGAATCCGAA AAATGTGCTG
TTTGCCGTGG TGTGGATGCA GGGGGAATTT GATTTTGGCG GTACGCCGGT AAATCACGCC
GCACAGTTTG GTGCGCTGGT TGATAAATTC CGTGCAGACC TGGCGGATAT GGCAGGCCAG
TGCGTCGGTG GCTCTGCTGG CGGTGTTCCC TGGATATGCG GGGACACGAC GTATTTCTGG
AAGCAGAAGA ACGAATCCAC GTACCAGACG GTGTATGGCA GCTATAAAAA CAAAACGGAA
AAGAATATCC ATTTCGTACC GTTCATGACG GATGAGAACG GGGTGAATGT GCCGACGAAC
AAACCGGAAG AAGACCCGGA CATTCCGGGT ATCGGATATT ACGGTTCGAA ATGGCGTGAC
AGCTCAGCCA CCTGGACGTC ACAGGACAGG GCGAGCCATT TCAGTTCATG GGCTCGCCGC
GGGATTATTT CCGACCGTCT GGCAACGGCG ATTTTGCGCC ATGCGGGAAG AGTGGCGCTA
AACGCGGGGG CATCATCGAC AGTATCAGAG GTGCGCCCGT CATCGCCTTC CGGTGCAGAA
GCCACAGGCG TCACAACACT GCTCTCTTAC CTTGCCAGCG AGTCAGAGGG AAGCCTGAAA
GTACAGGGAT GGTCAGCCAG TGGCGGCAGG GCAGAAGTGG TCAGCGATGC GGAGGGAACC
GGAGGTAAGG CAGTGAAGCT GACCAAGGAA GCCGGTAAAA GCAGCTGGGT GCTGGAGTAC
GCCGCGGGCA ACGGTGCGGC TCTGTTACAG AAAGGGGGGC AGATTCGCTG CCGCTTTAAG
GTTTCGGGAG CGCTGGCTGC GAACCAGTAT GTTATGGCGT TTTACTGGCC GGTATCTTCA
CTGCCACAGG GCGTTGCCCT GACCGGAGAC GGGGGGAATA ACCTGCTGGC AGCGTTCTAC
ATCCAGACAG ATGCAAAAGA CCTGAATGTG ATGTACCACA ATGCGAAAGT GGCGACAAAC
AACCTGAAAC TGGGAACCTT TGGCGCATTT GATAACGAAT GGCATACGCT GGCTTTCCGC
TTTGCCGGGA ATAACAGCCT GCAGGTGACG CCGGTTATTG ATGGTCAGGA TGGCACACCG
TTCACGCTGA CGCAGTCACC GGTCAGTGCC TTTGCGGCGG ATAAACTGCA TGTGACAGAC
ATTACCAGAG GTGCGACTTA CCCGGTACTG ATAGACAGCA TTGCGGTGGA AGTGAACAGC
ACAGACACTG CGGCATGA
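
The gene sequence is printed in space-separated blocks of ten bases, so the whitespace has to be removed before any computation. As a minimal sketch of how a GC content figure such as the one reported for this gene could be derived, the hypothetical helper below counts G and C bases over the cleaned sequence. How the database itself computes or rounds the value is not stated in the record.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First two 10-base blocks of the gene sequence above (6 of 20 bases are G or C).
first_blocks = "ATGGCATTTA AACACTATGA"
print(gc_fraction(first_blocks))
```

This short prefix is only an illustration; passing the full 1938 bp sequence to the same function gives the gene-wide value to compare with the reported 55%.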
 
Protein sequence
MAFKHYDVVR AASPSDLAKR ITQKLKEGWQ PYGSALISTA GYGAEFIQPV VSEGSISSPE 
EPGNRPTTSA PSVAPEYYYV IALAGQSNGM SYGEGLPLPD TFDSPDPRIK QLARRSTVTP
GGAVCKYNDI IPADHCLHDV QDMSRLNHPK ADLSKGQYGT VGQGLHIAKK LLPFIPANAG
ILLVPCCRGG SAFTTGADGT YSDASGASEN STRWGVDKPL YKDLIGRTKA ALKKNPKNVL
FAVVWMQGEF DFGGTPVNHA AQFGALVDKF RADLADMAGQ CVGGSAGGVP WICGDTTYFW
KQKNESTYQT VYGSYKNKTE KNIHFVPFMT DENGVNVPTN KPEEDPDIPG IGYYGSKWRD
SSATWTSQDR ASHFSSWARR GIISDRLATA ILRHAGRVAL NAGASSTVSE VRPSSPSGAE
ATGVTTLLSY LASESEGSLK VQGWSASGGR AEVVSDAEGT GGKAVKLTKE AGKSSWVLEY
AAGNGAALLQ KGGQIRCRFK VSGALAANQY VMAFYWPVSS LPQGVALTGD GGNNLLAAFY
IQTDAKDLNV MYHNAKVATN NLKLGTFGAF DNEWHTLAFR FAGNNSLQVT PVIDGQDGTP
FTLTQSPVSA FAADKLHVTD ITRGATYPVL IDSIAVEVNS TDTAA
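
The protein sequence can be checked against the gene sequence by translating codon by codon. The sketch below builds the codon table from the amino acid string of the standard genetic code, which NCBI translation table 11 shares for all 64 codons (table 11 differs in which codons may act as initiators). The handling of alternative start codons is omitted here, which is harmless for this gene because it begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids for codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate an in-frame DNA sequence, dropping one terminal stop codon."""
    dna = "".join(seq.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))
    return protein[:-1] if protein.endswith("*") else protein


# First 30 bases and final 18 bases of the gene sequence above.
print(translate("ATGGCATTTA AACACTATGA TGTTGTCAGG"))  # prints: MAFKHYDVVR
print(translate("ACAGACACTG CGGCATGA"))              # prints: TDTAA
```

Both fragments reproduce the corresponding ends of the protein sequence listed above. The final line of the gene sequence starts in frame because the 32 preceding lines hold 60 bases each.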