Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_3531 |
Symbol | |
ID | 6971709 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 3268583 |
End bp | 3270520 |
Gene Length | 1938 bp |
Protein Length | 645 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 643387332 |
Product | hypothetical protein |
Protein accession | YP_002271795 |
Protein GI | 209398813 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.0000115281 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCATTTA AACACTATGA TGTTGTCAGG GCGGCGTCGC CGTCAGACCT TGCGAAACGA ATAACTCAAA AACTGAAGGA AGGGTGGCAG CCTTATGGTA GTGCGCTGAT TTCGACAGCT GGTTATGGTG CGGAGTTCAT CCAGCCAGTT GTGAGTGAGG GGAGCATCTC ATCACCAGAG GAGCCAGGCA ACCGTCCGAC GACCTCAGCG CCTTCTGTTG CGCCAGAATA TTACTATGTG ATCGCGCTTG CTGGTCAGTC CAATGGTATG TCATACGGTG AGGGACTGCC ATTGCCGGAT ACATTCGACA GCCCTGATCC ACGTATTAAA CAGTTAGCGC GTCGCAGTAC GGTGACACCG GGCGGTGCAG TATGCAAATA TAACGACATC ATTCCGGCGG ACCATTGTCT GCATGATGTG CAGGACATGA GCCGTCTTAA CCATCCGAAA GCGGACCTGT CAAAGGGGCA GTACGGAACC GTGGGGCAGG GGCTGCATAT CGCCAAAAAA CTGCTGCCGT TTATACCGGC GAATGCGGGC ATTCTGCTGG TTCCGTGCTG TCGTGGTGGT TCAGCGTTCA CCACCGGAGC CGATGGCACA TACAGTGACG CGAGTGGTGC CTCGGAGAAT TCAACCCGCT GGGGTGTGGA CAAGCCGCTG TATAAGGACC TTATCGGTCG AACAAAAGCA GCACTGAAGA AGAATCCGAA AAATGTGCTG TTTGCCGTGG TGTGGATGCA GGGGGAATTT GATTTTGGCG GTACGCCGGT AAATCACGCC GCACAGTTTG GTGCGCTGGT TGATAAATTC CGTGCAGACC TGGCGGATAT GGCAGGCCAG TGCGTCGGTG GCTCTGCTGG CGGTGTTCCC TGGATATGCG GGGACACGAC GTATTTCTGG AAGCAGAAGA ACGAATCCAC GTACCAGACG GTGTATGGCA GCTATAAAAA CAAAACGGAA AAGAATATCC ATTTCGTACC GTTCATGACG GATGAGAACG GGGTGAATGT GCCGACGAAC AAACCGGAAG AAGACCCGGA CATTCCGGGT ATCGGATATT ACGGTTCGAA ATGGCGTGAC AGCTCAGCCA CCTGGACGTC ACAGGACAGG GCGAGCCATT TCAGTTCATG GGCTCGCCGC GGGATTATTT CCGACCGTCT GGCAACGGCG ATTTTGCGCC ATGCGGGAAG AGTGGCGCTA AACGCGGGGG CATCATCGAC AGTATCAGAG GTGCGCCCGT CATCGCCTTC CGGTGCAGAA GCCACAGGCG TCACAACACT GCTCTCTTAC CTTGCCAGCG AGTCAGAGGG AAGCCTGAAA GTACAGGGAT GGTCAGCCAG TGGCGGCAGG GCAGAAGTGG TCAGCGATGC GGAGGGAACC GGAGGTAAGG CAGTGAAGCT GACCAAGGAA GCCGGTAAAA GCAGCTGGGT GCTGGAGTAC GCCGCGGGCA ACGGTGCGGC TCTGTTACAG AAAGGGGGGC AGATTCGCTG CCGCTTTAAG GTTTCGGGAG CGCTGGCTGC GAACCAGTAT GTTATGGCGT TTTACTGGCC GGTATCTTCA CTGCCACAGG GCGTTGCCCT GACCGGAGAC GGGGGGAATA ACCTGCTGGC AGCGTTCTAC ATCCAGACAG ATGCAAAAGA CCTGAATGTG ATGTACCACA ATGCGAAAGT GGCGACAAAC AACCTGAAAC TGGGAACCTT TGGCGCATTT GATAACGAAT GGCATACGCT GGCTTTCCGC TTTGCCGGGA ATAACAGCCT GCAGGTGACG CCGGTTATTG ATGGTCAGGA TGGCACACCG TTCACGCTGA CGCAGTCACC GGTCAGTGCC TTTGCGGCGG ATAAACTGCA TGTGACAGAC ATTACCAGAG GTGCGACTTA CCCGGTACTG ATAGACAGCA TTGCGGTGGA AGTGAACAGC ACAGACACTG CGGCATGA
|
Protein sequence | MAFKHYDVVR AASPSDLAKR ITQKLKEGWQ PYGSALISTA GYGAEFIQPV VSEGSISSPE EPGNRPTTSA PSVAPEYYYV IALAGQSNGM SYGEGLPLPD TFDSPDPRIK QLARRSTVTP GGAVCKYNDI IPADHCLHDV QDMSRLNHPK ADLSKGQYGT VGQGLHIAKK LLPFIPANAG ILLVPCCRGG SAFTTGADGT YSDASGASEN STRWGVDKPL YKDLIGRTKA ALKKNPKNVL FAVVWMQGEF DFGGTPVNHA AQFGALVDKF RADLADMAGQ CVGGSAGGVP WICGDTTYFW KQKNESTYQT VYGSYKNKTE KNIHFVPFMT DENGVNVPTN KPEEDPDIPG IGYYGSKWRD SSATWTSQDR ASHFSSWARR GIISDRLATA ILRHAGRVAL NAGASSTVSE VRPSSPSGAE ATGVTTLLSY LASESEGSLK VQGWSASGGR AEVVSDAEGT GGKAVKLTKE AGKSSWVLEY AAGNGAALLQ KGGQIRCRFK VSGALAANQY VMAFYWPVSS LPQGVALTGD GGNNLLAAFY IQTDAKDLNV MYHNAKVATN NLKLGTFGAF DNEWHTLAFR FAGNNSLQVT PVIDGQDGTP FTLTQSPVSA FAADKLHVTD ITRGATYPVL IDSIAVEVNS TDTAA
|
| |