Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_2835 |
Symbol | |
ID | 6972402 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 2639486 |
End bp | 2640421 |
Gene Length | 936 bp |
Protein Length | 311 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 643386685 |
Product | hypothetical protein |
Protein accession | YP_002271156 |
Protein GI | 209400843 |
COG category | [S] Function unknown |
COG ID | [COG1376] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00000168176 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 54 |
Fosmid unclonability p-value | 0.833391 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGCGTC GTGTAAATAT TCTTTGCTCA TTTGCTCTGC TTTTTGCCAG CCATACTAGC CTGGCGGTAA CTTATCCATT ACCTCCAGAG GGTAGCCGTT TAGTGGGGCA GTCGCTTACT GTAACTGTTC CTGATCACAA TACCCAGCCG CTGGAGACTT TTGCCGCACA ATACGGGCAA GGGTTAAGTA ACATGCTGGA AGCGAATCCG GGCGCTGATG TCTTCTTACC GAAGTCAGGT TCGCAACTCA CCATTCCGCA GCAACTGATT TTGCCCGTCA CTGTTCGTAA AGGGATTGTG GTTAACGTCG CTGAGATGCG TCTTTATTAC TACCCACCAG ACAGTAATAC TGTGGAAGTC TTTCCTATTG GTATCGGCCA GGCTGGGCGA GAAACCCCGC GTAACTGGGT GACTACCGTT GAACGTAAAC AAGAAGCACC AACCTGGACG CCAACGCCGA ACACCCGGCG CGAATATGCG AAACGAGGGG AGAGTTTGCC CGCATTTGTT CCTGCGGGGC CCGATAATCC CATGGGGCTG TACGCGATTT ATATTGGCAG GCTATATGCC ATCCACGGTA CCAATGCCAA TTTTGGTATT GGGCTCCGGG TAAGTCAGGG CTGTATTCGT CTGCGCAATG ACGATATCAA ATATCTGTTT GATAATGTTC CTGTAGGTAC TCGTGTGCAA ATTATTGATC AGCCAGTGAA ATACACAACG GAACCGGATG GTTCAAAGTG GCTGGAAGTT CATGAGCCGC TGTCGCGCAA TCGTGCTGAA TATGAGTCTG ACCGAAAAGT GCCATTGCCG GTAACCCCAT CTTTGCGGGC GTTTATCAAC GGGCAAGAAG TTGATGTGAA TCGCGCAAAT GCTGCGTTGC AACGTCGATC GGGAATGCCT GTGCAAATTA GTTCTGGTTC AAGACAGATG TTTTAA
|
Protein sequence | MMRRVNILCS FALLFASHTS LAVTYPLPPE GSRLVGQSLT VTVPDHNTQP LETFAAQYGQ GLSNMLEANP GADVFLPKSG SQLTIPQQLI LPVTVRKGIV VNVAEMRLYY YPPDSNTVEV FPIGIGQAGR ETPRNWVTTV ERKQEAPTWT PTPNTRREYA KRGESLPAFV PAGPDNPMGL YAIYIGRLYA IHGTNANFGI GLRVSQGCIR LRNDDIKYLF DNVPVGTRVQ IIDQPVKYTT EPDGSKWLEV HEPLSRNRAE YESDRKVPLP VTPSLRAFIN GQEVDVNRAN AALQRRSGMP VQISSGSRQM F
|
| |