Gene Information |
Locus tag | ECH74115_5145 |
Symbol | |
ID | 6972151 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 4787076 |
End bp | 4788413 |
Gene Length | 1338 bp |
Protein Length | 445 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 643388816 |
Product | inorganic anion transporter, sulfate permease (SulP) family |
Protein accession | YP_002273242 |
Protein GI | 209399820 |
COG category | [R] General function prediction only |
COG ID | [COG2252] Permeases |
TIGRFAM ID | |
|
|
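Note: the coordinate and length fields above agree with each other if the start and end positions are read as 1-based, inclusive coordinates and the CDS ends in a single stop codon. Both readings are assumptions inferred from the numbers, not stated by the record. A minimal sketch of that check, using hypothetical helper names:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp / End bp rows of this record.
length = gene_length_bp(4787076, 4788413)
print(length, protein_length_aa(length))  # prints: 1338 445
```

The strand does not affect the length calculation; it only determines which end of the interval holds the start codon.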
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 0.164927 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTCAAC AACACACAAC CCAGGCTTCT GGCCAGGGGA TGCTGGAACG CGTGTTTAAA CTGCGCGAAC ATGGCACGAC GGCACGGACC GAAGTGATCG CCGGTTTTAC CACCTTCCTG ACGATGGTTT ACATCGTTTT TGTTAACCCG CAAATTCTTG GCGTTGCTGG CATGGATACC AGCGCCGTCT TCGTCACTAC CTGTCTGATT GCTGCATTCG GCAGTATTAT GATGGGGCTG TTTGCTAACC TGCCAGTTGC ACTGGCACCC GCTATGGGCC TGAATGCGTT CTTCGCTTTT GTCGTTGTGC AGGCGATGGG CTTGCCGTGG CAAGTCGGGA TGGGCGCAAT CTTCTGGGGC GCGATTGGTC TGCTGTTACT GACGATTTTC CGCGTTCGCT ACTGGATGAT TGCCAACATT CCGGTGAGTC TGCGTGTGGG CATTACCAGC GGTATCGGTC TGTTCATTGG CATGATGGGG CTGAAAAACG CAGGTGTGAT TGTCGCTAAC CCGGAAACGC TGGTCAGCAT CGGTAATCTG ACTTCTCACA GCGTGCTGCT GGGGATCCTC GGCTTCTTCA TCATTGCTAT TCTGGCCTCA CGCAACATTC ACGCAGCGGT GCTGGTTTCT ATCGTGGTGA CAACGCTGCT GGGCTGGATG CTGGGTGATG TCCACTACAA TGGCATCGTT TCTGCGCCGC CGAGCGTAAT GACTGTCGTG GGTCATGTAG ATTTAGCCGG GTCGTTTAAC CTCGGGCTGG CAGGGGTGAT TTTCTCTTTC ATGCTGGTCA ACCTGTTTGA CTCCTCCGGC ACGTTGATTG GCGTGACCGA TAAAGCCGGT CTGGCAGATG AGAAGGGGAA ATTCCCGCGC ATGAAGCAGG CGCTGTATGT CGACAGTATC TCTTCCGTGA CCGGTTCGTT TATCGGTACT TCTTCCGTTA CGGCTTATAT TGAGTCCTCT TCCGGCGTAT CGGTTGGCGG TCGTACCGGT CTGACGGCAG TGGTTGTTGG TCTGCTGTTC CTGCTGGTTA TCTTTCTGTC GCCGCTGGCG GGAATGGTGC CAGGCTACGC TGCAGCTGGC GCGCTGATTT ACGTTGGCGT GTTGATGACC TCAAGTCTTG CTCGCGTGAA CTGGCAGGAT CTTACTGAAT CTGTTCCGGC GTTTATTACC GCTGTGATGA TGCCGTTCAG CTTCTCGATT ACCGAAGGTA TTGCGCTGGG CTTTATCTCC TACTGCGTGA TGAAGATTGG TACCGGGCGT ATTCGTGACC TTAGCCCGTG CGTAATCATC GTTGCGCTGC TGTTTATCCT GAAGATTGTA TTTATCGACG CTCACTAA
|
Protein sequence | MSQQHTTQAS GQGMLERVFK LREHGTTART EVIAGFTTFL TMVYIVFVNP QILGVAGMDT SAVFVTTCLI AAFGSIMMGL FANLPVALAP AMGLNAFFAF VVVQAMGLPW QVGMGAIFWG AIGLLLLTIF RVRYWMIANI PVSLRVGITS GIGLFIGMMG LKNAGVIVAN PETLVSIGNL TSHSVLLGIL GFFIIAILAS RNIHAAVLVS IVVTTLLGWM LGDVHYNGIV SAPPSVMTVV GHVDLAGSFN LGLAGVIFSF MLVNLFDSSG TLIGVTDKAG LADEKGKFPR MKQALYVDSI SSVTGSFIGT SSVTAYIESS SGVSVGGRTG LTAVVVGLLF LLVIFLSPLA GMVPGYAAAG ALIYVGVLMT SSLARVNWQD LTESVPAFIT AVMMPFSFSI TEGIALGFIS YCVMKIGTGR IRDLSPCVII VALLFILKIV FIDAH
|
| |
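Note: the sequences above are printed in space-separated blocks of 10 characters. The gene sequence appears to be given in coding orientation (it begins with ATG and ends with TAA), so no reverse complement should be needed even though the gene lies on the minus strand. The sketch below shows one way to strip the block spacing, compute GC content, and translate the gene. The function names are hypothetical. Translation table 11 assigns the same amino acids to codons as the standard table and differs in which codons may act as start codons, so this sketch does not handle alternative start codons such as GTG or TTG.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from this record.
print(translate("ATGAGTCAAC AACACACAAC CCAGGCTTCT"))  # prints: MSQQHTTQAS
```

Passing the full Gene sequence field to `translate` and comparing the result with the cleaned Protein sequence field is a quick consistency check; `gc_content` on the same input can be compared with the GC content row.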