Gene Information |
Locus tag | ECH74115_3641 |
Symbol | |
ID | 6968024 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 3355442 |
End bp | 3356440 |
Gene Length | 999 bp |
Protein Length | 332 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 643387435 |
Product | sodium:bile acid symporter family protein |
Protein accession | YP_002271888 |
Protein GI | 209397693 |
COG category | [R] General function prediction only |
COG ID | [COG0385] Predicted Na+-dependent transporter |
TIGRFAM ID | |
|
|
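The coordinate and length fields above are mutually consistent, which a short sketch can check. It assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends with a single stop codon not counted in the protein length. The function names are hypothetical and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(3355442, 3356440)
print(length, protein_length(length))  # prints: 999 332
```

Both values match the Gene Length (999 bp) and Protein Length (332 aa) fields of this record.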
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00000261961 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 60 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACTTT TTCGTATCCT CGATCCTTTC ACCTTAACCC TGATCACGGT GGTGTTGCTG GCCTCTTTCT TTCCTGCCAG AGGCGATTTC GTCCCCTTCT TTGAAAATCT GACCACCGCA GCTATTGCCC TGCTGTTCTT TATGCACGGC GCGAAGTTGT CGCGTGAGGC GATTATTGCT GGCGGTGGTC ACTGGCGACT GCATTTGTGG GTAATGTGCA GCACCTTCGT GCTGTTTCCG ATTCTGGGTG TACTGTTTGC CTGGTGGAAA CCGGTAAATG TCGACCCGAT GCTCTACTCC GGTTTTCTCT ACTTGTGCAT TCTCCCGGCT ACCGTGCAGT CTGCAATCGC CTTCACGTCA ATGGCGGGCG GTAACGTCGC GGCGGCGGTT TGTTCTGCGT CAGCATCCAG TCTGCTGGGG ATTTTCCTTT CACCATTGCT GGTTGGTCTG GTGATGAATG TTCACGGTGC AGGGGGCAGC CTTGAGCAGG TCGGTAAAAT TATGCTGCAA CTGCTGCTGC CGTTTGTGTT GGGGCATCTT TCCCGGCCGT GGATTGGTGA CTGGGTGTCG CGCAATAAAA AATGGATTGC GAAAACTGAC CAGACGTCCA TTCTGTTGGT GGTTTATACA GCGTTCAGCG AAGCCGTCGT TAATGGTATC TGGCATAAAG TTGGCTGGGG ATCATTGCTG TTTATCGTGG TGGTCAGCTG CGTTCTTCTG GCTATCGTGA TTGTAGTTAA CGTCTTTATG GCACGCCGAC TGGGCTTCAA TAAGGCAGAT GAAATTACTA TCGTCTTTTG TGGTTCGAAA AAGAGTCTGG CAAATGGCAT CCCGATGGCA AACATTCTGT TCCCCACATC GGTGATCGGT ATGATGGTGC TGCCCCTGAT GATTTTCCAT CAGATCCAAT TGATGGTCTG TGCGGTGCTG GCGCGTCGAT ACAAACGCCA GACCGAACAG TTACAGGCGC AGCAGGAAAG CAGCGCCGAT AAAGCTTAA
|
Protein sequence | MKLFRILDPF TLTLITVVLL ASFFPARGDF VPFFENLTTA AIALLFFMHG AKLSREAIIA GGGHWRLHLW VMCSTFVLFP ILGVLFAWWK PVNVDPMLYS GFLYLCILPA TVQSAIAFTS MAGGNVAAAV CSASASSLLG IFLSPLLVGL VMNVHGAGGS LEQVGKIMLQ LLLPFVLGHL SRPWIGDWVS RNKKWIAKTD QTSILLVVYT AFSEAVVNGI WHKVGWGSLL FIVVVSCVLL AIVIVVNVFM ARRLGFNKAD EITIVFCGSK KSLANGIPMA NILFPTSVIG MMVLPLMIFH QIQLMVCAVL ARRYKRQTEQ LQAQQESSAD KA
|
| |
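The sequence fields can be checked against the GC content and protein fields with a minimal sketch like the one below. It assumes the sequences are pasted as shown, in space-separated blocks of ten characters. The helper names are hypothetical. Translation table 11 assigns the same amino acids to codons as the standard code (it differs in the permitted start codons), so the standard codon table is used here.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean_sequence(raw: str) -> str:
    """Remove the block-separating whitespace and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(raw: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    A trailing partial codon is ignored; unknown codons become 'X'.
    """
    seq = clean_sequence(raw)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First two blocks and last nine bases of the gene sequence in this record.
print(translate("ATGAAACTTT TTCGTATCCT"))  # prints: MKLFRI
print(translate("AAAGCTTAA"))              # prints: KA
```

Passing the full Gene sequence value to `translate` and comparing the result with `clean_sequence` of the Protein sequence value checks the two fields against each other. `gc_fraction` on the same input can be compared with the reported GC content.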