Gene ECH74115_3641 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_3641 
Symbol 
ID6968024 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp3355442 
End bp3356440 
Gene Length999 bp 
Protein Length332 aa 
Translation table11 
GC content52% 
IMG OID643387435 
Productsodium:bile acid symporter family protein 
Protein accessionYP_002271888 
Protein GI209397693 
COG category[R] General function prediction only 
COG ID[COG0385] Predicted Na+-dependent transporter 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000261961 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones60 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACTTT TTCGTATCCT CGATCCTTTC ACCTTAACCC TGATCACGGT GGTGTTGCTG 
GCCTCTTTCT TTCCTGCCAG AGGCGATTTC GTCCCCTTCT TTGAAAATCT GACCACCGCA
GCTATTGCCC TGCTGTTCTT TATGCACGGC GCGAAGTTGT CGCGTGAGGC GATTATTGCT
GGCGGTGGTC ACTGGCGACT GCATTTGTGG GTAATGTGCA GCACCTTCGT GCTGTTTCCG
ATTCTGGGTG TACTGTTTGC CTGGTGGAAA CCGGTAAATG TCGACCCGAT GCTCTACTCC
GGTTTTCTCT ACTTGTGCAT TCTCCCGGCT ACCGTGCAGT CTGCAATCGC CTTCACGTCA
ATGGCGGGCG GTAACGTCGC GGCGGCGGTT TGTTCTGCGT CAGCATCCAG TCTGCTGGGG
ATTTTCCTTT CACCATTGCT GGTTGGTCTG GTGATGAATG TTCACGGTGC AGGGGGCAGC
CTTGAGCAGG TCGGTAAAAT TATGCTGCAA CTGCTGCTGC CGTTTGTGTT GGGGCATCTT
TCCCGGCCGT GGATTGGTGA CTGGGTGTCG CGCAATAAAA AATGGATTGC GAAAACTGAC
CAGACGTCCA TTCTGTTGGT GGTTTATACA GCGTTCAGCG AAGCCGTCGT TAATGGTATC
TGGCATAAAG TTGGCTGGGG ATCATTGCTG TTTATCGTGG TGGTCAGCTG CGTTCTTCTG
GCTATCGTGA TTGTAGTTAA CGTCTTTATG GCACGCCGAC TGGGCTTCAA TAAGGCAGAT
GAAATTACTA TCGTCTTTTG TGGTTCGAAA AAGAGTCTGG CAAATGGCAT CCCGATGGCA
AACATTCTGT TCCCCACATC GGTGATCGGT ATGATGGTGC TGCCCCTGAT GATTTTCCAT
CAGATCCAAT TGATGGTCTG TGCGGTGCTG GCGCGTCGAT ACAAACGCCA GACCGAACAG
TTACAGGCGC AGCAGGAAAG CAGCGCCGAT AAAGCTTAA
 
Protein sequence
MKLFRILDPF TLTLITVVLL ASFFPARGDF VPFFENLTTA AIALLFFMHG AKLSREAIIA 
GGGHWRLHLW VMCSTFVLFP ILGVLFAWWK PVNVDPMLYS GFLYLCILPA TVQSAIAFTS
MAGGNVAAAV CSASASSLLG IFLSPLLVGL VMNVHGAGGS LEQVGKIMLQ LLLPFVLGHL
SRPWIGDWVS RNKKWIAKTD QTSILLVVYT AFSEAVVNGI WHKVGWGSLL FIVVVSCVLL
AIVIVVNVFM ARRLGFNKAD EITIVFCGSK KSLANGIPMA NILFPTSVIG MMVLPLMIFH
QIQLMVCAVL ARRYKRQTEQ LQAQQESSAD KA