Gene ECH74115_4114 details

Gene Information

Locus tag: ECH74115_4114
Symbol:
ID: 6968524
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 3812645
End bp: 3813874
Gene Length: 1230 bp
Protein Length: 409 aa
Translation table: 11
GC content: 44%
IMG OID: 643387869
Product: serine transporter family protein
Protein accession: YP_002272309
Protein GI: 209398062
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0814] Amino acid permeases
TIGRFAM ID: [TIGR00814] serine transporter
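
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; both assumptions are consistent with the values in this record but are not stated by it. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the terminal stop codon


length = gene_length_bp(3812645, 3813874)
print(length)                     # 1230
print(protein_length_aa(length))  # 409
```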


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value: 0.000104615
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGTCTAATA TTTGGTCTAA AGAAGAAACT CTGTGGAGTT TCGCGCTCTA CGGCACAGCC 
GTTGGTGCAG GCACGCTCTT CCTTCCTATT CAGTTAGGTT CGGCAGGGGC TGTGGTCCTG
TTTATTACTG CTCTGGTCGC CTGGCCTTTA ACATACTGGC CACATAAAGC TTTATGCCAG
TTCATCCTCT CATCGAAAAC ATCAGCAGGT GAAGGGATAA CGGGCGCGGT AACACACTAC
TATGGCAAGA AGATTGGTAA TCTGATTACC ACGCTGTACT TCATCGCCTT TTTTGTCGTC
GTGTTGATAT ATGCAGTGGC AATTACCAAC TCACTTACGG AACAGCTGGC AAAGCATATG
GTTATTGATC TTCGCATCCG TATGTTGGTG AGTCTGGGTG TTGTATTAAT TCTGAATCTC
ATTTTTCTGA TGGGACGTCA TGCCACTATT CGGGTAATGG GATTTTTGGT ATTCCCATTG
ATTGCCTATT TCTTATTTCT TTCCATTTAC CTTGTAGGTA GTTGGCAACC TGATCTATTA
ACAACCCAGG TAGAGTTCAA TCAGAATACC CTTCACCAGA TATGGATATC GATTCCCGTG
ATGGTTTTCG CCTTTAGCCA TACGCCCATT ATTTCTACGT TTGCCATAGA CAGACGTGAA
AAATATGGCG AACACGCTAT GGATAAATGC AAAAAAATTA TGAAAGTCGC TTATCTCATC
ATCTGCATAA GTGTACTGTT CTTTGTCTTT AGCTGCCTGC TTTCTATTCC ACCTTCGTAT
ATTGAAGCGG CTAAAGAAGA AGGGGTCACC ATTTTATCGG CGCTTTCTAT GCTGCCGAAC
GCCCCAGCAT GGTTGTCAAT TTCCGGGATT ATTGTCGCAG TAGTTGCGAT GTCGAAATCA
TTCCTGGGTA CGTACTTTGG CGTTATTGAA GGTGCCACGG AGGTCGTCAA AACAACACTA
CAGCAGGTTG GTGTAAAGAA AAGTCGTGCA TTTAACCGCG CACTATCAAT TATGTTGGTA
TCGCTGATTA CCTTCATTGT TTGTTGCATT AACCCGAACG CGATTTCGAT GATTTATGCG
ATCAGCGGCC CGCTCATTGC CATGATACTT TTCATCATGC CTACGCTATC AACATATCTC
ATCCCGGCGC TTAAACCCTG GCGCTCCATC GGAAATCTGA TTACGCTGAT CGTGGGTATC
CTGTGCGTAT CGGTAATGTT CTTTAGCTAA
 
Protein sequence
MSNIWSKEET LWSFALYGTA VGAGTLFLPI QLGSAGAVVL FITALVAWPL TYWPHKALCQ 
FILSSKTSAG EGITGAVTHY YGKKIGNLIT TLYFIAFFVV VLIYAVAITN SLTEQLAKHM
VIDLRIRMLV SLGVVLILNL IFLMGRHATI RVMGFLVFPL IAYFLFLSIY LVGSWQPDLL
TTQVEFNQNT LHQIWISIPV MVFAFSHTPI ISTFAIDRRE KYGEHAMDKC KKIMKVAYLI
ICISVLFFVF SCLLSIPPSY IEAAKEEGVT ILSALSMLPN APAWLSISGI IVAVVAMSKS
FLGTYFGVIE GATEVVKTTL QQVGVKKSRA FNRALSIMLV SLITFIVCCI NPNAISMIYA
ISGPLIAMIL FIMPTLSTYL IPALKPWRSI GNLITLIVGI LCVSVMFFS
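
The relationship between the gene sequence and the protein sequence above can be illustrated with a small translation routine. This is a sketch only, not the method used to produce this record. It uses the standard codon assignments, which translation table 11 shares for elongation codons; table 11 differs in the start codons it permits, which this sketch does not handle (the gene here begins with ATG, so that does not affect the example). The names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code; '*' marks a stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First 30 bases and last 9 bases of the gene sequence shown above.
print(translate("ATGTCTAATA TTTGGTCTAA AGAAGAAACT"))  # MSNIWSKEET
print(translate("TTTAGCTAA"))                         # FS*
```

The first call reproduces the opening ten residues of the protein sequence, and the second shows the final two residues followed by the stop codon, which is not written in the protein sequence.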