Gene ECH74115_5229 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_5229 
Symbol 
ID6969644 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp4877389 
End bp4878774 
Gene Length1386 bp 
Protein Length461 aa 
Translation table11 
GC content54% 
IMG OID643388894 
Productputative transport protein YifK 
Protein accessionYP_002273314 
Protein GI209398945 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1113] Gamma-aminobutyrate permease and related permeases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000882979 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones63 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGATA ACAAACCAGA GCTACAGCGT GGGCTGGAAG CTCGACATAT CGAACTCATC 
GCCCTGGGGG GCACCATTGG CGTCGGCCTG TTTATGGGGG CCGCCAGTAC CCTGAAATGG
GCCGGGCCAT CCGTATTGTT GGCCTATATC ATCGCCGGGC TGTTCGTCTT TTTCATCATG
CGTTCAATGG GCGAAATGTT GTTCCTCGAA CCGGTTACCG GTTCGTTCGC CGTTTATGCG
CATCGTTATA TGAGCCCGTT CTTTGGCTAT CTCACCGCCT GGTCTTACTG GTTTATGTGG
ATGGCGGTGG GGATCTCTGA AATCACCGCC ATTGGCGTTT ATGTCCAGTT CTGGTTCCCG
GAGATGGCGC AGTGGATACC CGCATTGATC GCAGTGGCGC TGGTGGCGTT GGCGAATCTG
GCAGCGGTGC GGTTGTACGG CGAAATCGAG TTCTGGTTCG CGATGATCAA AGTCACCACG
ATTATCGTGA TGATTGTCAT TGGCCTGGGC GTGATTTTCT TTGGCTTTGG CAATGGCGGG
CAGTCGATTG GTTTTAGCAA TCTCACAGAG CATGGCGGTT TCTTTGCGGG TGGCTGGAAA
GGGTTCCTGA CCGCGCTGTG TATTGTGGTG GCGTCCTACC AGGGCGTGGA GCTGATTGGC
ATTACTGCCG GTGAAGCGAA GAATCCGCAG GTGACGCTGC GCAGTGCCGT AGGCAAGGTG
CTGTGGCGGA TCCTGATTTT CTACGTTGGC GCGATTTTCG TTATCGTCAC CATCTTCCCG
TGGAATGAAA TAGGCAGCAA CGGCAGCCCG TTCGTACTGA CTTTTGCCAA AATCGGTATT
ACCGCAGCGG CGGGCATTAT CAACTTTGTG GTGCTGACGG CTGCGCTCTC TGGCTGTAAC
AGCGGCATGT ACAGTTGCGG ACGTATGCTC TACGCACTGG CGAAAAACCG TCAGTTACCG
GCGGCGATGG CGAAAGTTTC CCGTCACGGC GTACCGGTTG CGGGTGTGGC AGTATCTATT
GCTATTCTGC TAATTGGCTC ATGCCTGAAC TACATCATTC CCAATCCGCA GCGTGTGTTT
GTCTACGTCT ACAGTGCCAG CGTGCTTCCG GGGATGGTGC CATGGTTTGT GATATTGATA
AGCCAGCTGC GTTTTCGGCG TGCACATAAA GCGGCGATTG CCAGCCATCC GTTCCGCTCA
ATCCTGTTCC CGTGGGCCAA CTACGTAACA ATGGCATTCC TGATTTGCGT TTTGATCGGC
ATGTACTTTA ATGAAGATAC GCGTATGTCG CTGTTTGTTG GCATCATCTT TATGCTGGCG
GTGACGGCGA TTTATAAAGT TTTTGGCCTT AATCGCCACG GGAAAGCGCA TAAACTGGAG
GAATAA
 
Protein sequence
MADNKPELQR GLEARHIELI ALGGTIGVGL FMGAASTLKW AGPSVLLAYI IAGLFVFFIM 
RSMGEMLFLE PVTGSFAVYA HRYMSPFFGY LTAWSYWFMW MAVGISEITA IGVYVQFWFP
EMAQWIPALI AVALVALANL AAVRLYGEIE FWFAMIKVTT IIVMIVIGLG VIFFGFGNGG
QSIGFSNLTE HGGFFAGGWK GFLTALCIVV ASYQGVELIG ITAGEAKNPQ VTLRSAVGKV
LWRILIFYVG AIFVIVTIFP WNEIGSNGSP FVLTFAKIGI TAAAGIINFV VLTAALSGCN
SGMYSCGRML YALAKNRQLP AAMAKVSRHG VPVAGVAVSI AILLIGSCLN YIIPNPQRVF
VYVYSASVLP GMVPWFVILI SQLRFRRAHK AAIASHPFRS ILFPWANYVT MAFLICVLIG
MYFNEDTRMS LFVGIIFMLA VTAIYKVFGL NRHGKAHKLE E