Gene ECH74115_3317 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_3317 
Symbol 
ID6967116 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp3051565 
End bp3052590 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content53% 
IMG OID643387129 
ProductABC transporter, permease protein 
Protein accessionYP_002271593 
Protein GI209398171 
COG category[R] General function prediction only 
COG ID[COG4239] ABC-type uncharacterized transport system, permease component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.655724 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value0.274397 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGCGAC TCAACCCCGT CAATCAGGCC CGTTGGGCGC GTTTTCGCCA TAACCGTCGC 
GGCTACTGGT CGTTATGGAT TTTCCTCGTT TTGTTTGGTT TGAGTTTGTG TTCTGAACTT
ATCGCCAACG ATAAACCGTT GCTGGTGCGT TATGACGGCA GTTGGTATTT CCCGTTGTTG
AAAAACTACA GCGAAAGCGA TTTTGGCGGC CCGCTGGCAA GTCAGGCTGA TTATCAGGAC
CCGTGGCTGA AACAACGGCT GGAAAATAAT GGCTGGGTAC TGTGGGCACC GATTCGCTTT
GGTGCTACCA GTATCAACTT TGCTACCGAT AAGCCCTTCC CTTCTCCCCC CTCCCGGCAA
AACTGGCTGG GAACGGATGC CAACGGCGGC GATGTGCTGG CACGTATTCT CTATGGCACG
CGGATCTCGG TTCTGTTTGG CCTGATGCTG ACTCTCTGTT CCAGCGTGAT GGGCGTGCTG
GCGGGGGCGC TACAAGGCTA TTACGGCGGT AAGGTCGATC TCTGGGGGCA GCGCTTTATT
GAAGTATGGT CGGGGATGCC AACGCTGTTT TTGATTATTC TGCTTTCCAG CGTTGTGCAA
CCTAACTTCT GGTGGCTGCT GGCAATTACT GTCTTATTTG GCTGGATGAG TCTGGTCGGC
GTGGTGCGGG CGGAGTTTTT ACGTACCCGC AATTTCGACT ACATCCGCGC GGCACAGGCG
CTTGGCGTCA GCGATCGCAG TATTATTCTG CGTCATATGT TGCCGAATGC AATGGTCGCT
ACCCTCACCT TTTTACCGTT TATTTTATGT AGTTCGATCA CCACCCTGAC CTCACTCGAT
TTCCTCGGCT TCGGTCTGCC GCTCGGTTCA CCGTCACTCG GTGAACTGCT GTTACAAGGG
AAAAATAACC TTCAGGCTCC GTGGCTTGGG ATCACCGCCT TCTTGTCGGT GGCGATATTA
TTGTCTTTGC TGATCTTTAT TGGTGAAGCC GTCCGCGACG CATTTGATCC TAATAAGGCG
GTGTAG
 
Protein sequence
MSRLNPVNQA RWARFRHNRR GYWSLWIFLV LFGLSLCSEL IANDKPLLVR YDGSWYFPLL 
KNYSESDFGG PLASQADYQD PWLKQRLENN GWVLWAPIRF GATSINFATD KPFPSPPSRQ
NWLGTDANGG DVLARILYGT RISVLFGLML TLCSSVMGVL AGALQGYYGG KVDLWGQRFI
EVWSGMPTLF LIILLSSVVQ PNFWWLLAIT VLFGWMSLVG VVRAEFLRTR NFDYIRAAQA
LGVSDRSIIL RHMLPNAMVA TLTFLPFILC SSITTLTSLD FLGFGLPLGS PSLGELLLQG
KNNLQAPWLG ITAFLSVAIL LSLLIFIGEA VRDAFDPNKA V