Gene ECH74115_4809 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_4809 
Symbol 
ID6971264 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp4444161 
End bp4446479 
Gene Length2319 bp 
Protein Length772 aa 
Translation table11 
GC content58% 
IMG OID643388501 
Producthypothetical protein 
Protein accessionYP_002272929 
Protein GI209396647 
COG category[R] General function prediction only 
COG ID[COG4258] Predicted exporter 
TIGRFAM ID[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones72 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGAACG CCAACGTTTT GCCGCCCAGT AAACGCCCCG CGCTGTTATG GGGGCTAGTC 
TGCCTGGTCA TGGCGGTGGC GTTGCTGATC CTGCTGCCGC AATCACGGCT GAACAGTAGC
GTGCTGGCTA TGTTACCCAA ACAGACGATG GGCGATATTC CTCCGGCGCT GAATGACGGC
TTTATGCAGC GTCTTGACCG CCAACTGGTG TGGCTGGTCA GCCCCGGTAA AGAGGCTAAT
CCTTTGGTCG CTCAGGAGTG GCTGACGCTG CTGCAAAAAT CCGCTGCGCT CGGCGACGTT
AAAGGGCCAA TGGATGCCGC CAGCCAGCAG GCGTGGGGAG CGTTTTTCTG GCAGCATCGC
AACGGCCTGA TTGACCCCAA CACCCGCGCC CGCCTGCAAA ACGGCGGCGA AGCGCAGGCA
CAGTGGATCC TCTCCCAGCT TTATTCCGCA TTCTCCGGCG TAAGCGGCAA GGAGCTGCAA
AACGATCCGC TGATGTTAAT GCGCGGCTCG CAGCTGGCAA TGGCGAAAAA CGGCCAGCGT
TTGCGGCTGA TGGACGGTTG GCTGGTGACG CAGGATCCCC AGGGCAACTA CTGGTATCTG
CTGCACGGCG AACTGGCGGG ATCGTCGTTT GATATGCAGC AAACCCACCA GCTGATCACG
ACCCTGAATA CGCTGGAAAA GGATCTGAAA ACGCGTTACC CGCAGGCACA GTTGCTCTCG
CGCGGCACGG TGTTTTACAG CGATTACGCC AGCCAACAGG CGAAGCAGGA TATCTCCACC
CTGGGCGTGG CTACGCTGCT GGGGGTGATA TTGCTGATTG TGGCGGTGTT CCGCTCTTTA
CGCCCGTTGC TGCTTTGCGT GATTTCCATC GGCATCGGCG CGCTGGCGGG AACGGTCGCC
ACTTTATTGA TTTTCGGTGA ATTACACCTG ATGACGCTGG TGATGAGCAT GAGCGTTATC
GGCATTTCCG CTGACTACAC GCTCTATTAT CTCACCGAGC GGATGGTTCA CGGCAACGAC
GTTTCGCCGT GGCAAAGCCT GGCGAAAGTA CGCAATGCCT TGCTACTGGC GCTGCTCACC
ACCGTGGCGG CGTATCTGAT TATGATGCTC GCCCCCTTCC CCGGCATTCG CCAGATGGCG
ATTTTTGCCG CCGTCGGGTT GAGCGCCTCC TGTCTGACCG TCCTGTTCTG GCATCCGTGG
CTGTGCCGTG GCCTGCCGGT GCGTCCGGTT CCGGCGATGG CGCTGATGCT ACGCTGGCTG
GCAGCGTGGC GGCGTAATAA AAAACTGTCG CTGGGTCTGC CCGTCGCGCT GGCGCTGTTT
TCGCTGGCGG GGATGTCAAT GCTACGCGTC GATGACGATA TCTCGCAGTT ACAGGCGCTA
CCGCAGCATA TTCTGGCGCA GGAAAAAGCC ATTACCGCCC TGACCGGGCA GAGCGTCGAT
CAAAAATGGT TTGTGGTTTA CGGCGATTCG CCACAGCAAA CATTGCGGCG ACTGGAGAAA
TATACCGCCT CACTTGAGTA TGCGAAAAAA GAGGGGCTTA TCAGCAACTA CCGCACCATT
CCGCTGAACT CCCTTGCGCG GCAGGAGGAA GATTTACAAC TGCTGAAAAC GGCGGCCCCG
ACAGTAACAA AAGCGCTGCA AAATGCCGGG CTGACGGCAG TGAACCCGGA TCTCAACGCC
ATGCCAGTGA ACGTTGATGA ATGGCTGGCA AGCCCCGCCA GTGAAGGCTG GCGTCTGCTG
TGGCTGACGC TGGAAAACGG CGAAAGCGGC GTACTGGTGC CGGTTGAAGG GGTTAAAAGT
AGCGCGTTGA TGCAGGAAAT CGCCACATAT TACCCTTGCG GCATTGCCTG GGTTGATCGC
AAAAGCACCT TTGATGAATT GTTCGCACTT TACCGCTACG TCTTAACCGG CTTGTTGCTG
GTGGCGCTGG CAGTGATTGC CTGCGGCGCA GTGGCCCGTC TCGGCTGGCG CAAAGGGCTT
ATCAGCCTGG TGCCTTCGGT GCTTTCGCTG GGCTGTGGTC TGGCGGTGCT GGCGATGAGC
GGGCAGGCGG TGAATCTCTT TTCGCTGCTG GCGCTGGTGC TGGTGCTTGG CATCGGTATC
AACTACACGC TGTTTTTCAG TAATCCGCGC GGTACACCGT TAACTTCGCT ACTGGCGATC
GCGCTGGCAA TGCTCACCAC CTTGCTGACG CTGGGTATGC TGGTATTCAG CGCCACCCAG
GCCATCAGCA GTTTTGGCAT TGTGCTGGTG AGCGGTATTT TCACCGCCTT CCTGCTTTCG
CCGCTGGCTA TGCCCGATAA AAAGAGAACA AAAAAATGA
 
Protein sequence
MTNANVLPPS KRPALLWGLV CLVMAVALLI LLPQSRLNSS VLAMLPKQTM GDIPPALNDG 
FMQRLDRQLV WLVSPGKEAN PLVAQEWLTL LQKSAALGDV KGPMDAASQQ AWGAFFWQHR
NGLIDPNTRA RLQNGGEAQA QWILSQLYSA FSGVSGKELQ NDPLMLMRGS QLAMAKNGQR
LRLMDGWLVT QDPQGNYWYL LHGELAGSSF DMQQTHQLIT TLNTLEKDLK TRYPQAQLLS
RGTVFYSDYA SQQAKQDIST LGVATLLGVI LLIVAVFRSL RPLLLCVISI GIGALAGTVA
TLLIFGELHL MTLVMSMSVI GISADYTLYY LTERMVHGND VSPWQSLAKV RNALLLALLT
TVAAYLIMML APFPGIRQMA IFAAVGLSAS CLTVLFWHPW LCRGLPVRPV PAMALMLRWL
AAWRRNKKLS LGLPVALALF SLAGMSMLRV DDDISQLQAL PQHILAQEKA ITALTGQSVD
QKWFVVYGDS PQQTLRRLEK YTASLEYAKK EGLISNYRTI PLNSLARQEE DLQLLKTAAP
TVTKALQNAG LTAVNPDLNA MPVNVDEWLA SPASEGWRLL WLTLENGESG VLVPVEGVKS
SALMQEIATY YPCGIAWVDR KSTFDELFAL YRYVLTGLLL VALAVIACGA VARLGWRKGL
ISLVPSVLSL GCGLAVLAMS GQAVNLFSLL ALVLVLGIGI NYTLFFSNPR GTPLTSLLAI
ALAMLTTLLT LGMLVFSATQ AISSFGIVLV SGIFTAFLLS PLAMPDKKRT KK