Gene ECH74115_1220 details


Gene Information

Locus tag: ECH74115_1220
Symbol: ymcA
ID: 6969340
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 1230013
End bp: 1232109
Gene Length: 2097 bp
Protein Length: 698 aa
Translation table: 11
GC content: 52%
IMG OID: 643385215
Product: group 4 capsule (G4C) polysaccharide, lipoprotein YmcA
Protein accession: YP_002269710
Protein GI: 209395798
COG category:
COG ID:
TIGRFAM ID:
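
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon. Both are assumptions (the record does not state them), but they are consistent with the reported values. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(1230013, 1232109)
print(length)                     # 2097
print(protein_length_aa(length))  # 698
```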


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.221159
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 64
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGAAGA ATTCTTATCT TTTAAGCTGC CTGGCCATTG CCGTCTCCAG TGCCTGTCAT 
GCTGAAGTAT TAACCTACCC GGATCCGCTG GGTTCGTCGC AATCGGACTT TGGCGGCACA
GGATTGTTGC AGATGCCAAA TGCGCGCATC GCACCGGAAG GTGAATTCAG CGTCAACTAC
CGGGATAACG ATCAATACCG TTTCTACTCC ACCTCCGTGG CGCTGTTTCC ATGGCTGGAA
GGCACCATTC GTTATACGGA TGTGCGCACA CGTAAATATA GCCAGTGGGA AGATTTCAGC
GGCGATCAGT CATACAAAGA CAAATCATTC GATTTTAAAC TTCGCCTGTG GGAAGAAGGT
TACTGGCTAC CGCAAGTGGC GTTTGGTAAA CGTGATATTG CTGGTACGGG TCTGTTTGAC
GGTGAGTATC TGGTGGCCAG CAAGCAAGCG GGGCCATTTG ATTTCACCCT CGGGATGGCA
TGGGGCTACG CCGGTAATGC GGGCAATATT ACCAACCCGT TTTGCCGGGT GAGCGATAAA
TATTGTCATC GCGCAGAGTC TCACGATGCG GGCGATATCA GCTTTAGCGA TATCTTTCGT
GGCCCGGCTT CCATCTTTGG CGGCATTGAG TATCAAACGC CGTGGAATCC CCTGCGTCTG
AAACTCGAAT ACGACGGCAA CAATTACCAG AATGATTTCG CTGGCAAACT GCCTCAGGCA
AGCCATTTCA ACGTCGGTGC AGTTTATCGC GCTGCCAGCT GGGCAGACCT CAACCTGAGT
TATGAACGCG GTAACACGTT GATGTTTGGT TTCACTTTAC GGACCAATTT CAACGATCTG
CGCCCTGCCC TGCGCGATAC GCCAAAACCG GCATATCAAC CTGCGCCTGA ATCTGAAGGA
TTGCAGTACA CCACGGTAGC AAACCAACTT ACCGCCCTGA AGTATAACGC GGGCTTTGAC
GCGCCAGAAA TTCAGCTACG CGATAAGACG CTGTATATGT CTGGTCAGCA ATACAAATAC
CGTGACTCTC GTGAAGCGGT CGATCGTGCC AACCGGATTC TGGTGAATAA CCTGCCGCAA
GGCGTTGAGA AGATTAGCGT GACGCAAAAG CGCGAGCATA TGGCGATGGT GACTACCGAA
ACCGACGTAG CCAGCCTGCG CAAACAGCTG GCAGGTACAG CGCCTGGTCA ATCAGAGCCA
CTGCAACAAC AACGTGTTGA AGCTGAAGAT CTTTCTGCCT TTGGTCGGGG CTACCGTATT
CGTGAAGATC GCTTTAGCTA CTCTTTCAAC CCAACACTTT CACAGTCGCT GGGCGGCCCG
GAAGATTTCT ATATGTTCCA GCTGGGGCTG ATGTCCAGCG CCCGCTACTG GTTTACCGAC
CACCTGCTGC TTGATGGCGG TATTTTCACC AATATTTACA ACAACTACGA CAAGTTTAAG
TCTTCGCTGT TGCCCGCGGA CTCTACCCTG CCCCGCGTGC GCACGCATAT CCGTGATTAC
GTTCGCAATG ACGTTTATCT CAACAACTTG CAAGCGAATT ACTTTGCCGA CTTAGGCAAT
GGTTTCTATG GCCAGGTGTA TGGCGGTTAT CTGGAAACGA TGTATGCCGG TGTCGGTTCC
GAGCTGCTTT ATCGCCCGCT AGATGCCAGC TGGGCGCTGG GTGTGGACGT TAACTACGTG
AAGCAGCGTG ACTGGGACAA CATGATGCGC TTCACCGATT ATTCCACGCC AACTGGTTTC
GTGACGGCTT ACTGGAACCC GCCGACGCTC AATGGCGTAC TGATGAAACT TAGCGTTGGG
CAATATCTGG CAAAAGATAA AGGGGCAACG ATCGACGTCG CCAAACGCTT TGACAGCGGC
GTGGCGGTAG GGGTATGGGC GGCAATCAGT AACGTATCTA AAGATGACTA CGGCGAAGGC
GGCTTTAGTA AAGGTTTTTA TATCTCGATT CCGTTCGACT TGATGACCAT TGGACCTAAC
CGCAACCGCG CGGTGGTTTC GTGGACACCA TTGACGCGTG ATGGTGGACA AATGCTGTCA
CGCAAATACC AGCTCTATCC AATGACGGCA GAGCGAGAAG TACCGGTTGG ACAATAA
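
A GC content figure such as the one in the Gene Information block can be computed from a sequence printed in this 10-base block layout. The sketch below assumes that GC content means the fraction of G and C bases over the total sequence length. The helper name `gc_fraction` is hypothetical, and the example uses only the first two blocks of the gene sequence rather than the full record.

```python
def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases; whitespace between blocks is ignored."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two 10-base blocks of the gene sequence above.
print(gc_fraction("ATGAAGAAGA ATTCTTATCT"))  # 0.25
```

To check the whole gene, pass the complete sequence text, including spaces and line breaks, to the same function.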
 
Protein sequence
MKKNSYLLSC LAIAVSSACH AEVLTYPDPL GSSQSDFGGT GLLQMPNARI APEGEFSVNY 
RDNDQYRFYS TSVALFPWLE GTIRYTDVRT RKYSQWEDFS GDQSYKDKSF DFKLRLWEEG
YWLPQVAFGK RDIAGTGLFD GEYLVASKQA GPFDFTLGMA WGYAGNAGNI TNPFCRVSDK
YCHRAESHDA GDISFSDIFR GPASIFGGIE YQTPWNPLRL KLEYDGNNYQ NDFAGKLPQA
SHFNVGAVYR AASWADLNLS YERGNTLMFG FTLRTNFNDL RPALRDTPKP AYQPAPESEG
LQYTTVANQL TALKYNAGFD APEIQLRDKT LYMSGQQYKY RDSREAVDRA NRILVNNLPQ
GVEKISVTQK REHMAMVTTE TDVASLRKQL AGTAPGQSEP LQQQRVEAED LSAFGRGYRI
REDRFSYSFN PTLSQSLGGP EDFYMFQLGL MSSARYWFTD HLLLDGGIFT NIYNNYDKFK
SSLLPADSTL PRVRTHIRDY VRNDVYLNNL QANYFADLGN GFYGQVYGGY LETMYAGVGS
ELLYRPLDAS WALGVDVNYV KQRDWDNMMR FTDYSTPTGF VTAYWNPPTL NGVLMKLSVG
QYLAKDKGAT IDVAKRFDSG VAVGVWAAIS NVSKDDYGEG GFSKGFYISI PFDLMTIGPN
RNRAVVSWTP LTRDGGQMLS RKYQLYPMTA EREVPVGQ
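
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below uses the standard codon-to-amino-acid assignments, which translation table 11 shares for internal codons. Table 11 differs in which codons may act as start codons, which does not matter here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative, not part of any database tool.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in TCAG codon order (TTT, TTC, TTA, TTG, TCT, ...); '*' is a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues.
print(translate("ATGAAGAAGA ATTCTTATCT TTTAAGCTGC"))  # MKKNSYLLSC
# Last 12 bases give the final three residues followed by the stop codon.
print(translate("GTTGG ACAATAA"))  # VGQ
```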