Gene ECH74115_1349 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_1349 
Symbol 
ID6971026 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp1354702 
End bp1355760 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content50% 
IMG OID643385332 
ProductATP-grasp domain protein 
Protein accessionYP_002269827 
Protein GI209399591 
COG category[I] Lipid transport and metabolism 
COG ID[COG0439] Biotin carboxylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.204009 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones52 
Fosmid unclonability p-value0.65425 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACAAGA AAATTTGGTT TATGGAAGGT TTATCCTCCC AGCGAGATAT TATTCAGGGG 
GTAAAATCAT TTGCACAAAA AAATAATTTT GCCATTACCG TTTTTGCCTC CCACCGTAAC
GAAAGAAATG AAATCCTTTC CGTTGCCGAT TATTCTTTGA CTGAACCTGA AGATCCTCAA
AAACGTCTTC AGTTTATCCA GGAAACCATT CAGACCTACG GCATCCACCA TATTCATACT
GGCCGTAACA GCCAGTGGTT TGAAGAACAC CGTTCAGCCA TTGAACCGAC CGGTGCCACC
CTTACTACCG GTGCAACGGG CGTCGACTGG TTAACTCTGG CTGACGAAAA AGTTACTTTT
GCTCAGTTTA TGGAGCAAAA GGGTCTCCCG GTCGTACCAT CCTGGCGGGT GAATACGCTG
GCAGAATTAA AGACACACCT CGCGGCCCCG CCGTTCACTG ACAGCCCGGT ATGCGTGAAG
CCGGTGACGG GTATCTATGG CATGGGATTC TGGCGCTTTG ATGACAGTGC TTCGCCTATG
GCCGTCTTTA ATCATCCCGA ACATCGTCTG GTCAGTCCGC AACAGTATAT TGCAGCAGCA
TCAGCTGCTG AGTCGTTTAA ACCCCTTGTT TTGATGCCGT ACCTGCCAGG CCCGGAATTT
TCCGTCGATA TCCTCGCGGA TAAGGGCGAA ATACTCGCAG CCGTGGGACG CCGTAAGGAA
GGGGCTATCC AGTATCTGGT AAACGAAGGA AGCGCCTGGG AACTGGCGTG TGACTGCGCC
CGTGTTATGA AGGCCGACGG GCTGGTGAAT GTTCAGACGC GAAACGATGT GAATGGCAAC
CCGGTGCTGC TTGAAACCAA CATGCGTCCG TCAGGGGGGG TGGGTTATAC CCTTCACAGC
GGTGTGAACC TTCCTGGGTT ATTTGCTGCC TTTAAGCTCG GTCTGATGTC TGAAGATATG
GTACGCCAGA GCGCTAAAAA CACCTTTTCT CCGGTTGCGG TGAGATCCAT TACGGATGTA
ATTGCATACC CGGAATCACT CTCTAACCTT CTGAATTAA
 
Protein sequence
MNKKIWFMEG LSSQRDIIQG VKSFAQKNNF AITVFASHRN ERNEILSVAD YSLTEPEDPQ 
KRLQFIQETI QTYGIHHIHT GRNSQWFEEH RSAIEPTGAT LTTGATGVDW LTLADEKVTF
AQFMEQKGLP VVPSWRVNTL AELKTHLAAP PFTDSPVCVK PVTGIYGMGF WRFDDSASPM
AVFNHPEHRL VSPQQYIAAA SAAESFKPLV LMPYLPGPEF SVDILADKGE ILAAVGRRKE
GAIQYLVNEG SAWELACDCA RVMKADGLVN VQTRNDVNGN PVLLETNMRP SGGVGYTLHS
GVNLPGLFAA FKLGLMSEDM VRQSAKNTFS PVAVRSITDV IAYPESLSNL LN