Gene ECH74115_3396 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_3396 
SymbolarnA 
ID6969474 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp3139914 
End bp3141896 
Gene Length1983 bp 
Protein Length660 aa 
Translation table11 
GC content52% 
IMG OID643387204 
Productbifunctional UDP-glucuronic acid decarboxylase/UDP-4-amino-4-deoxy-L-arabinose formyltransferase 
Protein accessionYP_002271667 
Protein GI209399006 
COG category[G] Carbohydrate transport and metabolism
[J] Translation, ribosomal structure and biogenesis
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0223] Methionyl-tRNA formyltransferase
[COG0451] Nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones71 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAACCG TCGTTTTTGC CTACCACGAT ATGGGATGCC TCGGTATTGA AGCCCTGCTG 
GCTGCCGGTT ACGAAATTAG CGCCATTTTT ACCCATACCG ATAATCCCGG TGAAAAAGCC
TTTTATGGTT CGGTGGCTCG TCTGGCGGCG GAAAGAGGCA TTCCGGTTTA TGCGCCGGAT
AACGTTAATC ATCCGCTGTG GGTGGAACGC ATTGCCCAAC TCTCACCAGA GGTGATTTTC
TCTTTTTATT ATCGCCATCT TATTTGCGAC GAGATTTTGC AGCTCGCTCC CGCAGGTGCA
TTTAATCTGC ACGGTTCGCT GTTACCAAAA TATCGTGGTC GCGCGCCGCT GAACTGGGTG
CTGGTGAACG GTGAAACAGA AACTGGCGTT ACATTGCACC GAATGGTAAA ACGTGCCGAT
GCCGGGGCCA TTGTGGCGCA ACTGCGCGTT GCCATTGCGC CAGACGATAT CGCTATTACG
CTGCATCATA AATTGTGCCA TGCCGCGCGC CAGCTACTGG AACAGACATT ACCCGCCATT
AAACACGGTA ATATTCTGGA AATCGCCCAG CGCGAAAACG AAGCCACCTG TTTTGGTCGC
AGAACGCCGG ATGACAGTTT CCTCGAATGG CATAAGCCGG CATCCGTACT GCACAACATG
GTACGTGCCG TTGCCGATCC GTGGCCGGGT GCCTTCAGCT ATGTTGGCAA TCAGAAATTC
ACCGTCTGGT CGTCGCGTGT TCATCCTCAT GCCAGCAAAG CACAGCCGGG GAGCGTGATT
TCTGTTGCGC CACTGCTGAT TGCCTGTGGC GATGGCGCGC TGGAAATCGT CACCGGACAA
GCGGGCGACG GCATTACTAT GCAGGGCTCG CAATTAGCGC AGACGCTGGG CCTGGTGCAA
GGTTCACGCT TGAATAGCCA GCCTGCCTGC ACCGCCCGAC GCCGTACCCG GGTACTCATC
CTCGGGGTGA ATGGCTTTAT TGGCAACCAT CTGACAGAAC GCCTGCTGCG CGAAGATCAT
TATGAAGTTT ACGGTCTGGA TATTGGCAGC GATGCGATAA GCCGTTTTCT GAATCATCCG
CATTTTCACT TTGTCGAAGG CGATATCAGT ATTCATTCCG AATGGATTGA GTATCACGTC
AAAAAATGTG ATGTCGTCTT GCCGCTGGTG GCGATAGCCA CGCCGATTGA ATATACCCGC
AACCCGCTGC GCGTATTTGA ACTCGATTTT GAAGAGAATC TGCGCATTAT CCGCTACTGC
GTGAAGTACC GTAAGCGAAT CATCTTCCCG TCAACTTCAG AAGTTTATGG GATGTGTAGC
GATAAATACT TCGATGAGGA CCATTCTAAT TTAATCGTCG GCCCGGTGAA TAAACCACGC
TGGATTTATT CGGTATCAAA ACAATTACTT GATCGGGTGA TCTGGGCCTA TGGCGAAAAA
GAGGGTTTAC AGTTCACCCT CTTCCGCCCG TTTAACTGGA TGGGACCACG ACTGGATAAC
CTTAATGCGG CACGAATCGG CAGCTCCCGC GCGATTACGC AACTCATTCT CAATCTGGTA
GAAGGTTCAC CGATTAAGCT GATTGATGGC GGAAAACAAA AACGCTGCTT TACTGATATT
CGCGATGGTA TCGAGGCGTT ATACCGCATT ATCGAAAATG CGGGAAATCG CTGCGACGGC
GAAATTATCA ACATTGGCAA TCCTGAGAAC GAAGCGAGCA TTGAGGAACT GGGCGAGATG
CTGCTGGCGA GCTTCGAAAA ACATCCGCTG CGCCATCATT TCCCACCGTT TGCGGGCTTT
CGCGTTGTCG AAAGTAGCAG CTACTACGGC AAAGGATATC AGGACGTAGA GCATCGTAAA
CCGAGCATCC GCAATGCCCA CCACTGCCTG GACTGGGAGC CGAAAATTGA TATGCAGGAA
ACCATCGACG AAACGCTGGA TTTCTTCCTG CGCACCGTTG ATCTTACGGA TAAACCATCA
TGA
 
Protein sequence
MKTVVFAYHD MGCLGIEALL AAGYEISAIF THTDNPGEKA FYGSVARLAA ERGIPVYAPD 
NVNHPLWVER IAQLSPEVIF SFYYRHLICD EILQLAPAGA FNLHGSLLPK YRGRAPLNWV
LVNGETETGV TLHRMVKRAD AGAIVAQLRV AIAPDDIAIT LHHKLCHAAR QLLEQTLPAI
KHGNILEIAQ RENEATCFGR RTPDDSFLEW HKPASVLHNM VRAVADPWPG AFSYVGNQKF
TVWSSRVHPH ASKAQPGSVI SVAPLLIACG DGALEIVTGQ AGDGITMQGS QLAQTLGLVQ
GSRLNSQPAC TARRRTRVLI LGVNGFIGNH LTERLLREDH YEVYGLDIGS DAISRFLNHP
HFHFVEGDIS IHSEWIEYHV KKCDVVLPLV AIATPIEYTR NPLRVFELDF EENLRIIRYC
VKYRKRIIFP STSEVYGMCS DKYFDEDHSN LIVGPVNKPR WIYSVSKQLL DRVIWAYGEK
EGLQFTLFRP FNWMGPRLDN LNAARIGSSR AITQLILNLV EGSPIKLIDG GKQKRCFTDI
RDGIEALYRI IENAGNRCDG EIINIGNPEN EASIEELGEM LLASFEKHPL RHHFPPFAGF
RVVESSSYYG KGYQDVEHRK PSIRNAHHCL DWEPKIDMQE TIDETLDFFL RTVDLTDKPS