Gene Information |
Locus tag | EcE24377A_2550 |
Symbol | arnA |
ID | 5588571 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | + |
Start bp | 2542896 |
End bp | 2544878 |
Gene Length | 1983 bp |
Protein Length | 660 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640926208 |
Product | bifunctional UDP-glucuronic acid decarboxylase/UDP-4-amino-4-deoxy-L-arabinose formyltransferase |
Protein accession | YP_001463602 |
Protein GI | 157159038 |
COG category | [G] Carbohydrate transport and metabolism; [J] Translation, ribosomal structure and biogenesis; [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0223] Methionyl-tRNA formyltransferase; [COG0451] Nucleoside-diphosphate-sugar epimerases |
TIGRFAM ID | |
|
|
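The Start bp, End bp, Gene Length, and Protein Length fields above are mutually consistent, which can be checked with a short sketch. It assumes the coordinates are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The helper names `gene_length` and `protein_length` are illustrative only and are not part of the record.

```python
# Coordinates copied from the record above.
# Assumption: 1-based, inclusive positions on the replicon.
START_BP = 2542896
END_BP = 2544878


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for an unspliced CDS.

    Assumption: the CDS ends in one stop codon, which is not
    counted as a residue.
    """
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1983
print(protein_length(length_bp))  # 660
```

Both values match the Gene Length (1983 bp) and Protein Length (660 aa) fields of the record.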
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAACCG TCGTTTTTGC CTACCACGAT ATGGGATGCC TCGGTATTGA AGCTCTGCTG GCTGCCGGTT ACGAAATTAG CGCCATTTTT ACCCATACCG ATAATCCCGG TGAAAAAGCC TTTTATGGTT CGGTGGCTCG TCTGGCAGCG GAAAGAGGCA TTCCGGTTTA TGCGCCGGAT GACGTTAATC ATCCGCTGTG GGTGGAACGC ATTGCCCAAC TGTCGCCAGA TGTGATTTTC TCTTTTTATT ATCGCCATCT TATTTGCGAC GAAATTTTGC AGCTCGCTCC CGCAGGTGCA TTTAATCTGC ATGGTTCGCT GTTACCAAAA TATCGTGGTC GCGCGCCGCT GAACTGGGTG CTGGTCAACG GTGAAACGGA AACTGGCGTT ACATTGCACC GAATGGTGAA ACGTGCCGAT GCCGGGGCCA TTGTGGCGCA ACTGCGCATT GCCATTGCGC CAGACGATAT CGCTATTACG CTGCATCATA AATTGTGCCA TGCCGCGCGC CAGCTACTGG AACAGACATT ACCCGCCATT AAACACGGTA ATATTCTGGA AATCGCCCAG CGCGAAAACG AAGCCACCTG TTTTGGTCGC AGAACGCCGG ATGACAGTTT CCTTGAATGG CATAAACCGG CATCCGTACT GCACAACATG GTACGTGCCG TTGCCGATCC GTGGCCGGGT GCCTTCAGCT ATGTTGGCAA TCAGAAATTC ACCGTCTGGT CGTCGCGTGT TCATCCTCAT GCCAGCAAAG CACAGCCGGG GAGCGTGATT TCTGTTGCGC CACTGCTGAT TGCCTGTGGC GATGGCGCGC TGGAAATCGT CACCGGACAG GCGGGCGACG GCATTACTAT GCAGGGCTCG CAATTAGCGC AGACGCTGGG CCTGGTGCAA GGTTCACGCT TGAATAGCCA GCCTGCCTGC ACCGCCCGAC GCCGTACCCG GGTACTCATC CTCGGGGTGA ATGGCTTTAT TGGCAACCAT CTGACAGAAC GCCTGCTGCG CGAAGATCAT TATGAAGTTT ACGGTCTGGA TATTGGCAGC GATGCGATAA GCCGTTTTCT GAATCATCCG CATTTTCACT TTGTTGAAGG CGATATCAGT ATTCATTCCG AATGGATTGA GTATCATGTC AAAAAATGTG ATGTCGTCTT GCCGCTGGTG GCGATAGCCA CGCCGATTGA ATATACCCGC AACCCGCTGC GCGTATTTGA ACTCGATTTT GAAGAGAATC TGCGCATTAT CCGCTACTGC GTGAAGTACC GTAAGCGAAT CATCTTCCCG TCAACTTCAG AAGTTTATGG GATGTGTAGC GATAAATACT TCGATGAGGA CCATTCTAAT TTAATCGTCG GCCCGGTGAA TAAACCACGC TGGATTTATT CGGTATCAAA ACAATTACTT GATCGGGTGA TCTGGGCCTA TGGCGAAAAA GAGGGTTTAC AGTTCACCCT CTTCCGCCCG TTTAACTGGA TGGGACCGCG ACTGGATAAC CTTAATGCGG CACGAATCGG CAGCTCCCGC GCTATTACGC AACTCATTCT CAATCTGGTA GAAGGTTCAC CGATTAAGCT GATTGATGGC GGAAAACAAA AACGCTGCTT TACTGATATT CGCGATGGTA TCGAGGCGTT ATACCGCATT ATCGAAAATG CGGGAAATCG CTGCGACGGT GAAATTATCA ACATTGGCAA TCCTGAGAAC GAAGCGAGCA TTGAGGAACT GGGCGAGATG CTGCTGGCGA GCTTCGAAAA ACATCCGCTG CGCCATCATT TCCCACCGTT TGCGGGCTTT 
CGCGTTGTCG AAAGTAGCAG CTACTACGGC AAAGGATATC AGGACGTAGA GCATCGTAAA CCGAGCATCC GCAATGCCCA CCGCTGCCTG GACTGGGAGC CGAAAATTGA TATGCAGGAA ACCATCGACG AAACGCTGGA TTTCTTCCTG CGCACCGTTG ATCTTACGGA TAAACCATCA TGA
|
Protein sequence | MKTVVFAYHD MGCLGIEALL AAGYEISAIF THTDNPGEKA FYGSVARLAA ERGIPVYAPD DVNHPLWVER IAQLSPDVIF SFYYRHLICD EILQLAPAGA FNLHGSLLPK YRGRAPLNWV LVNGETETGV TLHRMVKRAD AGAIVAQLRI AIAPDDIAIT LHHKLCHAAR QLLEQTLPAI KHGNILEIAQ RENEATCFGR RTPDDSFLEW HKPASVLHNM VRAVADPWPG AFSYVGNQKF TVWSSRVHPH ASKAQPGSVI SVAPLLIACG DGALEIVTGQ AGDGITMQGS QLAQTLGLVQ GSRLNSQPAC TARRRTRVLI LGVNGFIGNH LTERLLREDH YEVYGLDIGS DAISRFLNHP HFHFVEGDIS IHSEWIEYHV KKCDVVLPLV AIATPIEYTR NPLRVFELDF EENLRIIRYC VKYRKRIIFP STSEVYGMCS DKYFDEDHSN LIVGPVNKPR WIYSVSKQLL DRVIWAYGEK EGLQFTLFRP FNWMGPRLDN LNAARIGSSR AITQLILNLV EGSPIKLIDG GKQKRCFTDI RDGIEALYRI IENAGNRCDG EIINIGNPEN EASIEELGEM LLASFEKHPL RHHFPPFAGF RVVESSSYYG KGYQDVEHRK PSIRNAHRCL DWEPKIDMQE TIDETLDFFL RTVDLTDKPS
|
| |