Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2409 |
Symbol | arnA |
ID | 6142809 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 2457418 |
End bp | 2459400 |
Gene Length | 1983 bp |
Protein Length | 660 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641617282 |
Product | bifunctional UDP-glucuronic acid decarboxylase/UDP-4-amino-4-deoxy-L-arabinose formyltransferase |
Protein accession | YP_001744454 |
Protein GI | 170682848 |
COG category | [G] Carbohydrate transport and metabolism [J] Translation, ribosomal structure and biogenesis [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0223] Methionyl-tRNA formyltransferase [COG0451] Nucleoside-diphosphate-sugar epimerases |
TIGRFAM ID | [TIGR00460] methionyl-tRNA formyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 61 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAACCG TCGTTTTTGC CTACCACGAT ATGGGATGCC TCGGTATTGA AGCCCTGCTG GCTGCCGGTT ACGAAATTAG CGCCATTTTT ACCCATACTG ATAATCCCGG TGAAAAAGCC TTTTATGGTT CGGTGGCTCG TCTGGCAGCG GAAAGAGGCA TTCCGGTTTA TGCGCCGGAT GACGTTAACC ATCCGCTGTG GGTGGAACGC ATTGCCCAAC TATCACCAGA TGTGATTTTC TCTTTTTATT ATCGCCATCT TATTCACGAC AAGATTTTGC AGCTCGCTCC TGCAGGCGCA TTTAATCTGC ATGGTTCACT GTTACCAAAA TATCGTGGTC GCGCGCCGCT GAACTGGGTG CTGGTGAACG GTGAAACGGA AACTGGCGTG ACATTGCACC GAATGGTGAA ACGTGCCGAT GCTGGGGCCA TTGTAGCCCA ACTGCGCATT GCCATTGCGC CAGACGATAT CGCCATTACG CTGCATCATA AGTTATGCCA TGCCGCGCGA CAGCTACTGG AGCAGACATT ACCCGCCATT AAAGACGGTA ATATTCTGGA AATCGCCCAG TGCGAAAACG AAGCCACCTG TTTTGGTCGC AGAACGCCAG AAGACAGCTT CCTCGAGTGG CACAAATCGG CAGCAGTATT GCATAACATG GTGCGTGCAG TCGCCGATCC GTGGCCGGGT GCCTTCAGCT ATGTTGGTAA TCAAAAATTT ACCGTCTGGT CGTCACGCGT ACATTCTCAT GCGCCCGCAG CACAACCGGG GAGCGTGATT TCTGTTGCGC CACTGCTGAT TGCCTGTGGC GATGGCGCGC TGGAAATCGT CACTGGACAG GCGGGCGGCG GCATTACTAT GCAGGGCTCG CAATTAGCGC AGACGCTGGG CCTGGTGCAA GGTTCACGCT TGAATAGCCA GCCTGCCTGT GCCGCCCGAC GCCGTACCCG GGTACTCATC CTCGGGGTGA ATGGCTTTAT TGGCAACCAT CTGACAGAAC GCCTGCTGCG CGAAGATCAT TATGAAGTTT ACGGTCTGGA TATTGGCAGC GATGCGATAA GCCGTTTTCT GAATCATCCG CATTTTCACT TTGTCGAAGG CGATATCAGT ATTCATTCCG AATGGATTGA GTATCACGTC AAAAAATGTG ATGTCGTCTT GCCTTTGGTG GCGATAGCCA CGCCGATTGA ATATACCCGC AACCCGCTGC GCGTATTTGA ACTCGATTTC GAAGAGAATC TGCGCATTAT CCGCTACTGC GTGAAGTACC GTAAGCGAAT CATCTTCCCG TCGACTTCAG AAGTTTATGG GATGTGTAGC GATAAATACT TCGATGAGGA CCATTCTAAT TTAATCGTCG GCCCGGTGAA TAAACCACGC TGGATTTATT CGGTGTCTAA ACAATTACTT GATCGAGTGA TCTGGGCCTA TGGCGAAAAA GAAGGTTTAC AGTTCACCCT CTTCCGCCCG TTTAACTGGA TGGGGCCACG ACTGGATAAC CTTAATGCAG CGCGAATTGG CAGCTCCCGC GCTATTACGC AACTCATTCT CAATCTGGTA GAAGGTTCAC CGATTAAGCT GATTGATGGC GGAAAACAAA AACGCTGCTT TACTGATATT CGGGATGGTA TCGAGGCGTT ATACCGCATT ATCGAAAATG CGGGAAATCG CTGCGATGGC GAAATTATCA ACATTGGCAA TCCTGAGAAC GAAGCGAGCA TTGAAGAACT GGGGGAGATG CTGCTGGCGA GCTTCGAAAA ACATCCGCTG CGCCATTACT TCCCACCGTT TGCGGGCTTT CGTGTTGTCG AAAGTAGCAG CTACTACGGC AAAGGATATC AGGACGTAGA GCATCGTAAA CCGAGCATCC GCAATGCCCG CCGCTGCCTG GACTGGGAAC CGAAAATTGA TATGCAGGAA ACCATCGACG AAACGCTGGA TTTCTTCCTG CGCACCGTTG ATCTTACGGA TAAACCATCA TGA
|
Protein sequence | MKTVVFAYHD MGCLGIEALL AAGYEISAIF THTDNPGEKA FYGSVARLAA ERGIPVYAPD DVNHPLWVER IAQLSPDVIF SFYYRHLIHD KILQLAPAGA FNLHGSLLPK YRGRAPLNWV LVNGETETGV TLHRMVKRAD AGAIVAQLRI AIAPDDIAIT LHHKLCHAAR QLLEQTLPAI KDGNILEIAQ CENEATCFGR RTPEDSFLEW HKSAAVLHNM VRAVADPWPG AFSYVGNQKF TVWSSRVHSH APAAQPGSVI SVAPLLIACG DGALEIVTGQ AGGGITMQGS QLAQTLGLVQ GSRLNSQPAC AARRRTRVLI LGVNGFIGNH LTERLLREDH YEVYGLDIGS DAISRFLNHP HFHFVEGDIS IHSEWIEYHV KKCDVVLPLV AIATPIEYTR NPLRVFELDF EENLRIIRYC VKYRKRIIFP STSEVYGMCS DKYFDEDHSN LIVGPVNKPR WIYSVSKQLL DRVIWAYGEK EGLQFTLFRP FNWMGPRLDN LNAARIGSSR AITQLILNLV EGSPIKLIDG GKQKRCFTDI RDGIEALYRI IENAGNRCDG EIINIGNPEN EASIEELGEM LLASFEKHPL RHYFPPFAGF RVVESSSYYG KGYQDVEHRK PSIRNARRCL DWEPKIDMQE TIDETLDFFL RTVDLTDKPS
|
| |