Gene EcSMS35_2409 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2409 
SymbolarnA 
ID6142809 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2457418 
End bp2459400 
Gene Length1983 bp 
Protein Length660 aa 
Translation table11 
GC content51% 
IMG OID641617282 
Productbifunctional UDP-glucuronic acid decarboxylase/UDP-4-amino-4-deoxy-L-arabinose formyltransferase 
Protein accessionYP_001744454 
Protein GI170682848 
COG category[G] Carbohydrate transport and metabolism
[J] Translation, ribosomal structure and biogenesis
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0223] Methionyl-tRNA formyltransferase
[COG0451] Nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID[TIGR00460] methionyl-tRNA formyltransferase 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones61 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAACCG TCGTTTTTGC CTACCACGAT ATGGGATGCC TCGGTATTGA AGCCCTGCTG 
GCTGCCGGTT ACGAAATTAG CGCCATTTTT ACCCATACTG ATAATCCCGG TGAAAAAGCC
TTTTATGGTT CGGTGGCTCG TCTGGCAGCG GAAAGAGGCA TTCCGGTTTA TGCGCCGGAT
GACGTTAACC ATCCGCTGTG GGTGGAACGC ATTGCCCAAC TATCACCAGA TGTGATTTTC
TCTTTTTATT ATCGCCATCT TATTCACGAC AAGATTTTGC AGCTCGCTCC TGCAGGCGCA
TTTAATCTGC ATGGTTCACT GTTACCAAAA TATCGTGGTC GCGCGCCGCT GAACTGGGTG
CTGGTGAACG GTGAAACGGA AACTGGCGTG ACATTGCACC GAATGGTGAA ACGTGCCGAT
GCTGGGGCCA TTGTAGCCCA ACTGCGCATT GCCATTGCGC CAGACGATAT CGCCATTACG
CTGCATCATA AGTTATGCCA TGCCGCGCGA CAGCTACTGG AGCAGACATT ACCCGCCATT
AAAGACGGTA ATATTCTGGA AATCGCCCAG TGCGAAAACG AAGCCACCTG TTTTGGTCGC
AGAACGCCAG AAGACAGCTT CCTCGAGTGG CACAAATCGG CAGCAGTATT GCATAACATG
GTGCGTGCAG TCGCCGATCC GTGGCCGGGT GCCTTCAGCT ATGTTGGTAA TCAAAAATTT
ACCGTCTGGT CGTCACGCGT ACATTCTCAT GCGCCCGCAG CACAACCGGG GAGCGTGATT
TCTGTTGCGC CACTGCTGAT TGCCTGTGGC GATGGCGCGC TGGAAATCGT CACTGGACAG
GCGGGCGGCG GCATTACTAT GCAGGGCTCG CAATTAGCGC AGACGCTGGG CCTGGTGCAA
GGTTCACGCT TGAATAGCCA GCCTGCCTGT GCCGCCCGAC GCCGTACCCG GGTACTCATC
CTCGGGGTGA ATGGCTTTAT TGGCAACCAT CTGACAGAAC GCCTGCTGCG CGAAGATCAT
TATGAAGTTT ACGGTCTGGA TATTGGCAGC GATGCGATAA GCCGTTTTCT GAATCATCCG
CATTTTCACT TTGTCGAAGG CGATATCAGT ATTCATTCCG AATGGATTGA GTATCACGTC
AAAAAATGTG ATGTCGTCTT GCCTTTGGTG GCGATAGCCA CGCCGATTGA ATATACCCGC
AACCCGCTGC GCGTATTTGA ACTCGATTTC GAAGAGAATC TGCGCATTAT CCGCTACTGC
GTGAAGTACC GTAAGCGAAT CATCTTCCCG TCGACTTCAG AAGTTTATGG GATGTGTAGC
GATAAATACT TCGATGAGGA CCATTCTAAT TTAATCGTCG GCCCGGTGAA TAAACCACGC
TGGATTTATT CGGTGTCTAA ACAATTACTT GATCGAGTGA TCTGGGCCTA TGGCGAAAAA
GAAGGTTTAC AGTTCACCCT CTTCCGCCCG TTTAACTGGA TGGGGCCACG ACTGGATAAC
CTTAATGCAG CGCGAATTGG CAGCTCCCGC GCTATTACGC AACTCATTCT CAATCTGGTA
GAAGGTTCAC CGATTAAGCT GATTGATGGC GGAAAACAAA AACGCTGCTT TACTGATATT
CGGGATGGTA TCGAGGCGTT ATACCGCATT ATCGAAAATG CGGGAAATCG CTGCGATGGC
GAAATTATCA ACATTGGCAA TCCTGAGAAC GAAGCGAGCA TTGAAGAACT GGGGGAGATG
CTGCTGGCGA GCTTCGAAAA ACATCCGCTG CGCCATTACT TCCCACCGTT TGCGGGCTTT
CGTGTTGTCG AAAGTAGCAG CTACTACGGC AAAGGATATC AGGACGTAGA GCATCGTAAA
CCGAGCATCC GCAATGCCCG CCGCTGCCTG GACTGGGAAC CGAAAATTGA TATGCAGGAA
ACCATCGACG AAACGCTGGA TTTCTTCCTG CGCACCGTTG ATCTTACGGA TAAACCATCA
TGA
 
Protein sequence
MKTVVFAYHD MGCLGIEALL AAGYEISAIF THTDNPGEKA FYGSVARLAA ERGIPVYAPD 
DVNHPLWVER IAQLSPDVIF SFYYRHLIHD KILQLAPAGA FNLHGSLLPK YRGRAPLNWV
LVNGETETGV TLHRMVKRAD AGAIVAQLRI AIAPDDIAIT LHHKLCHAAR QLLEQTLPAI
KDGNILEIAQ CENEATCFGR RTPEDSFLEW HKSAAVLHNM VRAVADPWPG AFSYVGNQKF
TVWSSRVHSH APAAQPGSVI SVAPLLIACG DGALEIVTGQ AGGGITMQGS QLAQTLGLVQ
GSRLNSQPAC AARRRTRVLI LGVNGFIGNH LTERLLREDH YEVYGLDIGS DAISRFLNHP
HFHFVEGDIS IHSEWIEYHV KKCDVVLPLV AIATPIEYTR NPLRVFELDF EENLRIIRYC
VKYRKRIIFP STSEVYGMCS DKYFDEDHSN LIVGPVNKPR WIYSVSKQLL DRVIWAYGEK
EGLQFTLFRP FNWMGPRLDN LNAARIGSSR AITQLILNLV EGSPIKLIDG GKQKRCFTDI
RDGIEALYRI IENAGNRCDG EIINIGNPEN EASIEELGEM LLASFEKHPL RHYFPPFAGF
RVVESSSYYG KGYQDVEHRK PSIRNARRCL DWEPKIDMQE TIDETLDFFL RTVDLTDKPS