Gene EcSMS35_2468 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2468 
SymbolpurF 
ID6142717 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2515609 
End bp2517126 
Gene Length1518 bp 
Protein Length505 aa 
Translation table11 
GC content53% 
IMG OID641617340 
Productamidophosphoribosyltransferase 
Protein accessionYP_001744512 
Protein GI170684103 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0034] Glutamine phosphoribosylpyrophosphate amidotransferase 
TIGRFAM ID[TIGR01134] amidophosphoribosyltransferase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones56 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTGCGGTA TTGTCGGTAT CGCCGGTGTT ATGCCGGTTA ACCAGTCGAT TTATGATGCC 
TTAACGGTGC TTCAGCATCG CGGTCAGGAT GCCGCCGGCA TCATCACCAT AGATGCCAAT
AACTGCTTCC GTTTGCGTAA AGCGAACGGG CTGGTGAGCG ATGTATTTGA AGCTCGCCAT
ATGCAGCGTT TGCAGGGCAA TATGGGCATT GGTCATGTGC GTTACCCTAC GGCTGGCAGC
TCCAGCGCCT CTGAAGCGCA GCCGTTTTAC GTTAACTCCC CGTATGGCAT TACGCTTGCC
CACAACGGCA ATCTGACCAA CGCTCACGAG TTGCGTAAAA AACTGTTTGA AGAAAAACGC
CGCCACATCA ACACCACTTC CGACTCGGAA ATTCTGCTTA ATATCTTTGC CAGCGAACTG
GACAACTTCC GCCACTACCC GCTGGAAGCC GACAATATTT TCGCTGCCAT CGCTGCTACA
AACCGCTTAA TCCGCGGCGC GTATGCCTGT GTGGCGATGA TCATCGGCCA CGGTATGGTT
GCTTTCCGCG ATCCTAACGG GATTCGTCCG CTGGTACTGG GAAAACGTGA TATTGACGAG
AACCGTACAG AATATATGGT CGCTTCCGAA AGCGTAGCGC TCGATACGCT GGGCTTTGAT
TTCCTGCGTG ACGTCGCGCC GGGCGAAGCG ATTTACATCA CTGAAGAAGG GCAGTTGTTT
ACCCGTCAAT GTGCTGACAA TCCGGTCAGC AATCCGTGCC TGTTTGAGTA TGTATACTTT
GCTCGCCCGG ACTCGTTCAT CGACAAAATT TCCGTTTACA GCGCGCGTGT GAATATGGGT
ACGAAACTGG GCGAGAAAAT TGCCCGCGAA TGGGAAGATC TGGAAATCGA CGTGGTGATC
CCGATCCCGG AAACCTCGTG TGATATCGCG CTGGAAATTG CGCGTATTCT AGGCAAGCCG
TACCGCCAGG GCTTCGTTAA AAACCGCTAT GTTGGCCGCA CCTTTATCAT GCCGGGCCAG
CAGCTGCGTC GTAAGTCCGT GCGCCGTAAA CTGAACGCCA ACCGCGCCGA GTTCCGCGAT
AAAAACGTCC TGCTGGTCGA CGACTCCATC GTCCGTGGCA CCACTTCTGA GCAGATTATC
GAGATGGCAC GCGAAGCCGG AGCGAAGAAA GTGTACCTCG CTTCTGCGGC ACCGGAAATT
CGCTTCCCGA ACGTTTACGG TATCGATATG CCGAGCGCCA CGGAACTGAT CGCTCACGGT
CGCGAAGTAG ATGAAATTCG CCAGATCATC GGTGCTGACG GGTTGATTTT CCAGGATCTG
AACGATCTGA TCGAAGCCGT TCGCGCTGAA AACCCGGATA TCCAGCAGTT TGAATGCTCG
GTATTCAACG GCGTCTACGT CACCAAAGAT GTTGATCAGG GCTACCTCGA TTTCCTCGAT
ACGTTACGTA ATGACGACGC CAAAGCAGTG CAACGTCAGA ACGAAGTGGA AAATCTCGAA
ATGCATAACG AAGGATGA
 
Protein sequence
MCGIVGIAGV MPVNQSIYDA LTVLQHRGQD AAGIITIDAN NCFRLRKANG LVSDVFEARH 
MQRLQGNMGI GHVRYPTAGS SSASEAQPFY VNSPYGITLA HNGNLTNAHE LRKKLFEEKR
RHINTTSDSE ILLNIFASEL DNFRHYPLEA DNIFAAIAAT NRLIRGAYAC VAMIIGHGMV
AFRDPNGIRP LVLGKRDIDE NRTEYMVASE SVALDTLGFD FLRDVAPGEA IYITEEGQLF
TRQCADNPVS NPCLFEYVYF ARPDSFIDKI SVYSARVNMG TKLGEKIARE WEDLEIDVVI
PIPETSCDIA LEIARILGKP YRQGFVKNRY VGRTFIMPGQ QLRRKSVRRK LNANRAEFRD
KNVLLVDDSI VRGTTSEQII EMAREAGAKK VYLASAAPEI RFPNVYGIDM PSATELIAHG
REVDEIRQII GADGLIFQDL NDLIEAVRAE NPDIQQFECS VFNGVYVTKD VDQGYLDFLD
TLRNDDAKAV QRQNEVENLE MHNEG