Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2468 |
Symbol | purF |
ID | 6142717 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2515609 |
End bp | 2517126 |
Gene Length | 1518 bp |
Protein Length | 505 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641617340 |
Product | amidophosphoribosyltransferase |
Protein accession | YP_001744512 |
Protein GI | 170684103 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0034] Glutamine phosphoribosylpyrophosphate amidotransferase |
TIGRFAM ID | [TIGR01134] amidophosphoribosyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 56 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTGCGGTA TTGTCGGTAT CGCCGGTGTT ATGCCGGTTA ACCAGTCGAT TTATGATGCC TTAACGGTGC TTCAGCATCG CGGTCAGGAT GCCGCCGGCA TCATCACCAT AGATGCCAAT AACTGCTTCC GTTTGCGTAA AGCGAACGGG CTGGTGAGCG ATGTATTTGA AGCTCGCCAT ATGCAGCGTT TGCAGGGCAA TATGGGCATT GGTCATGTGC GTTACCCTAC GGCTGGCAGC TCCAGCGCCT CTGAAGCGCA GCCGTTTTAC GTTAACTCCC CGTATGGCAT TACGCTTGCC CACAACGGCA ATCTGACCAA CGCTCACGAG TTGCGTAAAA AACTGTTTGA AGAAAAACGC CGCCACATCA ACACCACTTC CGACTCGGAA ATTCTGCTTA ATATCTTTGC CAGCGAACTG GACAACTTCC GCCACTACCC GCTGGAAGCC GACAATATTT TCGCTGCCAT CGCTGCTACA AACCGCTTAA TCCGCGGCGC GTATGCCTGT GTGGCGATGA TCATCGGCCA CGGTATGGTT GCTTTCCGCG ATCCTAACGG GATTCGTCCG CTGGTACTGG GAAAACGTGA TATTGACGAG AACCGTACAG AATATATGGT CGCTTCCGAA AGCGTAGCGC TCGATACGCT GGGCTTTGAT TTCCTGCGTG ACGTCGCGCC GGGCGAAGCG ATTTACATCA CTGAAGAAGG GCAGTTGTTT ACCCGTCAAT GTGCTGACAA TCCGGTCAGC AATCCGTGCC TGTTTGAGTA TGTATACTTT GCTCGCCCGG ACTCGTTCAT CGACAAAATT TCCGTTTACA GCGCGCGTGT GAATATGGGT ACGAAACTGG GCGAGAAAAT TGCCCGCGAA TGGGAAGATC TGGAAATCGA CGTGGTGATC CCGATCCCGG AAACCTCGTG TGATATCGCG CTGGAAATTG CGCGTATTCT AGGCAAGCCG TACCGCCAGG GCTTCGTTAA AAACCGCTAT GTTGGCCGCA CCTTTATCAT GCCGGGCCAG CAGCTGCGTC GTAAGTCCGT GCGCCGTAAA CTGAACGCCA ACCGCGCCGA GTTCCGCGAT AAAAACGTCC TGCTGGTCGA CGACTCCATC GTCCGTGGCA CCACTTCTGA GCAGATTATC GAGATGGCAC GCGAAGCCGG AGCGAAGAAA GTGTACCTCG CTTCTGCGGC ACCGGAAATT CGCTTCCCGA ACGTTTACGG TATCGATATG CCGAGCGCCA CGGAACTGAT CGCTCACGGT CGCGAAGTAG ATGAAATTCG CCAGATCATC GGTGCTGACG GGTTGATTTT CCAGGATCTG AACGATCTGA TCGAAGCCGT TCGCGCTGAA AACCCGGATA TCCAGCAGTT TGAATGCTCG GTATTCAACG GCGTCTACGT CACCAAAGAT GTTGATCAGG GCTACCTCGA TTTCCTCGAT ACGTTACGTA ATGACGACGC CAAAGCAGTG CAACGTCAGA ACGAAGTGGA AAATCTCGAA ATGCATAACG AAGGATGA
|
Protein sequence | MCGIVGIAGV MPVNQSIYDA LTVLQHRGQD AAGIITIDAN NCFRLRKANG LVSDVFEARH MQRLQGNMGI GHVRYPTAGS SSASEAQPFY VNSPYGITLA HNGNLTNAHE LRKKLFEEKR RHINTTSDSE ILLNIFASEL DNFRHYPLEA DNIFAAIAAT NRLIRGAYAC VAMIIGHGMV AFRDPNGIRP LVLGKRDIDE NRTEYMVASE SVALDTLGFD FLRDVAPGEA IYITEEGQLF TRQCADNPVS NPCLFEYVYF ARPDSFIDKI SVYSARVNMG TKLGEKIARE WEDLEIDVVI PIPETSCDIA LEIARILGKP YRQGFVKNRY VGRTFIMPGQ QLRRKSVRRK LNANRAEFRD KNVLLVDDSI VRGTTSEQII EMAREAGAKK VYLASAAPEI RFPNVYGIDM PSATELIAHG REVDEIRQII GADGLIFQDL NDLIEAVRAE NPDIQQFECS VFNGVYVTKD VDQGYLDFLD TLRNDDAKAV QRQNEVENLE MHNEG
|
| |