Gene Information |
Locus tag | EcolC_3100 |
Symbol | |
ID | 6066260 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | + |
Start bp | 3393806 |
End bp | 3394873 |
Gene Length | 1068 bp |
Protein Length | 355 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641602517 |
Product | phosphoribosylaminoimidazole carboxylase ATPase subunit |
Protein accession | YP_001726051 |
Protein GI | 170021097 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase) |
TIGRFAM ID | [TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein |
| |
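The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers introduced for illustration, not part of any database API.

```python
# Values copied from the Gene Information table.
start_bp = 3393806
end_bp = 3394873


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1068 355
```

Under these assumptions the computed values agree with the listed Gene Length (1068 bp) and Protein Length (355 aa).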
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.75749 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACAGG TTTGCGTCCT CGGTAACGGG CAGTTAGGCC GTATGCTGCG TCAGGCAGGC GAACCGTTAG GCATTGCTGT CTGGCCAGTC GGGCTGGACG CTGAACCGGC GGCGGTGCCT TTTCAACAAA GCGTGATTAC CGCTGAGATA GAACGCTGGC CGGAAACCGC ATTAACCCGC GAGCTGGCGC GCCATCCGGC CTTTGTGAAC CGCGATGTGT TCCCGATTAT TGCTGACCGT CTGACTCAGA AGCAGCTTTT CGATAAGCTC CACCTGCCGA CTGCACCGTG GCAGTTACTT GCCGAACGCA GCGAGTGGCC TGCGGTGTTT GATCGTTTAG GTGAGCTGGC GATTGTTAAG CGTCGCACTG GTGGTTATGA CGGTCGCGGT CAATGGCGTT TACGCGCAAA TGAAACCGAA CAGTTACCGG CAGAGTGTTA CGGCGAATGT ATTGTCGAGC AGGGCATTAA CTTCTCTGGT GAAGTGTCGC TGGTTGGCGC GCACGGCTTT GATGGCAGCA CCGTGTTTTA TCCGCTGACG CATAACCTGC ATCAGGACGG TATTTTGCGC ACCAGCGTCG CTTTTCCGCA GGCCAACGCA CAGCAGCAGG CGCAAGCCGA AGAGATGCTG TCGGCGATTA TGCAGGAGCT GGGCTATGTG GGCGTGATGG CGATGGAGTG TTTTGTCACC CCGCAAGGTC TGCTGATCAA CGAACTGGCT CCGCGTGTGC ATAACAGCGG TCACTGGACA CAAAACGGTG CCAGCATCAG CCAGTTTGAG CTGCATCTGC GGGCGATTAC CGATCTGCCG TTACCGCAAC CAGTGGTGAA TAATTCGTCG GTGATGATCA ATCTGATTGG TAGCGATGTG AATTATGACT GGCTGAAACT GCCGCTGGTG CATCTGCACT GGTACGACAA AGAAGTCCGT CCGGGCCGTA AAGTGGGGCA TCTGAATTTG ACCGACAGCG ACACATCGCG TCTGTCCGCG ACGCTGGAAG CCTTAATCCC GCTGCTGCCG CCGGAATATG CCAGCGGCGT GATGTGGGCA CAGAGTAAGT TCAGTTAA
Protein sequence | MKQVCVLGNG QLGRMLRQAG EPLGIAVWPV GLDAEPAAVP FQQSVITAEI ERWPETALTR ELARHPAFVN RDVFPIIADR LTQKQLFDKL HLPTAPWQLL AERSEWPAVF DRLGELAIVK RRTGGYDGRG QWRLRANETE QLPAECYGEC IVEQGINFSG EVSLVGAHGF DGSTVFYPLT HNLHQDGILR TSVAFPQANA QQQAQAEEML SAIMQELGYV GVMAMECFVT PQGLLINELA PRVHNSGHWT QNGASISQFE LHLRAITDLP LPQPVVNNSS VMINLIGSDV NYDWLKLPLV HLHWYDKEVR PGRKVGHLNL TDSDTSRLSA TLEALIPLLP PEYASGVMWA QSKFS
| |
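As a rough illustration of how the gene and protein sequences above relate, the sketch below strips the 10-base grouping, translates codons, and computes a GC fraction. It builds the codon table from the standard amino-acid assignments, which translation table 11 shares with the standard code (the two differ in their permitted start codons, which this sketch does not model). The helpers `clean`, `gc_fraction`, and `translate` are hypothetical names, and only the first 30 and last 18 bases of the gene sequence are used to keep the example short.

```python
BASES = "TCAG"
# Amino acids for codons enumerated in TCAG order (first, second, third base).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group bases and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (expects a non-empty sequence)."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 0; '*' marks a stop codon. Trailing partial codons are ignored."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


head = "ATGAAACAGG TTTGCGTCCT CGGTAACGGG"  # first 30 bases of the gene sequence
tail = "CAGAGTAAGT TCAGTTAA"               # last 18 bases of the gene sequence

print(translate(head))  # MKQVCVLGNG
print(translate(tail))  # QSKFS*
print(round(gc_fraction(head), 3))
```

The translated head and tail match the first ten and last five residues of the listed protein sequence, followed by the stop codon. Passing the full gene sequence to `gc_fraction` gives the value to compare against the GC content field.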