Gene Information |
Locus tag | EcolC_4020 |
Symbol | |
ID | 6064592 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | + |
Start bp | 4427344 |
End bp | 4428633 |
Gene Length | 1290 bp |
Protein Length | 429 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 641603435 |
Product | phosphoribosylamine--glycine ligase |
Protein accession | YP_001726946 |
Protein GI | 170021992 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0151] Phosphoribosylamine-glycine ligase |
TIGRFAM ID | [TIGR00877] phosphoribosylamine--glycine ligase |
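The coordinate and length fields above are consistent with each other, and the relationship can be sketched in a few lines. This is a minimal illustration, assuming the Start bp and End bp values are 1-based and inclusive and that Gene Length counts the stop codon (both assumptions match the numbers in this record but are not stated by it). The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(4427344, 4428633)
print(length_bp)                  # 1290
print(protein_length(length_bp))  # 429
```

Both results agree with the Gene Length (1290 bp) and Protein Length (429 aa) fields of the record.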
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0000927217 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0101467 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAAAGTAT TAGTGATTGG TAACGGCGGG CGCGAGCACG CGCTGGCCTG GAAAGCGGCG CAGTCGCCGC TGGTTGAAAC TGTTTTTGTT GCTCCGGGTA ATGCAGGCAC TGCGCTGGAA CCCGCGCTAC AGAACGTCGC TATCGGCGTA ACCGATATCC CGGCGCTGTT GGATTTCGCA CAAAACGAAA AGATTGATCT GACCATCGTC GGCCCGGAAG CGCCGCTGGT GAAAGGCGTG GTCGATACCT TCCGCGCCGC CGGGCTGAAA ATCTTCGGCC CAACCGCAGG TGCGGCCCAA CTGGAAGGCT CAAAAGCGTT TACCAAAGAT TTCCTGGCCC GCCATAAGAT CCCTACGGCG GAATACCAGA ACTTCACCGA GGTAGAACCT GCGCTGGCGT ATCTGCGTGA GAAAGGCGCG CCAATCGTCA TTAAAGCGGA CGGTCTGGCT GCCGGGAAAG GCGTTATCGT GGCGATGACG CTGGAAGAAG CGGAAGCGGC TGTTCACGAT ATGCTGGCGG GCAACGCTTT TGGCGACGCG GGTCATCGCA TCGTTATCGA AGAGTTCCTC GATGGCGAAG AAGCGAGCTT TATCGTGATG GTGGACGGAG AGCATGTGCT GCCGATGGCT ACCAGCCAGG ATCACAAACG CGTAGGCGAT AAAGATACCG GACCAAACAC CGGCGGGATG GGCGCTTACT CCCCCGCGCC GGTAGTAACC GATGAAGTTC ATCAGCGCAC CATGGAACGT ATCATCTGGC CAACCGTGAA AGGCATGGCG GCGGAAGGCA ACACCTACAC CGGTTTTCTC TACGCGGGCC TGATGATCGA CAAACAGGGC AATCCGAAGG TTATCGAATT TAACTGCCGC TTTGGCGATC CGGAAACCCA GCCGATTATG CTGCGCATGA AGTCCGATCT GGTTGAGCTC TGCCTGGCAG CCTGTGAAGG GAAGCTGGAC GAGAAAACAT CCGAGTGGGA TGAACGCGCT TCTCTCGGCG TGGTGATGGC TGCGGGTGGA TATCCGGGCG ATTACCGCAC CGGTGATGTG ATCCACGGCC TGCCGCTGGA AGAAGTGGAA GACGGCAAAG TGTTCCATGC AGGCACAAAA CTGGCGGATG ACGAGCAGGT AGTGACCAGC GGCGGGCGCG TACTGTGCGT CACCGCGCTG GGTCATACCG TGGCAGAAGC GCAGAAACGC GCTTATGCCT TAATGACCGA TATTCACTGG GACGACTGCT TCTGCCGGAA AGATATCGGC TGGCGCGCCA TCGAACGCGA GCAGAACTAA
Protein sequence | MKVLVIGNGG REHALAWKAA QSPLVETVFV APGNAGTALE PALQNVAIGV TDIPALLDFA QNEKIDLTIV GPEAPLVKGV VDTFRAAGLK IFGPTAGAAQ LEGSKAFTKD FLARHKIPTA EYQNFTEVEP ALAYLREKGA PIVIKADGLA AGKGVIVAMT LEEAEAAVHD MLAGNAFGDA GHRIVIEEFL DGEEASFIVM VDGEHVLPMA TSQDHKRVGD KDTGPNTGGM GAYSPAPVVT DEVHQRTMER IIWPTVKGMA AEGNTYTGFL YAGLMIDKQG NPKVIEFNCR FGDPETQPIM LRMKSDLVEL CLAACEGKLD EKTSEWDERA SLGVVMAAGG YPGDYRTGDV IHGLPLEEVE DGKVFHAGTK LADDEQVVTS GGRVLCVTAL GHTVAEAQKR AYALMTDIHW DDCFCRKDIG WRAIEREQN
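The Gene sequence field is printed in space-separated blocks of 10 bases, so it has to be joined before any computation such as a GC-content check. The sketch below shows one way to do that. The helper names (`clean_sequence`, `gc_fraction`, `ends_with_stop`) are hypothetical, and the stop-codon set is the standard one for translation table 11 (TAA, TAG, TGA). The example uses only the first two blocks of the sequence above, so its GC fraction is not the gene-wide value.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(blocked: str) -> str:
    """Join whitespace-separated blocks into one uppercase sequence string."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop codon."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# First two blocks of the Gene sequence field, used only as a short demo input.
prefix = clean_sequence("ATGAAAGTAT TAGTGATTGG")
print(len(prefix))          # 20
print(gc_fraction(prefix))  # 0.3
```

Passing the full Gene sequence value through `clean_sequence` and `gc_fraction` would give a figure to compare with the GC content field, and `ends_with_stop` can confirm that the sequence terminates in a stop codon.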