Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avi_3684 |
Symbol | purK |
ID | 7388082 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Agrobacterium vitis S4 |
Kingdom | Bacteria |
Replicon accession | NC_011989 |
Strand | + |
Start bp | 3053266 |
End bp | 3054321 |
Gene Length | 1056 bp |
Protein Length | 351 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643652479 |
Product | phosphoribosylaminoimidazole carboxylase ATPase subunit |
Protein accession | YP_002550661 |
Protein GI | 222149704 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase) |
TIGRFAM ID | [TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0394061 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAAGACCA TTGGAATTAT CGGCGGCGGC CAGCTTGGCC GCATGCTTGC TATGGCCGCA GCAAGGCTGG GGCTGAAGAC CATCGTGCTG GAACCGCAGG CCGACTGTCC GGCCGCCCAG GTTGCCAACC GGCAGATATC AGCCAATTAC GCTGACGAGG CAGCCCTTGC CGAACTGGCC GCCAGTTGCG ACGTGGTCAC CTACGAGTTT GAAAACGTGC CGGTCGCCGC CGCCGAAGCC CTGGCGCGCA GCGTGCCGGT CTACCCCCCA GCCAAGGCAC TTGCCGTGTC CCAGGACAGG TTGAGCGAAA AGCGCTTTCT CAACGACAAT GGCATTGCCA CCGCCCCGTT TCGCACCGTC GACAGCCAGG CGGACCTGGA GCAGGCATTG GCCGAATTCG GTGGCGAAGG CGTGCTGAAA ACCCGCCGCT TCGGCTATGA CGGCAAGGGC CAGCGGGTCT ATCGCAAGGG TGACGATGCC AGCGGCGGCT ATCAGGCGCT TGGTGCCGTG CCGCTGATTC TCGAAGGTTT CGTGCCGTTT GCGCGGGAAA TCTCGATCAT CGCCGCCCGT TCGATATCAG GCGAAATCGC CTGCTACGAT CCGGCTGAAA ATATTCACCG CGACGGCATC CTGCACACCT CCACCGTGCC TGCGTCGATT TCGCCGCAGA CCGCCGCTGC GGCCAAAAGT GCGGCGGAAA AGCTGCTGAC CGGCCTCGAC TATGTCGGCG TGGTCGGGCT CGAACTGTTC CTGATGGCCG ATGGCAGCCT GATCGCCAAT GAAATGGCCC CTCGGGTCCA CAATTCCGGC CATTGGACGG AAGCGGCCTG CGTGATCTCG CAATTCGAAC AACATATCCG CGCCGTTACC GGGTTGCCAC TGGGCGATCC GGCCCGCCAC GGCGATTGCG TCATGACCAA TCTGATCGGC GACGATATTG TCGCCGTTCC AGATTGGCTG GCGAAACCCG GTGCGCTCGT CCATCTCTAT GGCAAGGCTG AAGCACGGCC CGGCCGCAAA ATGGGACATG TCACCGAGGT CAGGAACCGG GAATAA
|
Protein sequence | MKTIGIIGGG QLGRMLAMAA ARLGLKTIVL EPQADCPAAQ VANRQISANY ADEAALAELA ASCDVVTYEF ENVPVAAAEA LARSVPVYPP AKALAVSQDR LSEKRFLNDN GIATAPFRTV DSQADLEQAL AEFGGEGVLK TRRFGYDGKG QRVYRKGDDA SGGYQALGAV PLILEGFVPF AREISIIAAR SISGEIACYD PAENIHRDGI LHTSTVPASI SPQTAAAAKS AAEKLLTGLD YVGVVGLELF LMADGSLIAN EMAPRVHNSG HWTEAACVIS QFEQHIRAVT GLPLGDPARH GDCVMTNLIG DDIVAVPDWL AKPGALVHLY GKAEARPGRK MGHVTEVRNR E
|
| |