Gene Avi_3684 details


Gene Information

Locus tag: Avi_3684
Symbol: purK
ID: 7388082
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Agrobacterium vitis S4
Kingdom: Bacteria
Replicon accession: NC_011989
Strand:
Start bp: 3053266
End bp: 3054321
Gene Length: 1056 bp
Protein Length: 351 aa
Translation table: 11
GC content: 63%
IMG OID: 643652479
Product: phosphoribosylaminoimidazole carboxylase ATPase subunit
Protein accession: YP_002550661
Protein GI: 222149704
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase)
TIGRFAM ID: [TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein
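
The Start bp, End bp, Gene Length, and Protein Length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3053266
end_bp = 3054321

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and ends with a stop codon,
# which does not contribute a residue to the protein.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp")     # 1056 bp
print(protein_length, "aa")  # 351 aa
```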


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0394061
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAAGACCA TTGGAATTAT CGGCGGCGGC CAGCTTGGCC GCATGCTTGC TATGGCCGCA 
GCAAGGCTGG GGCTGAAGAC CATCGTGCTG GAACCGCAGG CCGACTGTCC GGCCGCCCAG
GTTGCCAACC GGCAGATATC AGCCAATTAC GCTGACGAGG CAGCCCTTGC CGAACTGGCC
GCCAGTTGCG ACGTGGTCAC CTACGAGTTT GAAAACGTGC CGGTCGCCGC CGCCGAAGCC
CTGGCGCGCA GCGTGCCGGT CTACCCCCCA GCCAAGGCAC TTGCCGTGTC CCAGGACAGG
TTGAGCGAAA AGCGCTTTCT CAACGACAAT GGCATTGCCA CCGCCCCGTT TCGCACCGTC
GACAGCCAGG CGGACCTGGA GCAGGCATTG GCCGAATTCG GTGGCGAAGG CGTGCTGAAA
ACCCGCCGCT TCGGCTATGA CGGCAAGGGC CAGCGGGTCT ATCGCAAGGG TGACGATGCC
AGCGGCGGCT ATCAGGCGCT TGGTGCCGTG CCGCTGATTC TCGAAGGTTT CGTGCCGTTT
GCGCGGGAAA TCTCGATCAT CGCCGCCCGT TCGATATCAG GCGAAATCGC CTGCTACGAT
CCGGCTGAAA ATATTCACCG CGACGGCATC CTGCACACCT CCACCGTGCC TGCGTCGATT
TCGCCGCAGA CCGCCGCTGC GGCCAAAAGT GCGGCGGAAA AGCTGCTGAC CGGCCTCGAC
TATGTCGGCG TGGTCGGGCT CGAACTGTTC CTGATGGCCG ATGGCAGCCT GATCGCCAAT
GAAATGGCCC CTCGGGTCCA CAATTCCGGC CATTGGACGG AAGCGGCCTG CGTGATCTCG
CAATTCGAAC AACATATCCG CGCCGTTACC GGGTTGCCAC TGGGCGATCC GGCCCGCCAC
GGCGATTGCG TCATGACCAA TCTGATCGGC GACGATATTG TCGCCGTTCC AGATTGGCTG
GCGAAACCCG GTGCGCTCGT CCATCTCTAT GGCAAGGCTG AAGCACGGCC CGGCCGCAAA
ATGGGACATG TCACCGAGGT CAGGAACCGG GAATAA
 
Protein sequence
MKTIGIIGGG QLGRMLAMAA ARLGLKTIVL EPQADCPAAQ VANRQISANY ADEAALAELA 
ASCDVVTYEF ENVPVAAAEA LARSVPVYPP AKALAVSQDR LSEKRFLNDN GIATAPFRTV
DSQADLEQAL AEFGGEGVLK TRRFGYDGKG QRVYRKGDDA SGGYQALGAV PLILEGFVPF
AREISIIAAR SISGEIACYD PAENIHRDGI LHTSTVPASI SPQTAAAAKS AAEKLLTGLD
YVGVVGLELF LMADGSLIAN EMAPRVHNSG HWTEAACVIS QFEQHIRAVT GLPLGDPARH
GDCVMTNLIG DDIVAVPDWL AKPGALVHLY GKAEARPGRK MGHVTEVRNR E
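
The gene begins with GTG while the protein begins with M. That is expected under translation table 11 (the bacterial, archaeal and plant plastid code), where GTG is one of the permitted start codons and an initiating codon is read as methionine. The following is a minimal sketch of that translation in plain Python, not the tool that produced this record. The helper names `translate` and `gc_fraction` are hypothetical, and the example input is only the first 60 bases of the gene sequence above.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G; third position varies fastest).
# Table 11 uses the same codon-to-residue assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna):
    """Translate a CDS; whitespace between 10-base blocks is ignored."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiating codon is read as methionine
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    seq = "".join(dna.split()).upper()
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 60 bases of the gene sequence above.
prefix = "GTGAAGACCA TTGGAATTAT CGGCGGCGGC CAGCTTGGCC GCATGCTTGC TATGGCCGCA"
print(translate(prefix))  # MKTIGIIGGGQLGRMLAMAA
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed above, and `gc_fraction` on the same input can be compared with the GC content field.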