Gene BBta_6849 details

Gene Information

Locus tag: BBta_6849
Symbol: purK
ID: 5152229
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bradyrhizobium sp. BTAi1
Kingdom: Bacteria
Replicon accession: NC_009485
Strand:
Start bp: 7172903
End bp: 7174018
Gene Length: 1116 bp
Protein Length: 371 aa
Translation table: 11
GC content: 65%
IMG OID: 640561531
Product: phosphoribosylaminoimidazole carboxylase ATPase subunit
Protein accession: YP_001242642
Protein GI: 148258057
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase)
TIGRFAM ID: [TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein
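
The start and end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the gene record above.
start_bp = 7172903
end_bp = 7174018

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and contributes no residue.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1116
print(protein_length)  # 371
```

Under these assumptions both values agree with the Gene Length and Protein Length fields of the record.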


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value: 0.768888
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGCGGGT CCAATCGGGT GAAGCTGAAG CCCGGCGACA CGATCGGCAT TCTCGGCGGC 
GGCCAGCTCG GCCGCATGCT GGCGCTGGCG GCGGCGCGGC TCGGGTTGCG CTGCCAGGTG
TTCTCGCCGG ACCCGGATTC GCCGGCATTC GATGTCGTCC TGAACGCAAC CTGCGCCGAA
TATGCCGATG TCGAAGCGCT CGAATTGTTC GCCAACGACG TCGATGTCGT GACCTATGAG
TTCGAAAACG TGCCATCGGC CGCTGCCATG GTGCTGGCTG CCAGACGTCC CGTGCTGCCG
AGCCGCTCGG CGCTCGAAAC CACGCAGGAT CGGCTCACGG AAAAGGATTT CGTGACCTCG
CTCGGCATCC GAACGGCCAA TTACGCCGAT GTCTCCTCGC CATCCACGCT GCGCGAAGCG
ATCCTGCGGA TCGGCCTGCC GGCCGTGCTC AAAACCCGCC GCTTCGGCTA TGACGGCAAA
GGTCAGGTGA AGATCCGCCA GGGCGATGAC CTGGAAAAAC TGTGGGCTGA GCTCGGTACC
AAATCCGCCA TTCTGGAAGC CTTCGTCCCG TTCGAGCGCG AGATCTCGGT GATCGCCGCG
CGGGGGGCTG ACGGACAGGT CGAGTGTTTC GACGTCACGG AAAACGAGCA TCGCGATCAC
ATTCTGAAAG TGTCGCAGGC GCCGGCCCGC ATCCCGGACA CGCTGGCCGA CGAGGCTCGC
CACATTGCCA GCCGGATCGC CACCGCGCTC GATTATGTCG GCGTCCTGGC TGTCGAGATG
TTCGTCGTGC CCGGCCCCTC CGGCCCCGGC GTGCTGGTCA ACGAGATTGC CCCACGGGTG
CATAATTCCG GCCACTGGAC CTTGGATGGG GCGTCGATTT CGCAATTCGA GCAGCACATC
CGGGCGATCG CGGGCTGGCC GCTTGGCAAG CCGCTGCGCC ACGGGCAGGT CACGATGACC
AATCTGATCG GCGACGAGAT CAACAGCTAC GAGCAATGGC TCACGGTTCC CGGCGCCACC
GTGCACCTCT ATGGCAAGGG GGCGGCCCGG CCGGGCCGCA AAATGGGGCA TGTCACCCAG
GTGTCCCCGA TGCCGCCAAG GGCGGGCCTG AAATAA
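
The GC content reported for this gene is the percentage of G and C bases in the sequence above. A minimal sketch of that calculation in Python follows; the function name `gc_content` is hypothetical, and the sketch assumes the sequence is pasted as shown here, in space-separated blocks of ten bases.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace and case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = bases.count("G") + bases.count("C")
    return 100.0 * gc_count / len(bases)


# First line of the gene sequence above, spaces included.
first_line = "GTGAGCGGGT CCAATCGGGT GAAGCTGAAG CCCGGCGACA CGATCGGCAT TCTCGGCGGC"
print(round(gc_content(first_line), 1))
```

Passing the full gene sequence instead of its first line gives the whole-gene figure, which can be compared with the GC content field of the record.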
 
Protein sequence
MSGSNRVKLK PGDTIGILGG GQLGRMLALA AARLGLRCQV FSPDPDSPAF DVVLNATCAE 
YADVEALELF ANDVDVVTYE FENVPSAAAM VLAARRPVLP SRSALETTQD RLTEKDFVTS
LGIRTANYAD VSSPSTLREA ILRIGLPAVL KTRRFGYDGK GQVKIRQGDD LEKLWAELGT
KSAILEAFVP FEREISVIAA RGADGQVECF DVTENEHRDH ILKVSQAPAR IPDTLADEAR
HIASRIATAL DYVGVLAVEM FVVPGPSGPG VLVNEIAPRV HNSGHWTLDG ASISQFEQHI
RAIAGWPLGK PLRHGQVTMT NLIGDEINSY EQWLTVPGAT VHLYGKGAAR PGRKMGHVTQ
VSPMPPRAGL K
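
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below assumes the amino acid assignments of translation table 11 (which match the standard genetic code) and assumes that the first codon, here GTG, acts as the initiator and is read as methionine. The names `CODON_TABLE` and `translate` are illustrative only.

```python
# Codon table built from the conventional TCAG ordering of the genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()
    if not seq or len(seq) % 3 != 0:
        raise ValueError("sequence length must be a non-zero multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    # Assumption: the first codon is a valid start codon and encodes Met.
    protein = ["M"]
    for codon in codons[1:]:
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate("GTGAGCGGGT CCAATCGGGT GAAGCTGAAG"))  # MSGSNRVKLK
```

The printed residues match the first block of the protein sequence; the sketch does not handle ambiguous bases such as N.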