Gene Rleg_3574 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_3574 
Symbol 
ID8014433 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp3608981 
End bp3610057 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content64% 
IMG OID644826139 
Productphosphoribosylaminoimidazole carboxylase ATPase subunit 
Protein accessionYP_002977359 
Protein GI241206263 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase) 
TIGRFAM ID[TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.416397 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGCAA CAACGATCGG CATTATCGGC GGCGGCCAGC TCGGCCGCAT GCTGGCGATT 
GCCGCCGCCA GGTTGAATTT TCGCACGGTC ATCCTCGAGC CGCAGGCGGA CTGCCCGGCC
GCCCAGCTCG CCAACCGGCA GATTACCGCT GCCTATGACG ATCCGGCGGC ACTCGCCGAA
CTCGCCGATA TCTGCGATGT CGTCACCTAC GAATTCGAAA ACGTGCCTGT CGCAGCCGCC
GAAAAGCTCT CGGCGAGCGT GTCGGTCTAT CCGCCGCCGA AGGCGCTGGA AGCCGCCCAG
GACCGTCTCG TCGAAAAACG CTTTCTCAAC GGCTGCGGCA TAACCACTGC ACGCTTCCAT
GCGATCCACA GCCAGGCCGA TCTCGAAACG GCGCTGAAGG ATTTCGGCGG CCAGGGCGTG
CTGAAGACCC GCCGTCTCGG TTATGACGGC AAGGGCCAGA AGGTTTTCCG CTCGGCGGCC
GACAGCCCGG ATGGCACCTA TGCAGCACTT GGCGGCGTGC CGCTCATTCT CGAAAGCTTC
GTCGCCTTCG AGCGTGAAGT CTCGATCATC GCCGCCCGCG CCACCGACGG TACGGTCGTC
TGCTTCGATC CCGCCGAGAA TGTCCACCGC AACGGCATCC TCCACACCTC GACGGTTCCC
GCCGCGATCT CGGCGCCGAC GGCGGACGCC GCGCGGAAAT CGGCCGAGAA AATCCTTGCC
GCATTGAACT ATGTCGGCGT CATCGGCATC GAATTCTTTG TGCTTGCCGA TGGCGGTCTG
ATCGCCAACG AGATGGCGCC GCGCGTCCAC AACTCCGGTC ACTGGACGGA AGCCGCCTGC
GTCGTCAGCC AGTTCGAGCA GCATATCCGC GCCGTCACCG GCCTGCCGCT TGGCAATGCC
GAGCGACATT CCGACTGCGT CATGCAGAAC CTGATCGGCG ACGATATCCT TGCCGTTCCC
GACTGGCTGC GGCGCCCCGA CACGCTGGTT CATCTCTACG GCAAGACCGA GTGGCGCCCC
GGCCGCAAGA TGGGTCATGT CACCACCGTG ACGCCGAAAT CGCCGGTTTG GACCTGA
 
Protein sequence
MTATTIGIIG GGQLGRMLAI AAARLNFRTV ILEPQADCPA AQLANRQITA AYDDPAALAE 
LADICDVVTY EFENVPVAAA EKLSASVSVY PPPKALEAAQ DRLVEKRFLN GCGITTARFH
AIHSQADLET ALKDFGGQGV LKTRRLGYDG KGQKVFRSAA DSPDGTYAAL GGVPLILESF
VAFEREVSII AARATDGTVV CFDPAENVHR NGILHTSTVP AAISAPTADA ARKSAEKILA
ALNYVGVIGI EFFVLADGGL IANEMAPRVH NSGHWTEAAC VVSQFEQHIR AVTGLPLGNA
ERHSDCVMQN LIGDDILAVP DWLRRPDTLV HLYGKTEWRP GRKMGHVTTV TPKSPVWT