Gene Rleg_4571 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_4571 
SymbolguaA 
ID8015323 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp4696598 
End bp4698160 
Gene Length1563 bp 
Protein Length520 aa 
Translation table11 
GC content65% 
IMG OID644827148 
ProductGMP synthase 
Protein accessionYP_002978348 
Protein GI241207252 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0519] GMP synthase, PP-ATPase domain/subunit 
TIGRFAM ID[TIGR00884] GMP synthase (glutamine-hydrolyzing), C-terminal domain or B subunit
[TIGR00888] GMP synthase (glutamine-hydrolyzing), N-terminal domain or A subunit 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.855441 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.321233 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCAGA CAGCACATCC CGACTCCGTT CTCATCGTCG ATTTCGGCAG CCAGGTGACC 
CAGCTCATCG CACGACGCGT GCGCGAGGCC GGTGTCTATT GCGAGATCGT TCCCTTCCAA
TCGGCCGAAG AGGGCTTCCA CCGCCTGCAG CCGAAGGCCG TGATCCTGTC CGGCAGCCCG
GCTTCCACGG TGGACGAGGG ATCGCCGCGA GCGCCTAACA TCATCTTCGA GAGCGGCCTG
CCGGTGTTCG GCATCTGCTA CGGCCAGCAG ACGATGTGCA TGCAGCTCGG CGGCAAGGTC
GAGAGCGGCC ATCACCGCGA ATTCGGCCGC GCCTTCCTCG AGGTCGACAG GGACTGCCAG
CTGTTCGAGG GCCTCTGGTC CTCCGGCTCG CGCCACCAAG TCTGGATGAG CCATGGCGAC
CGCGTCACCG CGCTGCCGGA TGGTTTCGAG GTGGTCGCCA CCTCCTCCAA CGCACCCTAT
GCCTTCATCG CCGACGAGAA GCGCAAATAT TACGGCGTGC AGTTCCACCC CGAGGTCGTG
CATACGCCTG ATGGCGCCAA GCTGATCGGC AACTTCATTC ACAATATTGC CGGCCTCAAG
GGCGACTGGT CGATGTCAGC CTATCGCCAG AAGGCGGTCG AGCAGATCCG CGAACAGGTG
GGCGACAAGC GCGTCATCTG CGCGCTTTCG GGCGGCGTCG ACAGTTCCGT CGCAGCGCTG
TTGATCCACG AGGCCGTCGG CGACCAGCTG ACCTGCATCC TCGTCGACCA CGGGCTGATG
CGCAAGGACG AGGCGGCCGG CGTCGTCGCC ATGTTCCGCG AGCACTACAA TCTGCACCTG
TTGCACGTCG ATGCGGCCGA TCGCTTCATC GGCGAACTCG AGGGTGTCAG CGACCCGGAA
ACCAAGCGCA AGATCATCGG CCGGCTGTTC ATCGAGACCT TCGAGGAAGA GGCAAAGAAG
CTCGGCGGCG CCGACTTCCT CGGCCAGGGC ACGCTTTATC CCGACGTGAT CGAGAGCGTT
TCCTTCACCG GCGGCCCGTC GGTGACGATC AAGTCGCACC ACAATGTCGG CGGTCTGCCG
GAGCGCATGA AGATGCAGCT CGTCGAGCCG CTGCGCGAGC TCTTCAAGGA CGAGGTGCGC
GCACTCGGCC GCGAACTCGG CTTGCCCGAC AGCTTCATCG GCCGTCACCC CTTCCCAGGC
CCGGGCCTGG CGATCCGTTG CCCCGGCGGC ATCACCCGCG AAAAGCTGGA GATCCTGCGC
GAGGCCGATG CGATCTATCT CGACGAAATC CGTAAGGCCG GCCTCTACGA CGCCATCTGG
CAGGCCTTCG CCGTGCTGCT CCCCGTCCAG ACCGTCGGCG TCATGGGCGA TGGGCGCACC
TACGAATTCG TCTGTGCGCT GCGCGCCGTC ACCTCCGTCG ACGGCATGAC GGCGGACTTC
TACCACTACG ACATGGAATT CCTCGGCCGC GCCGCCACCC GCATCATCAA CGAAGTGCGC
GGCATCAACC GCGTGGTTTA TGATGTGACG AGCAAGCCGC CCGGCACGAT CGAGTGGGAG
TGA
 
Protein sequence
MTQTAHPDSV LIVDFGSQVT QLIARRVREA GVYCEIVPFQ SAEEGFHRLQ PKAVILSGSP 
ASTVDEGSPR APNIIFESGL PVFGICYGQQ TMCMQLGGKV ESGHHREFGR AFLEVDRDCQ
LFEGLWSSGS RHQVWMSHGD RVTALPDGFE VVATSSNAPY AFIADEKRKY YGVQFHPEVV
HTPDGAKLIG NFIHNIAGLK GDWSMSAYRQ KAVEQIREQV GDKRVICALS GGVDSSVAAL
LIHEAVGDQL TCILVDHGLM RKDEAAGVVA MFREHYNLHL LHVDAADRFI GELEGVSDPE
TKRKIIGRLF IETFEEEAKK LGGADFLGQG TLYPDVIESV SFTGGPSVTI KSHHNVGGLP
ERMKMQLVEP LRELFKDEVR ALGRELGLPD SFIGRHPFPG PGLAIRCPGG ITREKLEILR
EADAIYLDEI RKAGLYDAIW QAFAVLLPVQ TVGVMGDGRT YEFVCALRAV TSVDGMTADF
YHYDMEFLGR AATRIINEVR GINRVVYDVT SKPPGTIEWE