Gene Rleg_3142 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_3142 
Symbol 
ID8014045 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp3141264 
End bp3142574 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content62% 
IMG OID644825708 
Productguanine deaminase 
Protein accessionYP_002976936 
Protein GI241205840 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID[TIGR02967] guanine deaminase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.14624 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGACGA CACTCCTGCG CGGCCGCCTG CTTTCCTTCC ATCGCGCGCC GCTGAGCCTC 
GCCGACAGCC AAAGCTATCT TTACGAAGAG GATGGCGGCC TGCTGGTCGA AGATGGGCTG
ATCGCGGCGG TCGGCCCCTA TGCCAACGTC AAGGCAAAAG CATCCGCGGA CACGGCCGAG
ATCGACCACC GCCCGCATCT GATCATGCCG GGCTTCATCG ACATGCACCT GCATTTTCCG
CAGATGCAGG TGATTGCCTC CTACGCCGCC AACCTGCTCG AATGGCTGAA TACCTATACC
TTTCCCGAGG AGTGCCGTTT CGTCGAAAGC GCCCATGCCG AAAGGATCGC CACGCATTTC
TACGACGAGT TGATCCGCCA CGGCACGACG ACGGCGGTTG CCTATTGCTC CGTGCACAAG
ACCTCGGCCG ACGCCTTCTT TGCCGAGGCG ATGAGGCGCA ATATGCGCAT GGTCGGCGGC
AAGGTGATGA TGGACCGCAA TGCCCCGCAG GGCCTGCTCG ACACGCCCGA GATGGGCTAT
GACGAGACCC GCCAAGTCAT ATCGGACTGG CATGGCAAGG GCCGCAACCA CGTCGCCATC
ACCCCGCGCT TCGCCATCAC CTCGACACCG GCGCAGATGG AAGCGACATC AGCGCTTGCC
CGTGAATTTC CCGACCTGCA CATCCAGACG CATCTTTCGG AAAACCCCGA CGAGATCAAA
TTCACCTGCG AGCTCTATCC CGACGCGCTC GACTATACCG ACATCTATGC CCGCTACGGC
CTGCTCGGGC CAAAAAGCCT CTTCGGCCAT TGCATACATC TGTCCGAGCG CGAGGCCGAC
GCGATGAGCG AGGCGGGCGC CGTCGCCGTC CACTGCCCGA CCTCGAACCT TTTCCTGGGC
TCAGGCCTGT TTCCGCTGAA GGCGCTGGCG CGGCGGGAAA AGCCTGTTCG CATCGGGGTA
GCGACCGATA TCGGCGGCGG TTCCAGCTAT TCGATGCTGC GGACGATGGA CGAGGCCTAC
AAGATCCAGC AGTTGCTCGG CGAACGGCTG AACCCGCTGG AAAGCTATTA CCTGATGACG
CGTGGCAATG CCGAAGCCCT GTCACTGGCG GACCGGATCG GCACACTGGA ACCCGGCACC
GAGGCCGATC TCGTCGTTCT CGATGCAACG GCGACACCGG CCATGGCGCT GAAGATGGAA
GTGGTGAAGA CGCTGCCCGA AGAACTTTTC CTGCTGCAGA CGATGGGCGA CGACCGAGCG
ATCGCCGAGA CCTATGTGGC CGGCATTGCG TTGAAGAAGG AGCTTCAATG A
 
Protein sequence
MTTTLLRGRL LSFHRAPLSL ADSQSYLYEE DGGLLVEDGL IAAVGPYANV KAKASADTAE 
IDHRPHLIMP GFIDMHLHFP QMQVIASYAA NLLEWLNTYT FPEECRFVES AHAERIATHF
YDELIRHGTT TAVAYCSVHK TSADAFFAEA MRRNMRMVGG KVMMDRNAPQ GLLDTPEMGY
DETRQVISDW HGKGRNHVAI TPRFAITSTP AQMEATSALA REFPDLHIQT HLSENPDEIK
FTCELYPDAL DYTDIYARYG LLGPKSLFGH CIHLSEREAD AMSEAGAVAV HCPTSNLFLG
SGLFPLKALA RREKPVRIGV ATDIGGGSSY SMLRTMDEAY KIQQLLGERL NPLESYYLMT
RGNAEALSLA DRIGTLEPGT EADLVVLDAT ATPAMALKME VVKTLPEELF LLQTMGDDRA
IAETYVAGIA LKKELQ