Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_3142 |
Symbol | |
ID | 8014045 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 3141264 |
End bp | 3142574 |
Gene Length | 1311 bp |
Protein Length | 436 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644825708 |
Product | guanine deaminase |
Protein accession | YP_002976936 |
Protein GI | 241205840 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | [TIGR02967] guanine deaminase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.14624 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGACGA CACTCCTGCG CGGCCGCCTG CTTTCCTTCC ATCGCGCGCC GCTGAGCCTC GCCGACAGCC AAAGCTATCT TTACGAAGAG GATGGCGGCC TGCTGGTCGA AGATGGGCTG ATCGCGGCGG TCGGCCCCTA TGCCAACGTC AAGGCAAAAG CATCCGCGGA CACGGCCGAG ATCGACCACC GCCCGCATCT GATCATGCCG GGCTTCATCG ACATGCACCT GCATTTTCCG CAGATGCAGG TGATTGCCTC CTACGCCGCC AACCTGCTCG AATGGCTGAA TACCTATACC TTTCCCGAGG AGTGCCGTTT CGTCGAAAGC GCCCATGCCG AAAGGATCGC CACGCATTTC TACGACGAGT TGATCCGCCA CGGCACGACG ACGGCGGTTG CCTATTGCTC CGTGCACAAG ACCTCGGCCG ACGCCTTCTT TGCCGAGGCG ATGAGGCGCA ATATGCGCAT GGTCGGCGGC AAGGTGATGA TGGACCGCAA TGCCCCGCAG GGCCTGCTCG ACACGCCCGA GATGGGCTAT GACGAGACCC GCCAAGTCAT ATCGGACTGG CATGGCAAGG GCCGCAACCA CGTCGCCATC ACCCCGCGCT TCGCCATCAC CTCGACACCG GCGCAGATGG AAGCGACATC AGCGCTTGCC CGTGAATTTC CCGACCTGCA CATCCAGACG CATCTTTCGG AAAACCCCGA CGAGATCAAA TTCACCTGCG AGCTCTATCC CGACGCGCTC GACTATACCG ACATCTATGC CCGCTACGGC CTGCTCGGGC CAAAAAGCCT CTTCGGCCAT TGCATACATC TGTCCGAGCG CGAGGCCGAC GCGATGAGCG AGGCGGGCGC CGTCGCCGTC CACTGCCCGA CCTCGAACCT TTTCCTGGGC TCAGGCCTGT TTCCGCTGAA GGCGCTGGCG CGGCGGGAAA AGCCTGTTCG CATCGGGGTA GCGACCGATA TCGGCGGCGG TTCCAGCTAT TCGATGCTGC GGACGATGGA CGAGGCCTAC AAGATCCAGC AGTTGCTCGG CGAACGGCTG AACCCGCTGG AAAGCTATTA CCTGATGACG CGTGGCAATG CCGAAGCCCT GTCACTGGCG GACCGGATCG GCACACTGGA ACCCGGCACC GAGGCCGATC TCGTCGTTCT CGATGCAACG GCGACACCGG CCATGGCGCT GAAGATGGAA GTGGTGAAGA CGCTGCCCGA AGAACTTTTC CTGCTGCAGA CGATGGGCGA CGACCGAGCG ATCGCCGAGA CCTATGTGGC CGGCATTGCG TTGAAGAAGG AGCTTCAATG A
|
Protein sequence | MTTTLLRGRL LSFHRAPLSL ADSQSYLYEE DGGLLVEDGL IAAVGPYANV KAKASADTAE IDHRPHLIMP GFIDMHLHFP QMQVIASYAA NLLEWLNTYT FPEECRFVES AHAERIATHF YDELIRHGTT TAVAYCSVHK TSADAFFAEA MRRNMRMVGG KVMMDRNAPQ GLLDTPEMGY DETRQVISDW HGKGRNHVAI TPRFAITSTP AQMEATSALA REFPDLHIQT HLSENPDEIK FTCELYPDAL DYTDIYARYG LLGPKSLFGH CIHLSEREAD AMSEAGAVAV HCPTSNLFLG SGLFPLKALA RREKPVRIGV ATDIGGGSSY SMLRTMDEAY KIQQLLGERL NPLESYYLMT RGNAEALSLA DRIGTLEPGT EADLVVLDAT ATPAMALKME VVKTLPEELF LLQTMGDDRA IAETYVAGIA LKKELQ
|
| |