Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_3303 |
Symbol | |
ID | 6982056 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | + |
Start bp | 3391128 |
End bp | 3393107 |
Gene Length | 1980 bp |
Protein Length | 659 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643398020 |
Product | 5'-Nucleotidase domain protein |
Protein accession | YP_002282796 |
Protein GI | 209550879 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0737] 5'-nucleotidase/2',3'-cyclic phosphodiesterase and related esterases |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.737364 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGAAGT CTTTCAGCTT CGGTCTTTTG ACCGCGTCCA TGCTGGCGCT GAGCACGGGC GCCGCTTTTG CGGATTACGA ACTCAATATT CTTCATATCA ACGATTTCCA TTCGCGCATC GAATCGATCA ACAAGTTCGA CTCCACCTGC TCGGCCGAGG AAGAAGGCAA GAAGGAATGC TTCGGCGGTG CTGCCCGCCT GAAGACCGCG ATCGACCAGC GCCGTCAGGC GCTTTCCGGC AAGAATGTCC TCCTGCTGAA TGCCGGCGAC AATTTCCAGG GCTCGCTGTT CTACACGACC TACAAGGGCG CGGCCGAAGC CGAATTCCTC AACCTGATGA AGTTCGACGC CATGACCGTC GGCAACCATG AATTCGACGA CAGCGAGGAC GGGCTTGCGA CCTTCCTCGA CAAGGTGCAA TTCCCCGTCG TTACCGCAAA CGTCAAGGCG GCAGCCGCCT CAAAGCTCGG CGACCGCATC AAGCCCTCGC TGGTGCTCGA TGTCGGTGGC CAGAAGATCG GCATCGTCGG CGCCGTCACC AATGACACGG CCGAACTTTC CTCCCCCGGC CCGAATGTCA CGATATCAGA TGACGTCCAG GCCATTACGT CAGCCGTTCA GGATCTGAAG GGCCAGGGCG TCAACAAGAT CATCGCGCTG ACCCATGTCG GTTATCCCCG CGATCTCGCT TTGATCGCCA AGATCCCGGA CGTCGACGTC GTCGTCGGCG GCCACTCCCA CAGCCTGCTC TCCAATACCG ACCCCAAGGC CGAAGGCCCC TATCCGACGA TGGTCGACAA TCCGGGCGGC TACAAGGTGC CGGTCGTCCA GGCCGCCTCC TACAGCAAGT ATCTCGGCGA TCTCGTCGTC AATTTCGACG ATAGCGGCGT CGTCAAGGAT GCCAAGGGCG ATCCGATCCT GATCGATTCC AGCTTTACGC CCGATCCGGC TGTCGTTGCC CGCATCGCGG AACTGGCCAA ACCGATCGAG GAACTGCGCA AGAAGGTCAT CGGCTCCTCC GACGCCCCGA TCGACGGCGA CCGCAAGGTC TGCCGCGTCA AGGAATGCTC GATGGGCAAT CTGGTGGCCG ATGCCATGCT CGACCGCACG AAGAACCAGG GTGTTGCCAT CGCTTTCCAG AACGGCGGCG GCCTGCGCGC TTCGATCGAT GGCGGTGAGG TCACCCAGGG CGAAGTCATC ACCGTCCTGC CCTTCCAGAA CACGCTCGCC ACCTTCGAGG CCACCGGCGC AGATGTTGTC AAGGCGCTCG AAAACGGCGT CAGCCAGATC GATCAGGGCG CCGGCCGCTT CCCGCAGGTC GCCGGCCTGA AATTCTCCTT CGACCAGTCC AAGCCTGTCG GCAGCCGCGT CAGCGATGTC CAGGTGAAGG AGGGCGACAC CTTCGCTCCG ATCGACCAGG CCAAGACTTA TAAGGTCGCC ACCAACAACT TCATGCGGGC CGGCGGCGAC GGTTATTCGA TCTTCAAGGA AGGCAAGAAC GCCTATGATT TCGGTCCGGA TCTGGCCGAC GTGACCGCCG AATATCTGGC CGCCCACTCG CCCTATAAGC CCTATACCGA CGGCCGCATC ACCGAAAACG GCGCGGCAGT CGCGCAGGCA CCGGCTTCCG AACCGGCCGC TCCCGCACCA GCAACGCCGG CAGCCCCTGC ACCAACCACA GAACCTGCCC CCGCTCCTGC TGCACCGGCA ACGCCTGCGC CTGCGGCACC GGCAGCACCA GCGCCCGCAA CACCAGCAAC GCCCGCGCCC GCCGCGGAGC CGACACCGGC GCCGGCAGCC TCCGCACCTG CCGGAACGAC GCCGTCCACC CACGTCATCG CCGCCGGCGA CACCTTCTGG GATCTCGCCG TGACCTTCTA TGGCGACGGC ACGCTGTGGC GGAAGCTTTC GGAGGCCAAC GGCAGCCCGA ACCCGCGTCA CCTGACGGTC GGCAAGGAGA TCGAGGTTCC CGCCAAGTAA
|
Protein sequence | MTKSFSFGLL TASMLALSTG AAFADYELNI LHINDFHSRI ESINKFDSTC SAEEEGKKEC FGGAARLKTA IDQRRQALSG KNVLLLNAGD NFQGSLFYTT YKGAAEAEFL NLMKFDAMTV GNHEFDDSED GLATFLDKVQ FPVVTANVKA AAASKLGDRI KPSLVLDVGG QKIGIVGAVT NDTAELSSPG PNVTISDDVQ AITSAVQDLK GQGVNKIIAL THVGYPRDLA LIAKIPDVDV VVGGHSHSLL SNTDPKAEGP YPTMVDNPGG YKVPVVQAAS YSKYLGDLVV NFDDSGVVKD AKGDPILIDS SFTPDPAVVA RIAELAKPIE ELRKKVIGSS DAPIDGDRKV CRVKECSMGN LVADAMLDRT KNQGVAIAFQ NGGGLRASID GGEVTQGEVI TVLPFQNTLA TFEATGADVV KALENGVSQI DQGAGRFPQV AGLKFSFDQS KPVGSRVSDV QVKEGDTFAP IDQAKTYKVA TNNFMRAGGD GYSIFKEGKN AYDFGPDLAD VTAEYLAAHS PYKPYTDGRI TENGAAVAQA PASEPAAPAP ATPAAPAPTT EPAPAPAAPA TPAPAAPAAP APATPATPAP AAEPTPAPAA SAPAGTTPST HVIAAGDTFW DLAVTFYGDG TLWRKLSEAN GSPNPRHLTV GKEIEVPAK
|
| |