Gene Information |
Locus tag | Rleg_3298 |
Symbol | |
ID | 8014183 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 3299127 |
End bp | 3300098 |
Gene Length | 972 bp |
Protein Length | 323 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 644825857 |
Product | NMT1/THI5 like domain protein |
Protein accession | YP_002977084 |
Protein GI | 241205988 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | |
|
|
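The Start bp, End bp, Gene Length, and Protein Length values above can be cross-checked against each other. The following is a minimal Python sketch of that check, assuming the coordinates are 1-based and inclusive; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 3299127
end_bp = 3300098
protein_length_aa = 323

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# A CDS has one codon per residue plus one terminating stop codon.
expected_gene_length_bp = (protein_length_aa + 1) * 3

print(gene_length_bp, expected_gene_length_bp)
```

Both values come out equal to the listed Gene Length of 972 bp, so the coordinates and the two lengths agree under that assumption.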
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.141874 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGAAAGT TGATGGTTGC AATGATGGCG AGCGCGATGT CGCTTGCATC GGCCCATGCG ATGGCCGCCG ACAAGGTGGT GCTGCAGCTG AAATGGGTCA CGCAGAGCCA GTTCGGCGGT TATTACGTCG CCAAGGAAAA GGGCTTCTAT AAGGAGGAAG GCCTCGACGT CGACATCAAG CCGGGCGGCC CTGATATCGC CCCCGAGCAG GTGATCGCCG GCGGCGGCGC CGATGTCATC GTAGACTGGA TGGGCGGTGC CCTGGTTGCC CGCGAAAAGG GCGTTCCGCT CGTCAACATC GCCCAGCCCT ATCAGAAGGC GGGCCTGGAA ATGGTCTGCC GCAAGGACGG CCCGATCAAG ACCGAAGCCG ACTTCAAGGG CCACACGCTC GGCGTCTGGT TCTTCGGCAA CGAGTATCCC TTCTTCGCCT GGATGAACAA GCTCGGCCTG TCCACAGAAG GCGGTCCGAA CGGCGTCACC GTGTTGAAGC AGAGCTTCGA TGTGCAGCCG CTTGTCCAGA AGCAGGCCGA CTGCATCTCT GTCATGACCT ATAACGAATA TTGGCAGGCG ATCGATGCCG GCTTCAAGCC GGAAGAACTG ACGGTCTTCA ACTACACGGA AATGGGCAAC GACCTTCTTG AAGACGGCCT CTATGCGATG GAAGACAAGC TGAAAGATCC GGCCTTCAAG GAGAAGATGG TCAAGTTCGT CCGCGCATCG ATGAAGGGCT GGAAATATGC CACCGAGAAT CCCGACGAAG CCGCCGAGAT CGTCATGGAT AATGGCGGCC AGGACGACAA CCATCAGAAG CGCATGATGG GCGAAGTCGC CAAGCTGGTC GGCGACAGCT CCGGCAAGCT GGACGAGGCG CTCTATGCCC GCACGGCAAA GGCGCTGCTC GACCAGAAGA TCATAAGCAA GGAGCCGTCG GGCGCCTGGA CGCACGATAT CACCGACGCC GCTTCCAAGT AG
|
Protein sequence | MRKLMVAMMA SAMSLASAHA MAADKVVLQL KWVTQSQFGG YYVAKEKGFY KEEGLDVDIK PGGPDIAPEQ VIAGGGADVI VDWMGGALVA REKGVPLVNI AQPYQKAGLE MVCRKDGPIK TEADFKGHTL GVWFFGNEYP FFAWMNKLGL STEGGPNGVT VLKQSFDVQP LVQKQADCIS VMTYNEYWQA IDAGFKPEEL TVFNYTEMGN DLLEDGLYAM EDKLKDPAFK EKMVKFVRAS MKGWKYATEN PDEAAEIVMD NGGQDDNHQK RMMGEVAKLV GDSSGKLDEA LYARTAKALL DQKIISKEPS GAWTHDITDA ASK
|
| |
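The Gene sequence and Protein sequence rows can be compared by translating the nucleotide sequence codon by codon. The sketch below is illustrative only: it assumes that translation table 11 assigns the same amino acids as the standard code (the two differ only in which codons may act as start codons), and the helper names `clean` and `translate` are hypothetical. It uses only the first three and the final two sequence blocks shown above rather than the full sequence.

```python
# Standard codon table in TCAG order. Assumption: translation table 11 assigns
# the same amino acids as the standard code and differs only in start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces that separate the 10-base blocks."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks and final two blocks of the Gene sequence row.
first_blocks = clean("ATGAGAAAGT TGATGGTTGC AATGATGGCG")
last_blocks = clean("GCTTCCAAGT AG")

print(translate(first_blocks))
print(translate(last_blocks))
```

The two printed strings can be compared with the start and end of the Protein sequence row. Passing the full Gene sequence through `clean` and `translate` would extend the same comparison to the whole protein.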