Gene Information |
Locus tag | Rleg_3961 |
Symbol | |
ID | 8014775 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 4037608 |
End bp | 4038684 |
Gene Length | 1077 bp |
Protein Length | 358 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644826530 |
Product | GumN family protein |
Protein accession | YP_002977741 |
Protein GI | 241206645 |
COG category | [S] Function unknown |
COG ID | [COG3735] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
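The coordinate and length fields above are mutually consistent, which a short sketch can confirm. It assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon not counted in the protein length. The function names are illustrative only and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count, assuming a complete CDS with one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4037608, 4038684)
print(length)                  # 1077
print(protein_length(length))  # 358
```

Both values match the Gene Length (1077 bp) and Protein Length (358 aa) rows of the record.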
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0233081 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGACAT CAATCGGCCG CCGGACGTCT TACCTCGCAG TGCCGGCCAA TCTGTTGCTC TGGCTGATCG CAGCCTTTCA CATGCTGCTT CTTGCCGCCC TGTTTGCCGC CTTCCTGACG GCAAGGCCGG CAGCGGCCGA AGACGTCGCC TGCACCGGCC GCAATCTGAT GGTCGAGCTG CAGCAGAACG ACCCTGCCCG CTACGCAGAG GCGCTGAAGG AAGCCGACGC CACGCCGAAC GGCAAGGGTA TCTTCTGGAA GATCGAGAAG CCGGGATTGG CACCCTCGTG GCTGCTCGGC AGCATGCATG TCACCGATCC GCGCGTGCTG GCCCTGCCGC CGCGCGCCCA GGCAGCCCAC GATGCCGCCG ACACGATCAT CATCGAATCC GACGAGATCC TCGATGAGCG GAAGGCGACC GCCGCCCTGC TTGCAAAGCC GGAACTGACG ATGTTCACCG ACGGCACGAC GATCGACAAG CTGCTTTCTC CCGAGGACTA CAAGCGTCTC GAAACCGGCC TCAAGCAGCG CGGTATCCCG ATCAGTACCG TTTCCCGGAT GCGGCCCTGG ATGATTTCCA GCGCCGTCGC CCTGCCGGCC TGCGAAATCG CCCGCAAGGC AAAAGGCGCG CAGTTCCTCG ACCAGAAGAT CGCCACCGAT GCCATTGCTC AGGGCAAACA GGTCAAGGGG CTGGAAACCC TTGCCGAGCA GATCCAGGCC ATGGCCGATC TGCCGGTCGA ATTCCATCTG AAATCGCTGA TCGAGACGCT GGAACTCGGC GACAAGATGA GCGATGTCGT CGAGACGATG ACCGACCTCT ACCTCTCGGG TGATATCGGC ATGACCATGC CGATGCTGAA AACCGTGACA CCGGAGGAGG AAGGTGAAAA CAGCGATTAT GCCGCCTTCG AGCAGCGCGT CATCCTTGAC CGCAACAAGG TGATGGCCGA GCGCGCAGCG CCCATCCTCG ACAGCGGCAA CGTCTTCATG GCCGTCGGTG CCCTGCATCT GCCCGGCAAG GACGGCGTCA TCGAACTGCT GCGCCAGCAG GGCTTCACCG TAACAGATGT AAATTAA
|
Protein sequence | MTTSIGRRTS YLAVPANLLL WLIAAFHMLL LAALFAAFLT ARPAAAEDVA CTGRNLMVEL QQNDPARYAE ALKEADATPN GKGIFWKIEK PGLAPSWLLG SMHVTDPRVL ALPPRAQAAH DAADTIIIES DEILDERKAT AALLAKPELT MFTDGTTIDK LLSPEDYKRL ETGLKQRGIP ISTVSRMRPW MISSAVALPA CEIARKAKGA QFLDQKIATD AIAQGKQVKG LETLAEQIQA MADLPVEFHL KSLIETLELG DKMSDVVETM TDLYLSGDIG MTMPMLKTVT PEEEGENSDY AAFEQRVILD RNKVMAERAA PILDSGNVFM AVGALHLPGK DGVIELLRQQ GFTVTDVN
|
| |
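The sequences above are printed in space-separated blocks of ten residues, so they need to be joined before any calculation. The sketch below shows one way to strip the block spacing and compute a GC percentage such as the one reported in the GC content row. The helper names are hypothetical, and the example runs only on the first two blocks of the gene sequence, not the full record.

```python
def join_blocks(blocked_seq: str) -> str:
    """Remove the whitespace between 10-residue blocks and upper-case the result."""
    return "".join(blocked_seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two blocks of the gene sequence, used here as a small illustration.
prefix = join_blocks("ATGACGACAT CAATCGGCCG")
print(len(prefix))         # 20
print(gc_percent(prefix))  # 55.0
```

Applying the same two functions to the full Gene sequence field would give the whole-gene value to compare with the reported 62%.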