Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_6037 |
Symbol | |
ID | 6977423 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011366 |
Strand | + |
Start bp | 467073 |
End bp | 468335 |
Gene Length | 1263 bp |
Protein Length | 420 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643393489 |
Product | protein of unknown function DUF181 |
Protein accession | YP_002278307 |
Protein GI | 209546417 |
COG category | [S] Function unknown |
COG ID | [COG1944] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR00702] uncharacterized domain [TIGR03604] bacteriocin biosynthesis docking scaffold, SagD family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.463338 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.501056 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTGTTT TCACCGATCT TGAAACGCTT GCCGGTGACC TCAAACGATC ATCGGCCGGC GTCAGCGATT ATCACGACCG CGCCGTCACG CCGGCTCAGA CCTTGGCGGC GATCAGGCCG CATCTGCGCG AATTCGGCAT CACGCGCGTC GGCCTGCTGA CCGCACTCGA CGTCTTGAAC ATCCCCGTCG CCTTCGCGAC GCGGCCGAAC AGCCATACGC TCTCGGTCTT CCAGGGCAAA GGCATCGACA ATGATGCCGC CATGACCTCG GCCGCCATGG AAGCGATCGA GACGCGGATC GCCGAAATCC CGCCCGCCGA CCTGACGGAG GCGACCGTCG CCGGCATGCG GGCGGAAAAT GCGGCCATGA TCGATCTCGA CAATGTCGCC CGCTGCGCTC CCGACGAGAT CGGCAGCGGA CCCATTCCCT GGTGCTCCGG GCTCGACATC CTTTCCGGCA GCAGCGCCTT CGTGCCGTGG TGGCTTGTCG GCCTCGACCA TCGCGGCGAA AGACCACCGG GTTTCGAGCA GTCGAGCGAT GGGCTGGCCT CCGGCAACAC GCCATCCGAA GCCGTTCTGC ACGGGCTCTG CGAACTGGTG GAGCGCGACG CCTGGGCCTT GACCCAGCTG AAATCGCCCG AGCGGCTGAA GGAGAGCCGC ATCGATCCCG CCTCCTTCGG CGACGCAGTC ATCGATGTCA TGACCGACCG GATCGCGCGC GCCGGCATGC GGCTGCTGCT CCTCGACATG ACCACCGATA TCGGCGTTCC CGCCTTTCTC GCGGTCATCA TGCCCGGCAA CCTTTCCGAC CGTGTCGATG CACGCTGGGC CCATGTCTGC GGCGGCTGCG GCTGCCATCC CGATCCCGTG CGCGCCGCGC TGCGCGCCAT CACCGAAGCG GCGCAGAGCC GGCTGACCGC AATTGCCGGC AGCCGCGACG ATTTTTCGCC GCGCGTCTAT CAGCGGCTCG ACCAGAGCGC GGCGATGCAG CAGGTGGTCG AACTTTGTGA GGGCGGCGGC CGCATGCGCG CCTTCCAGCC GCGTCAGAGC CGCCCGGCGA CAATCCAGGA AACCATCGGC CATATCGCCG ACCGGCTGGC TGCGACCGGC ATCGAGCAGA TCGTCGTCGT GCCGTTTGCG CACCGGGCTC TGCCGGTCTC CGTCGTCAGG GTCATCGTGC CGGGCCTGGA GGTCGATATC TCCGGCCAGT ACATCCAGCT CGGCATGCGG GCGGTCAACA CCATGAGGGG AGCCCAGTCA TGA
|
Protein sequence | MSVFTDLETL AGDLKRSSAG VSDYHDRAVT PAQTLAAIRP HLREFGITRV GLLTALDVLN IPVAFATRPN SHTLSVFQGK GIDNDAAMTS AAMEAIETRI AEIPPADLTE ATVAGMRAEN AAMIDLDNVA RCAPDEIGSG PIPWCSGLDI LSGSSAFVPW WLVGLDHRGE RPPGFEQSSD GLASGNTPSE AVLHGLCELV ERDAWALTQL KSPERLKESR IDPASFGDAV IDVMTDRIAR AGMRLLLLDM TTDIGVPAFL AVIMPGNLSD RVDARWAHVC GGCGCHPDPV RAALRAITEA AQSRLTAIAG SRDDFSPRVY QRLDQSAAMQ QVVELCEGGG RMRAFQPRQS RPATIQETIG HIADRLAATG IEQIVVVPFA HRALPVSVVR VIVPGLEVDI SGQYIQLGMR AVNTMRGAQS
|
| |