Gene Information |
Locus tag | Rleg_0455 |
Symbol | |
ID | 8011655 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 472699 |
End bp | 474471 |
Gene Length | 1773 bp |
Protein Length | 590 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 644823049 |
Product | Capsule polysaccharide biosynthesis protein |
Protein accession | YP_002974303 |
Protein GI | 241203207 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
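The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the reported protein length excludes the stop codon; the function names are hypothetical and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(472699, 474471)
print(length)                           # 1773
print(expected_protein_length(length))  # 590
```

Both values agree with the Gene Length (1773 bp) and Protein Length (590 aa) rows of the record.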
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAAATATC GCCACCCCAT TGCTGCCTTA ACCGTGCCTA CGAGGCCGCT GTTGTCTCGC ATTCGCGACA GGCTTAGCCG ACATGGTTGG TATCGCCGCT TTCATCTCGC ATATGTAAGA TGGCGTGCCC GCCATGGTGG ACTGCCCGAC TGGGCGGCAT GGCAACCACA AGCGGGACTC GAGCCGGTAC GCCCACCTTT GATCGACGGC GCCAAGCGCG TCTTGGTTGC AACAGGCACT GCCGGACATC TGCCGTCCAT GACGATGGAA AGCCTGCTTG GAATGGCATT GGCGATTCGT GACGCCAGTA TTGACTTCCT CATTTGCGAT GCGGCGCTGC CTGCCTGCAT GATGTGCGAG ATCAGTTGGT ATTCTGATGT CGACAAGCTG GCTAAGCATG GCCCAAAAGA CCGCTGTCTG ACCTGCTATC GACCATCCGC TGAAATGCTC GACCGGGCCG GACTCAACGC GATCGGACTC AACTCGAAAC TCACTGACGC CGATCGCTTC AAGGCAAAGA CGACTTCAAC GCAGATGCCG CAAAGCGAGA TCGCGGCGTA TATAGAGGAC GGCATTCCCA TCGGGGAGCA CGCCGTGGCG GGCGCTCTCC GTTTTTTTTC AAGAGGCGAC CTTGAGAAGG TGTCTGGAGC CGACGTGATC CTTAGGCGTT ACCTGGAAGC GGCGATTCTG ACCTATTACG GCACCCGACG CCTGCTTGCC GAAGGTCACT ATGATGCCGT GGTGTTAAAT CACGGGATAT ATGTTCCCCA GGGCATCATA TCGGAAACAG CCCGACACCT CGGCGTGCGC GTGGTGACGT GGCATCCAGC CTACAGACGG GGATGCTTTA TCTTCAATCA CCATGAAACC TATCATCATG GCTTGATGAC CGAACCGGTA TCGTCTTGGG AGGATATGTC TTGGAACGGT ATGCAGCAGC AGCAGATCAC GCAGTATCTC CGTAGCAGAT GGGTTGGCCA GCAAGACTGG GTGAAGTTTC ACGATCAGCC GGAGTTCGAT ACCCGCTCGA TTGAGGAGGA AATCGGCATT GATTTCCGAC GTCCGACCAT CGGGTTGCTC ACAAACGTGA TTTGGGACGC GCAGCTGCAT TACAAGGCAA ACGCCTTTCC GAACATGGTT GACTGGTTGA TCAAGACCAT TGCCTACTTC GAAAAGCGGC CCGACCTGCA GTTGCTGATA CGGGTTCATC CGGCCGAACT TACAGGGACG CTTCCATCCC GGCAGCCTGC CGTCGACGAA ATTCGGCGCC AGTTTCCTAA CCTGCCGGCA AATGTCTTCA TCATTCCGCC CGAAAGCAAA GCAAGTACCT ACGTCGCCAT GTCTCACTGC AACGCCGTCC TGATCTATGG CACGAAAATG GGCGTTGAGC TTTCGGCGAT GGGAATACCG GTCATTGTCG CCGGTGAGGC CTGGATTCGT GGCAAAGGCG TTACGATGGA TGCAACCTCG GAAGAAAATT ATCTGCGCTT GCTCGATGCT CTGCCGCTGC GGGAAAGGCT GAATGACGCC ACGATCGAAC GGGCGCAGAA ATATGCTTAT CACTTCTTCT TTCGGCGCAT GGTCCCGCTG GATTGTATTA AGGAGCGAAA GGGCTGGCCG CCATTCGCCG TGCATATAGA TAGTCTGGAC GACCTTGCAC CCGGCAAGTC GCCCGGGCTC GACATCGTTT GCAACGGCAT CATCGCGGGG ACACCCTTCA TCTATCCCGC AGAGGACCTC ATGGGAGCGC CCTCAGACCG GAGAGCGACG TGA
|
Protein sequence | MKYRHPIAAL TVPTRPLLSR IRDRLSRHGW YRRFHLAYVR WRARHGGLPD WAAWQPQAGL EPVRPPLIDG AKRVLVATGT AGHLPSMTME SLLGMALAIR DASIDFLICD AALPACMMCE ISWYSDVDKL AKHGPKDRCL TCYRPSAEML DRAGLNAIGL NSKLTDADRF KAKTTSTQMP QSEIAAYIED GIPIGEHAVA GALRFFSRGD LEKVSGADVI LRRYLEAAIL TYYGTRRLLA EGHYDAVVLN HGIYVPQGII SETARHLGVR VVTWHPAYRR GCFIFNHHET YHHGLMTEPV SSWEDMSWNG MQQQQITQYL RSRWVGQQDW VKFHDQPEFD TRSIEEEIGI DFRRPTIGLL TNVIWDAQLH YKANAFPNMV DWLIKTIAYF EKRPDLQLLI RVHPAELTGT LPSRQPAVDE IRRQFPNLPA NVFIIPPESK ASTYVAMSHC NAVLIYGTKM GVELSAMGIP VIVAGEAWIR GKGVTMDATS EENYLRLLDA LPLRERLNDA TIERAQKYAY HFFFRRMVPL DCIKERKGWP PFAVHIDSLD DLAPGKSPGL DIVCNGIIAG TPFIYPAEDL MGAPSDRRAT
|
| |
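The gene sequence begins with GTG while the protein begins with M. This matches translation table 11 (bacterial, archaeal and plant plastid code), in which GTG is an accepted alternative start codon and is read as methionine when it initiates translation. A minimal sketch of that rule follows; the `translate` helper and constant names are hypothetical, and it assumes the input is in frame from its first base.

```python
# Codon assignments in TCAG order; table 11 uses the same assignments as the
# standard code and differs in the set of permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))

# Start codons listed for NCBI translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(seq: str) -> str:
    """Translate an in-frame CDS, reading a valid start codon as M."""
    dna = "".join(seq.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residue = "M"
        else:
            residue = CODON_TABLE[codon]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate("GTGAAATATC GCCACCCCAT TGCTGCCTTA"))  # MKYRHPIAAL
```

The output matches the first ten residues of the protein sequence in the record.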