Gene Information |
Locus tag | Rleg_4500 |
Symbol | |
ID | 8015261 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 4634489 |
End bp | 4636102 |
Gene Length | 1614 bp |
Protein Length | 537 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644827076 |
Product | lipopolysaccharide biosynthesis protein |
Protein accession | YP_002978277 |
Protein GI | 241207181 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis |
TIGRFAM ID | |
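
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; both assumptions match the numbers in this record but are not stated by it. The function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 4634489
END_BP = 4636102


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1614 537
```

Both results agree with the Gene Length (1614 bp) and Protein Length (537 aa) rows.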
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.177123 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACCAGT ATGACAGGAA CAGGATAAGC CGGCTCCCTG GCTGGCGCAG TTTTGAGCCG TCTCAGACTG CGCCGGAAGG CGTGGGCATG CGCAGTCCCG TCGTCCGCCC GGATGATTTC GTCCGGCCAT CGCCCGAACC TGCCCCGCCC CCCTTCGTGC CGCCGGCAAG CATTGCGGAA ACCCGCCAAT ATGAGCGTCC GGCACCTCAG CCTCAGCCTG CAAGACAGCC TGTCGCGGAC GCGCCGCCGA ATGCAGAGTC CGCGCCGGCC GCACCGCTTC TCGACCTCCG CTCCAGTATC GCCGCGATCT GGAGCCGGCG ACTGATCGTG CTGGGTTTGG CGCTTCTCGG AGCCCTCGCC GGCGGGGCGG TGGCGCCGCG CATCGCCCAG AAATTCACCG CCATCAGCAG CCTCTATTTC GATCCGCGCC AGATCGGCCT TGCCGATGCC GGCGCGCAAT CGTCGGGTCC CTCGCCGGAA ATGATCTCGG CGCTGATCGA CAGCCAGGTG CAGATCCTGA CCTCGGGCAA TGTGCTGCGC CGCGTCGCCG AAGCCATGAA GCTCGACCAA GATCCCGAAT TCACCGGCGG CCGCACGGAT GGCGCCGCCG TGATCGGCAC TCTGCAGAAG GCGCTGGTCA TCACCCGTGA GGCCAGCACC TATGTCGTCT CGCTTGCCGC CACGACCAAC GATCCCGAAA AATCCGCAAG GCTTGCCAAC CAGGTCGTCA CCTCCTTCAC CGAGGAGGAG AACAGTGCCT CGAACGGCAT CTACGAAAAT ACCTCCTCAA CGCTCGACGG TCGCCTCAAC GACCTGCGCC AGAAGGTGCT GGAGGCCGAG CAGGCTGTCG AAACCTTCCG CGCCGACAAC GACATGGCCG CGACCGAGGG CAACCTGATT TCCGATCAGC GGCTCGTTTC GCTGAACACG ATGCTGGTGA CGGCACAGGA AAAGACCATC CAGGCAAAGG CGCGCGCCGA TGCCGTCGCC AACCTCCGCG TCGAGGACAT CGTCGCCGGC AACCAGGCGG AGGGCGGCGT CACCTCGCCG CTCGTCAGCC TGCGTCAGCA ATATGCCACC CAGGCCGCCG CCGTCGGCAG CCTCGAAAGC CAGATGGGCA CGCGTCATCC GCGCCTGCAG GCCGCCCGCT CCTCGCTGCA GAGCATTGGC GGCGAAATCA AGGGCGAATT GCAGCGTCTC GTCACCTCGG CAAGGGGCGA ATACGAGCAG GCAAAGGCCG CCGAGGACAG CATCGCCAAG GAACTTGCCG TGCAGAAGGC GTTGCAGGCG AGTACCTCCG ACAAGCAGGT GGAATTGAAC GAATTGCAGC GCAAGGCGAC GGCGGCGCGC GATATTTACG AGACGGTGCT GAAACGCTCT AGCCAGACGA GCGAGGAGCA AAACTTCAAC CGGAGTAACA TTCGCGTCAT TTCGCCGGCC GAGCCGCCGG TCAAGGGAGA CGGTCCCGGA AAGACGATTC TATTGGTTGC CGGCGTCATC GGCGGTTTTC TCGCCGGTTT CGTCGTCGGC GCGGGCTTTG CGATTCTCGC CGGCCTCTTC AGCCATCCCG TCATCAGAAG TTATTTCAGG AAGTCTCCCG CTGCGGCCGC TTGA
|
Protein sequence | MNQYDRNRIS RLPGWRSFEP SQTAPEGVGM RSPVVRPDDF VRPSPEPAPP PFVPPASIAE TRQYERPAPQ PQPARQPVAD APPNAESAPA APLLDLRSSI AAIWSRRLIV LGLALLGALA GGAVAPRIAQ KFTAISSLYF DPRQIGLADA GAQSSGPSPE MISALIDSQV QILTSGNVLR RVAEAMKLDQ DPEFTGGRTD GAAVIGTLQK ALVITREAST YVVSLAATTN DPEKSARLAN QVVTSFTEEE NSASNGIYEN TSSTLDGRLN DLRQKVLEAE QAVETFRADN DMAATEGNLI SDQRLVSLNT MLVTAQEKTI QAKARADAVA NLRVEDIVAG NQAEGGVTSP LVSLRQQYAT QAAAVGSLES QMGTRHPRLQ AARSSLQSIG GEIKGELQRL VTSARGEYEQ AKAAEDSIAK ELAVQKALQA STSDKQVELN ELQRKATAAR DIYETVLKRS SQTSEEQNFN RSNIRVISPA EPPVKGDGPG KTILLVAGVI GGFLAGFVVG AGFAILAGLF SHPVIRSYFR KSPAAAA
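
The sequences above are displayed in space-separated blocks of ten characters, so they need to be normalised before any downstream use. The following is a minimal sketch, not the database's own tooling: it strips the block spacing, computes a GC fraction, and checks that a string looks like a complete CDS under translation table 11 (stop codons TAA, TAG, TGA). The helper names are hypothetical, and the example input is only the first two blocks of the gene sequence.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def normalise(seq: str) -> str:
    """Remove block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a normalised nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# First two blocks of the gene sequence, used here only as a small example.
fragment = normalise("ATGAACCAGT ATGACAGGAA")
print(fragment, len(fragment), gc_fraction(fragment))
```

Applying `normalise` to the full gene sequence and then `gc_fraction` should give a value close to the 64% listed in the GC content row, assuming that row was computed over the same span.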