Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_4322 |
Symbol | |
ID | 8015102 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 4439113 |
End bp | 4440765 |
Gene Length | 1653 bp |
Protein Length | 550 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644826898 |
Product | Tetratricopeptide domain protein |
Protein accession | YP_002978101 |
Protein GI | 241207005 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG5653] Protein involved in cellulose biosynthesis (CelD) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.576139 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCATAG ATATCATCGA TACCATTGCG GGGTTCGAGG CGGTGCGCGA GAACTGGGAT CAGGTCTTCC TGGAAGATCC AGACGCGCAG CATTTCCTCT CCTGGATATG GCTGAAGAAC TATCTGTCGC GCCGACGCCG CTGGTTCATC CTCGCCCTTC GCGAACGCGA TCCCGAAGCA CCCTATGTCG CCTTCCTTCC GCTGCGCCTC ATCACGCATC TGAACGAAAA GACCGGGCTC TTCTATGACG AGATCATCAT GGCCGGGAAT TTTGCGGCCG ATTATACCGG CTTCATCGTC AGGCCGGATT ACGAGCATCA CGCCATTGCG GGCTTTGCTT CGTTTATCAA ACATCAAAAC TGGACCGACC TGAAACTCGA ATATTTCAGC GGTCCTGCCA GCCGGCGCGA GAAGATGATC GAGGCGCTGC AGGGACCGGA AGTGATGTTT CGCGACAGCT CGCCCAAGAA CAGCGAGAAT ATCGACAATA CGATCTGTCC CATCGTCCCC CTGCCGGCAA GCTTCGACGA CTATCTCGAG CAGCGCATGA GCAGCCAGAC GCGCCAGAAG CTCCGGCGGT TCCTGCGCAA GGTCGAAGGC GACGATATTT ACAGGATCAC GATGGCAACT CCCGAGACCA TCGATCGGGA CCTGGACACC CTCTTCAATC TCTGGCGGAT CAAATGGAGT GCCCGCAAGG GCGCAGAGCG GACCGAGCGG CTGATCGTCA CCACGCGCGA GATGCTGATG GACAGCTTCA ACTGCGGCAA TCTCGAGGTG CCGGTCCTTT GGTATGGCGA CCAGCCGCTC GGCGCGCTGG CGAATATCGT CGACCGGCAG AAGAAGGCCA TCCTCTTCTA TATCACCGGC CGCGACGAGA ACTGGAAGAC GCCGTCTCCC GGCCTCATCC TGCATGGCTA CTGCATCCGG CGGGCGATCG AGCAGGGCTT CAAGACCTAT GACTTCCTGC GCGGAAACGA ACCCTATAAG TACATGTTCG GGGTGGAGGA AAGGCGGATC AGCTGCACCC TGTTCCGCAC CCGCAACGGC CAGAACCTGC ATGGCGCGCT CAACCCGCGC AGCATTCGTT TCGTCTATGA ACAGGCGCTC GACATGTACC GCAATGGAGC CCGGAGCAAG GCGGAGATCG CCTTCAACCA GGTCCTGCAA TCGGCGCCAG GCCATACCGG CGCGGAATTC GGGCTCGCCA ATCTGCTGTT CGACCGAGGC AAACTGACGG AGGCATTGAC TGCCTACAAG GCTCTCGTCG AGCAAGCACC CGACCCGACA CCGATCCAAA TGCGGCTTGG CGACACGCAA CTTGCGCTGC ATCAATATGA GCAGGCCGCA GAGACTTTCC GTCAGATCGG CGAGATCGGG CCGCATCTCA TCCAGGCGCA TTACAAGCGT GGTATCGCCC TTGCGGCGAG CAAACGGCTG GCCGAGGCGG AGGCGGTTTT CGCCGCGATC CAGGACGTGC ATTCGGACGA TCCGACCGCA CTCGATTATG CCGCCAAGGC GGGCGTTGCC CTCGAACGCC TGCGCTCGAT CACCGAGCCC GCCGCTGGCA AGACGGACGT CGTGCCGGAG ACCATCGCCC GCTGGAACCG AGGACGGCAG CTCAGCGAGC GACGCCGGCC ACGTCTGCAT TGA
|
Protein sequence | MRIDIIDTIA GFEAVRENWD QVFLEDPDAQ HFLSWIWLKN YLSRRRRWFI LALRERDPEA PYVAFLPLRL ITHLNEKTGL FYDEIIMAGN FAADYTGFIV RPDYEHHAIA GFASFIKHQN WTDLKLEYFS GPASRREKMI EALQGPEVMF RDSSPKNSEN IDNTICPIVP LPASFDDYLE QRMSSQTRQK LRRFLRKVEG DDIYRITMAT PETIDRDLDT LFNLWRIKWS ARKGAERTER LIVTTREMLM DSFNCGNLEV PVLWYGDQPL GALANIVDRQ KKAILFYITG RDENWKTPSP GLILHGYCIR RAIEQGFKTY DFLRGNEPYK YMFGVEERRI SCTLFRTRNG QNLHGALNPR SIRFVYEQAL DMYRNGARSK AEIAFNQVLQ SAPGHTGAEF GLANLLFDRG KLTEALTAYK ALVEQAPDPT PIQMRLGDTQ LALHQYEQAA ETFRQIGEIG PHLIQAHYKR GIALAASKRL AEAEAVFAAI QDVHSDDPTA LDYAAKAGVA LERLRSITEP AAGKTDVVPE TIARWNRGRQ LSERRRPRLH
|
| |