Gene Information |
Locus tag | Rleg2_5398 |
Symbol | |
ID | 6978492 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011368 |
Strand | + |
Start bp | 1039800 |
End bp | 1041692 |
Gene Length | 1893 bp |
Protein Length | 630 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 643394500 |
Product | 4-alpha-glucanotransferase |
Protein accession | YP_002279318 |
Protein GI | 209547400 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1640] 4-alpha-glucanotransferase |
TIGRFAM ID | [TIGR00217] 4-alpha-glucanotransferase |
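
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the record itself. It assumes that Start bp and End bp are 1-based inclusive coordinates and that Gene Length counts the terminal stop codon; the variable names are illustrative only.

```python
# Values copied from the Gene Information table above.
start_bp = 1039800
end_bp = 1041692
gene_length_bp = 1893
protein_length_aa = 630

# Assumption: coordinates are 1-based and inclusive,
# so the span covers both the start and the end position.
span = end_bp - start_bp + 1

# Assumption: the CDS length includes the stop codon,
# which does not contribute an amino acid to the protein.
codons, remainder = divmod(span, 3)
expected_protein_length = codons - 1

print("span matches Gene Length:", span == gene_length_bp)
print("span is a whole number of codons:", remainder == 0)
print("protein length matches:", expected_protein_length == protein_length_aa)
```

Under these assumptions the three fields agree: 1893 bp is 631 codons, and dropping the stop codon leaves 630 residues.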
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.229574 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.00287392 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGATGAAAT CCGCAGAGCT CGACAAACTC GCCCGCCGCC ACGGCATCAG CCTGACAAGG CCTAGCCCCG AGAACCGGGA AGTGGTGATC TCAGCCGCGA CTAAGCGCAA AATACTCTCG GCATTGAATA TTGAACTGAC GGAGGATCAA GAGCCTGGTG AGCCGCGGCG GAAGGCTAAG CCGGATGGCA GGAAGATCCC GGTGTCGTTT CTGCCGGATT TCCTATCCGA CACACGGGTC TGGGGCGTGA GCTTGCAGCT TTACGAGCTC CGTTCGGCAC GCAACTGGGG CATAGGAGAC TTCCAGGATC TCGCCGATTT GGCCGATCTG GCGGGATCGC TGGGGGCGGA TTTCATCGGT CTCAATCCGC TTCACGCGCC GTTCCTCGCC GATCCTGATC GCTGCAGCCC CTATGAACCC TCAAGCCGCC AGCATCTCAA CCCGCTCTAT ATCGCGGTCG ACCAGGTGCC GGGCTTTGCT GGCAATCCCA AGCTGGAACA GGAATTGGAG CGCCTTCGCC AATCCGATCT CGTCGACTAC ATCGGTGTCG CGCGGGCCAA GCTTGGAGCC CTTCGTGATC TCTGGTCGGC GCGGCGACAA TGCCGTGTTG GCGACGAGGC CGATTTCGAC GCATTTGTCG CGCAAGGCGG CGACAGCCTG CGGCTGCATG CGCTGTTCGA ATGCCTCTCC GCTTTCATGG TCGAGCGCGG GGCGGGCGCC GGCTGGCAGC GGTGGCCGGC CGAGTTGCAG CGCTTCGACA GCGCCGCTGT CGGCGATTTC GAACGCGAGC ATGCAGATGA CGTTCGCTTT CACATGTGGC TGCAATGGCT CGCCCACCGC CAGCTGATGC AGGCGGCAGA TCGGGCGCGC AAGGCCGGCC TCAGGATAGG GCTCTATCTC GATCTTGCCG TCGGGGAGGC GGTCGACGGC TCGGCGACAT GGAGCGAGCC GGATATCTAT GTCTCGCAGG CGACGATCGG TAGTCCTCCG GATCCATTCG CCGTCGATGG GCAGGATTGG CACCTTGCCG GATACCTGCC ATCCGAAATT GCCGGAGGGG AGATGTCGCC TTACCGGCGC ATGGTTGGCA CCGCCATGCG CTACGCGGGC GCCATTCGTA TTGATCACGC ACCGGCGATC CGCCGCCTTT TCCTGGTTCC GTTAGGCAGC AGGCCGGATG GCGGCGCCTA CGTCCGCTAT CCCGAGGACC GGCTGTTGCA GATCCTCGCC GAGGTTTCCG CTGAACATCG ATGCCTTGTC ATCGGGGAGT CCCTCGGAAT GATTCCTGAA GGCTTGCAAG AGGATCTGGC TACTGCCGGC ATTCTCTCCT ACCGGATCCT TTCCTATGAA CAGGATGAGA AGGGCTTCAA GCCCGCCGAT GCCTATCCGG TCCTCGCGCT CGCCTGCATT TCGACGCATG ACCACCAGAC GCTTGCCGGC TGGTGGCGCG GCGCCGACAT TCAGGATCGC TGTGAACACG GTATCGTGCC GCCCGATCTC ACCGAAGAAC ATCTCAAATA CCGCAAGCGC GAGCGGAGGT ATCTGAAAGC GGTCTTCAAC GCCGCTGGCC TCGACGTGCC GCCCCGGCTC ACGGCGGCGC GGGCAAGCCA GGAAGCGTTG CAAGATCTGA CGGTGAGCGC TTATCGTTTC ATTGCTCGCA CGCCGTCGTT GCTGACATCG GTGCGGCTTG CCGATCTCAC CGACGAGAAA GCGCCGACCA ATATTCCGGG CACCAGCGAC AGCTATCCGA ACTGGAAGCC GAAGCTTTCG GTTTTGCTGG AGGATCTGCT GTCGGTCCTG 
CTGCTCAAGC GCGTAACGGC GGCGATGCGG GAGGAAAGGC CGCGCTACGC CTCCGCGACG CGAATGGAAA TCGATCGGGG ACGGAACGAA TAG
|
Protein sequence | MMKSAELDKL ARRHGISLTR PSPENREVVI SAATKRKILS ALNIELTEDQ EPGEPRRKAK PDGRKIPVSF LPDFLSDTRV WGVSLQLYEL RSARNWGIGD FQDLADLADL AGSLGADFIG LNPLHAPFLA DPDRCSPYEP SSRQHLNPLY IAVDQVPGFA GNPKLEQELE RLRQSDLVDY IGVARAKLGA LRDLWSARRQ CRVGDEADFD AFVAQGGDSL RLHALFECLS AFMVERGAGA GWQRWPAELQ RFDSAAVGDF EREHADDVRF HMWLQWLAHR QLMQAADRAR KAGLRIGLYL DLAVGEAVDG SATWSEPDIY VSQATIGSPP DPFAVDGQDW HLAGYLPSEI AGGEMSPYRR MVGTAMRYAG AIRIDHAPAI RRLFLVPLGS RPDGGAYVRY PEDRLLQILA EVSAEHRCLV IGESLGMIPE GLQEDLATAG ILSYRILSYE QDEKGFKPAD AYPVLALACI STHDHQTLAG WWRGADIQDR CEHGIVPPDL TEEHLKYRKR ERRYLKAVFN AAGLDVPPRL TAARASQEAL QDLTVSAYRF IARTPSLLTS VRLADLTDEK APTNIPGTSD SYPNWKPKLS VLLEDLLSVL LLKRVTAAMR EERPRYASAT RMEIDRGRNE