Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_6606 |
Symbol | |
ID | 8022856 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012858 |
Strand | - |
Start bp | 31634 |
End bp | 34918 |
Gene Length | 3285 bp |
Protein Length | 1094 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644833475 |
Product | hypothetical protein |
Protein accession | YP_002984609 |
Protein GI | 241666525 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3459] Cellobiose phosphorylase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.00814353 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCCACGG ACCTTGACCG CAGTTTTCAA ATGCCCCGAC GCGACGAGCT TGGCCTCTTC ACGATCAGCA ACGGTTCCGG CCTGTCGATA TCGGCGCTGC CGAACGGCAC CCTGTTTGCC ATCGAATATG CCGACGACAA GGGATCGGTG CAGATCAACC AGATCCAGGG CTCGCCGCTG ACCGGCGGCG TCAGTCGCCT TTATCTGCGT ATCGGTGGTG CGGCACCTGA TGTCGTCGAG CTTGTCGGGT CTTTCGCCGA TGGCAGCTTT GGGCACGATG CGACAAGTCT CTCCTGGAGC GGCAAGAGAG GCGATATCGG CTACAATGTC CGCCTCGAGC TTCATCCCTC CGAAACGGCC TGGTTCTGGC GTGTTTCCAT CCGGCATCTG AAGGATGGCA CGCTGCCGGT CGATCTGGTG CTGATCCAGG ACGTCGGTCT TGGCGATCGC GGCTTCCTGA TGAACAGCGA AGCCTATGCC TCGCAATATG TCGATCACCA TATCGCCGAT CACGAGACAT TCGGTTCTGT CGTAATGAAC CGGCAAAATC TCAAGCAGTC GGGTGCTCGC AACCCCTGGC TCGTGCAGGG CTGCCTCGAT GGGGCGGCTG CCTATGCCAC CGATGCGATC CAATTGGTAC AGGCGAGTGA CCGTCTCGGC GACTTGCTGG TCGGCCCCTT CGGCACCAAC CTGCCGAGCA AGCGGCGCCA GCAGGAGACG GCATGCCCCG CCATCCAGTC AAAATCGCTT TCCGTCCCGG CAAGCGGTGC GACCGCGACC TTCTTTGCCG TCTTCGCCGC GGATCATCCC GAGGCATCGA GCGATGCCGA CCTTTCACGG CTTGACGAGC TTGCGGCGAC AGAGGGGACG GCGGCCGATA TTGCCGAGGC CGCGCCGGTC CGCAGCCTGC TCCAGGATGC TGCTTTGCTG AAGGCCGAGG CGCTGGATAA AAAAATGATC AGCCAGCTCT ATCCGCAACG GAGCCTGGAA GAGCGCGTCG ACGGAAAGCT TCTTTCGTTT TTCGTCTCCG ACGGCGTTTT GAACCGCCAT GTGGTCCTGC GCGACAAGGA GCTCTTGGTA GCGCGCCGTC ACGGGGCGAT CGTCAGAAGC GGTGAGAATA TGCTGCTCGA CGATAGGACG CTTGCCGCGA CCTGCTGGAT GCAGGGTATT TTCGCGGCCC AGCTGACGAT CGGCAATACG TCCTTTCACA AGCTCTTTTC GGTCTCCCGC GACCCCTACA ATCTGACCCG CGCCAGCGGG CTGCGCATCA TGGCCGATGT CGGCGCCGGC TGGCAGCTTC TGGCGGTGCC GTCGGCTTTC GAAATGGGGC TCAGCGATTG CCGCTGGATC TATCGCCTCC CTGAACGCAC GATCATCGTG TCGGCGGTCG CCTCCGGCGA GGATGCGGCG ATGCAGTGGA CCGTCTCCGT CGAGGGCGAG CCGTGCCGCT TCCTGGTGTT TGGTCATGTC GTGCTCGGCG AGCGCGAATA TGATGCGGGC GGGCAGATCG AATTCGATAC GTCGGGTAAA CGCCTTCTTT TCCGGCCGGA CCCGGCCTGG CTCTGGGGCG AGCGTTATCC CGACGCCGGC TACTGGCTGG TGAGTTCGAC GCCCGATGCC ATCGAAGAGA TCGGCGGCGA CGAACTGCTC TACAGCGATG GGGTAACACG CAACGGCGCT TTCGTGGCAC TACGATCACT GATGACGCAG GCGCTGTCGT TCGCTGTCGT CGGCTCGATG ACAGATGCGG CGGAAGCCGA GCGCCTGGCG CAGCGTTATC AGGCCGGCGT CACCGATGAA GCCATGCTTG CCCCGGCATC GAAATTCTGG CGGAACACCG TGCGCGGCCT GACGGTCGCC AGCACGTCGC CCGACCTTGC CGCGCAGACG ACCCTGCTGC CCTGGCTCGC CCATGATGCA ATCGTGCATC TAAGCGTGCC GCACGGCCTC GAGCAATATA CGGGTGCGGC CTGGGGCACA CGCGACGCCT GCCAGGGGCC GATAGAATTC CTGCTCGCCT ACGAGCATGA CCGCGAAGCC AAGCAGGTGC TGAAAACCGT CTTCAGCGAA CAATACCTGG GAAAAGGCGA CTGGCCGCAA TGGTTCATGC TGGAGCCCTA TGCCAACATC CGGGCGGGCG ATAGTCACGG CGATATCGTC GTCTGGCCGC TGAAGGCGCT CTGCGACTAT ATCGAAGCGA CCGGCGATCT CGCCATCCTC GACGAGAAAG TCTCCTGGCG CGATGAAAAG ACGATGGCCA GGGCGGAGCT CGACACTATC GCAATCCATG TCGAGAAGCT GCTCGATACC GTTCGCGAAG CTTTCATCCC CGGAACGCAT CTGATCCGCT ATGGCGAGGG AGACTGGAAC GATTCTCTGC AGCCGGCCGA TCCGCATCTG CGCGACTGGA TGGTCAGCAG CTGGACCGTT GCCCTGCTCT ACGAGCAGAT CGTGCGCTAT TCGGCGATCC TGCGCCGCCT CGGCCATGGC GGCAAGGCGA AAGGCTTGAG GAAGATCGCA ACGGCGATGC GCCGGGATTT CAACCGCCAT CTCGTGCGCG ACGGCGTCGT GGCCGGCTAC GGTATCTTCG ATCCCAGCCA TGACGGCGTC GAATTGCTGC TGCACCCGAG CGACCGGCGC ACCGGCCTGC ATTTTTCGCT GATCTCGATG ACGCAGGCGA TGCTCGGCGG CCTCTTCACG CCGGCTCAAA GACAGGGTCA TATGAAGCTG ATCGAAGAGC ATCTGCTCTT TCCCGACGGC GTACGACTGA TGGAGAAGCC CGCGGCCTAT GCCGGCGGAC CCGAGACGCT GTTTCGCCGC GCCGAATCCT CCTCCTTCTT CGGCCGCGAG ATCGGGCTGA TGTATGTGCA TGCGCATCTG CGCTACTGCG AAACGCTGGC GCTTGATGCC GAAGCGGAAG AGCTCTGGAA GGCGATCGCC CTCGTCAACC CGATTTCGGT CACATCGGCG TTGCCGCATG CATCGCTGCG CCAGCGCAAC ACCTATTTCA GCAGCAGCGA TGCGGCTTTC CATGACCGTT ATCAGGCAGC GGCCGAATGG GAGCGCGTCA AGGCGGGAAA GATCGCCGTC GACGGCGGAT GGCGGATCTA TTCAAGCGGC CCCGGGCTTT ACACCAGGAG CTTCGTCGAA AATATCCTCG GCTTCAAACG GCGCTTCGGC CGCCGCAAAC GCAAGCCGCT TCTTCCCGCG GTCCACGCCT CTGCCGATCT GCAAACCGAC CACGCCGTCT GGCGCCGGCT GATGAAGCCG AAGCCTGAGG TGTGA
|
Protein sequence | MATDLDRSFQ MPRRDELGLF TISNGSGLSI SALPNGTLFA IEYADDKGSV QINQIQGSPL TGGVSRLYLR IGGAAPDVVE LVGSFADGSF GHDATSLSWS GKRGDIGYNV RLELHPSETA WFWRVSIRHL KDGTLPVDLV LIQDVGLGDR GFLMNSEAYA SQYVDHHIAD HETFGSVVMN RQNLKQSGAR NPWLVQGCLD GAAAYATDAI QLVQASDRLG DLLVGPFGTN LPSKRRQQET ACPAIQSKSL SVPASGATAT FFAVFAADHP EASSDADLSR LDELAATEGT AADIAEAAPV RSLLQDAALL KAEALDKKMI SQLYPQRSLE ERVDGKLLSF FVSDGVLNRH VVLRDKELLV ARRHGAIVRS GENMLLDDRT LAATCWMQGI FAAQLTIGNT SFHKLFSVSR DPYNLTRASG LRIMADVGAG WQLLAVPSAF EMGLSDCRWI YRLPERTIIV SAVASGEDAA MQWTVSVEGE PCRFLVFGHV VLGEREYDAG GQIEFDTSGK RLLFRPDPAW LWGERYPDAG YWLVSSTPDA IEEIGGDELL YSDGVTRNGA FVALRSLMTQ ALSFAVVGSM TDAAEAERLA QRYQAGVTDE AMLAPASKFW RNTVRGLTVA STSPDLAAQT TLLPWLAHDA IVHLSVPHGL EQYTGAAWGT RDACQGPIEF LLAYEHDREA KQVLKTVFSE QYLGKGDWPQ WFMLEPYANI RAGDSHGDIV VWPLKALCDY IEATGDLAIL DEKVSWRDEK TMARAELDTI AIHVEKLLDT VREAFIPGTH LIRYGEGDWN DSLQPADPHL RDWMVSSWTV ALLYEQIVRY SAILRRLGHG GKAKGLRKIA TAMRRDFNRH LVRDGVVAGY GIFDPSHDGV ELLLHPSDRR TGLHFSLISM TQAMLGGLFT PAQRQGHMKL IEEHLLFPDG VRLMEKPAAY AGGPETLFRR AESSSFFGRE IGLMYVHAHL RYCETLALDA EAEELWKAIA LVNPISVTSA LPHASLRQRN TYFSSSDAAF HDRYQAAAEW ERVKAGKIAV DGGWRIYSSG PGLYTRSFVE NILGFKRRFG RRKRKPLLPA VHASADLQTD HAVWRRLMKP KPEV
|
| |