Gene Rleg_6606 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_6606 
Symbol 
ID8022856 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012858 
Strand
Start bp31634 
End bp34918 
Gene Length3285 bp 
Protein Length1094 aa 
Translation table11 
GC content62% 
IMG OID644833475 
Producthypothetical protein 
Protein accessionYP_002984609 
Protein GI241666525 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3459] Cellobiose phosphorylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.00814353 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCCACGG ACCTTGACCG CAGTTTTCAA ATGCCCCGAC GCGACGAGCT TGGCCTCTTC 
ACGATCAGCA ACGGTTCCGG CCTGTCGATA TCGGCGCTGC CGAACGGCAC CCTGTTTGCC
ATCGAATATG CCGACGACAA GGGATCGGTG CAGATCAACC AGATCCAGGG CTCGCCGCTG
ACCGGCGGCG TCAGTCGCCT TTATCTGCGT ATCGGTGGTG CGGCACCTGA TGTCGTCGAG
CTTGTCGGGT CTTTCGCCGA TGGCAGCTTT GGGCACGATG CGACAAGTCT CTCCTGGAGC
GGCAAGAGAG GCGATATCGG CTACAATGTC CGCCTCGAGC TTCATCCCTC CGAAACGGCC
TGGTTCTGGC GTGTTTCCAT CCGGCATCTG AAGGATGGCA CGCTGCCGGT CGATCTGGTG
CTGATCCAGG ACGTCGGTCT TGGCGATCGC GGCTTCCTGA TGAACAGCGA AGCCTATGCC
TCGCAATATG TCGATCACCA TATCGCCGAT CACGAGACAT TCGGTTCTGT CGTAATGAAC
CGGCAAAATC TCAAGCAGTC GGGTGCTCGC AACCCCTGGC TCGTGCAGGG CTGCCTCGAT
GGGGCGGCTG CCTATGCCAC CGATGCGATC CAATTGGTAC AGGCGAGTGA CCGTCTCGGC
GACTTGCTGG TCGGCCCCTT CGGCACCAAC CTGCCGAGCA AGCGGCGCCA GCAGGAGACG
GCATGCCCCG CCATCCAGTC AAAATCGCTT TCCGTCCCGG CAAGCGGTGC GACCGCGACC
TTCTTTGCCG TCTTCGCCGC GGATCATCCC GAGGCATCGA GCGATGCCGA CCTTTCACGG
CTTGACGAGC TTGCGGCGAC AGAGGGGACG GCGGCCGATA TTGCCGAGGC CGCGCCGGTC
CGCAGCCTGC TCCAGGATGC TGCTTTGCTG AAGGCCGAGG CGCTGGATAA AAAAATGATC
AGCCAGCTCT ATCCGCAACG GAGCCTGGAA GAGCGCGTCG ACGGAAAGCT TCTTTCGTTT
TTCGTCTCCG ACGGCGTTTT GAACCGCCAT GTGGTCCTGC GCGACAAGGA GCTCTTGGTA
GCGCGCCGTC ACGGGGCGAT CGTCAGAAGC GGTGAGAATA TGCTGCTCGA CGATAGGACG
CTTGCCGCGA CCTGCTGGAT GCAGGGTATT TTCGCGGCCC AGCTGACGAT CGGCAATACG
TCCTTTCACA AGCTCTTTTC GGTCTCCCGC GACCCCTACA ATCTGACCCG CGCCAGCGGG
CTGCGCATCA TGGCCGATGT CGGCGCCGGC TGGCAGCTTC TGGCGGTGCC GTCGGCTTTC
GAAATGGGGC TCAGCGATTG CCGCTGGATC TATCGCCTCC CTGAACGCAC GATCATCGTG
TCGGCGGTCG CCTCCGGCGA GGATGCGGCG ATGCAGTGGA CCGTCTCCGT CGAGGGCGAG
CCGTGCCGCT TCCTGGTGTT TGGTCATGTC GTGCTCGGCG AGCGCGAATA TGATGCGGGC
GGGCAGATCG AATTCGATAC GTCGGGTAAA CGCCTTCTTT TCCGGCCGGA CCCGGCCTGG
CTCTGGGGCG AGCGTTATCC CGACGCCGGC TACTGGCTGG TGAGTTCGAC GCCCGATGCC
ATCGAAGAGA TCGGCGGCGA CGAACTGCTC TACAGCGATG GGGTAACACG CAACGGCGCT
TTCGTGGCAC TACGATCACT GATGACGCAG GCGCTGTCGT TCGCTGTCGT CGGCTCGATG
ACAGATGCGG CGGAAGCCGA GCGCCTGGCG CAGCGTTATC AGGCCGGCGT CACCGATGAA
GCCATGCTTG CCCCGGCATC GAAATTCTGG CGGAACACCG TGCGCGGCCT GACGGTCGCC
AGCACGTCGC CCGACCTTGC CGCGCAGACG ACCCTGCTGC CCTGGCTCGC CCATGATGCA
ATCGTGCATC TAAGCGTGCC GCACGGCCTC GAGCAATATA CGGGTGCGGC CTGGGGCACA
CGCGACGCCT GCCAGGGGCC GATAGAATTC CTGCTCGCCT ACGAGCATGA CCGCGAAGCC
AAGCAGGTGC TGAAAACCGT CTTCAGCGAA CAATACCTGG GAAAAGGCGA CTGGCCGCAA
TGGTTCATGC TGGAGCCCTA TGCCAACATC CGGGCGGGCG ATAGTCACGG CGATATCGTC
GTCTGGCCGC TGAAGGCGCT CTGCGACTAT ATCGAAGCGA CCGGCGATCT CGCCATCCTC
GACGAGAAAG TCTCCTGGCG CGATGAAAAG ACGATGGCCA GGGCGGAGCT CGACACTATC
GCAATCCATG TCGAGAAGCT GCTCGATACC GTTCGCGAAG CTTTCATCCC CGGAACGCAT
CTGATCCGCT ATGGCGAGGG AGACTGGAAC GATTCTCTGC AGCCGGCCGA TCCGCATCTG
CGCGACTGGA TGGTCAGCAG CTGGACCGTT GCCCTGCTCT ACGAGCAGAT CGTGCGCTAT
TCGGCGATCC TGCGCCGCCT CGGCCATGGC GGCAAGGCGA AAGGCTTGAG GAAGATCGCA
ACGGCGATGC GCCGGGATTT CAACCGCCAT CTCGTGCGCG ACGGCGTCGT GGCCGGCTAC
GGTATCTTCG ATCCCAGCCA TGACGGCGTC GAATTGCTGC TGCACCCGAG CGACCGGCGC
ACCGGCCTGC ATTTTTCGCT GATCTCGATG ACGCAGGCGA TGCTCGGCGG CCTCTTCACG
CCGGCTCAAA GACAGGGTCA TATGAAGCTG ATCGAAGAGC ATCTGCTCTT TCCCGACGGC
GTACGACTGA TGGAGAAGCC CGCGGCCTAT GCCGGCGGAC CCGAGACGCT GTTTCGCCGC
GCCGAATCCT CCTCCTTCTT CGGCCGCGAG ATCGGGCTGA TGTATGTGCA TGCGCATCTG
CGCTACTGCG AAACGCTGGC GCTTGATGCC GAAGCGGAAG AGCTCTGGAA GGCGATCGCC
CTCGTCAACC CGATTTCGGT CACATCGGCG TTGCCGCATG CATCGCTGCG CCAGCGCAAC
ACCTATTTCA GCAGCAGCGA TGCGGCTTTC CATGACCGTT ATCAGGCAGC GGCCGAATGG
GAGCGCGTCA AGGCGGGAAA GATCGCCGTC GACGGCGGAT GGCGGATCTA TTCAAGCGGC
CCCGGGCTTT ACACCAGGAG CTTCGTCGAA AATATCCTCG GCTTCAAACG GCGCTTCGGC
CGCCGCAAAC GCAAGCCGCT TCTTCCCGCG GTCCACGCCT CTGCCGATCT GCAAACCGAC
CACGCCGTCT GGCGCCGGCT GATGAAGCCG AAGCCTGAGG TGTGA
 
Protein sequence
MATDLDRSFQ MPRRDELGLF TISNGSGLSI SALPNGTLFA IEYADDKGSV QINQIQGSPL 
TGGVSRLYLR IGGAAPDVVE LVGSFADGSF GHDATSLSWS GKRGDIGYNV RLELHPSETA
WFWRVSIRHL KDGTLPVDLV LIQDVGLGDR GFLMNSEAYA SQYVDHHIAD HETFGSVVMN
RQNLKQSGAR NPWLVQGCLD GAAAYATDAI QLVQASDRLG DLLVGPFGTN LPSKRRQQET
ACPAIQSKSL SVPASGATAT FFAVFAADHP EASSDADLSR LDELAATEGT AADIAEAAPV
RSLLQDAALL KAEALDKKMI SQLYPQRSLE ERVDGKLLSF FVSDGVLNRH VVLRDKELLV
ARRHGAIVRS GENMLLDDRT LAATCWMQGI FAAQLTIGNT SFHKLFSVSR DPYNLTRASG
LRIMADVGAG WQLLAVPSAF EMGLSDCRWI YRLPERTIIV SAVASGEDAA MQWTVSVEGE
PCRFLVFGHV VLGEREYDAG GQIEFDTSGK RLLFRPDPAW LWGERYPDAG YWLVSSTPDA
IEEIGGDELL YSDGVTRNGA FVALRSLMTQ ALSFAVVGSM TDAAEAERLA QRYQAGVTDE
AMLAPASKFW RNTVRGLTVA STSPDLAAQT TLLPWLAHDA IVHLSVPHGL EQYTGAAWGT
RDACQGPIEF LLAYEHDREA KQVLKTVFSE QYLGKGDWPQ WFMLEPYANI RAGDSHGDIV
VWPLKALCDY IEATGDLAIL DEKVSWRDEK TMARAELDTI AIHVEKLLDT VREAFIPGTH
LIRYGEGDWN DSLQPADPHL RDWMVSSWTV ALLYEQIVRY SAILRRLGHG GKAKGLRKIA
TAMRRDFNRH LVRDGVVAGY GIFDPSHDGV ELLLHPSDRR TGLHFSLISM TQAMLGGLFT
PAQRQGHMKL IEEHLLFPDG VRLMEKPAAY AGGPETLFRR AESSSFFGRE IGLMYVHAHL
RYCETLALDA EAEELWKAIA LVNPISVTSA LPHASLRQRN TYFSSSDAAF HDRYQAAAEW
ERVKAGKIAV DGGWRIYSSG PGLYTRSFVE NILGFKRRFG RRKRKPLLPA VHASADLQTD
HAVWRRLMKP KPEV