Gene Rleg_5389 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_5389 
Symbol 
ID8007347 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012848 
Strand
Start bp801429 
End bp802697 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content64% 
IMG OID644822293 
Producthypothetical protein 
Protein accessionYP_002973553 
Protein GI241113718 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1626] Neutral trehalase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACATGC ATACCGACCA GGCGAAGCGC ATCCTCGCCG CCAACGATCG CGGCGGCTAT 
ACCGTGCCGA CCGACCGGCT CTACCCGTTC CAGTGGAACT GGGATTCGGC CTTCGTTGCC
ATGGGCTTTG CGCTCTACGA TACCGACCGC GCCTATCGCG AGCTGGAGCG GCTGGTCGAG
GGCCAGTGGG CTGATGGAAT GATCCCGCAT ATCGTCTTCC ATGCGCCAAG CGATACTTAC
TTCCCGGGAC CGAACGTCTG GCGCACGAGA CACGCTATCC CGACCTCCGG CATAACCCAG
CCGCCGGTCT TTGCGATTGC GCTGCGCAAG CTGCATGAAG CCGCCGGCAA GGATGGCGAA
GCGCGCACCC TGCCCCTCTA CGTGGCGGCG CTGAAATGGC ATCGCTGGTG GTATTCGGCG
CGCGACCCTG AAGGCACGGG GCTCATAGCA CTCCTGCATC CCTGGGAAAG CGGCAGCGAC
AATTCTCCCG CCTGGGACAT CGCGCTCGCC CGAGTGCCGA CCAATACCGA TACGCCTGTG
GTGCGCAAGG ATACCGGTCA TGTCGATGCC GATATGCGCC CGCGCGACGA GGATTACCGC
CGCTTCATCC ATCTCGTCGA TACCTATGCC GCCTGCGGCT GGGATCCGGC GCGGCAATGG
GAAAAGGCGG CGTTCAAGGT CGCCGAGATC CAGACGACCG CAATCCTGCT CAAGGCGGGC
GAAGATCTGG AACACCTTGC CCGCCTGTTT GGGCGGACCG ATGACGCGAT CGAGATCGCT
GCCTTCAACG ACCGCAGCCG CAAGGCGATA ATGGCCCAGT GGCGGCCGGA GCTTGTCCGC
TTCGTCTCGC GCGACCTGAT CTCCGGCGAA GATGTCGAAG CCGCCACGCA AGCCGGCTTC
ATCCCCCTCC TCTCGCTGGA CCTCGACAAG CAGGTTGCGG ACGCCCTGGT CTCCGAAATG
AAGGCCTGGT CCAAGGATCT CAAGGTTGCC TTCCCCACGA CCAAACCCGG CATCGCCAGT
TGGGAGCCGA AGCGCTACTG GCGCGGCCCC GCCTGGGCGA TCATCAATTG GCTGCTGATC
GACGGCCTTA AGCGCAATCG CTACGCGGAT GTCGCCGAAG AGCTGCGGCA ATCCACCATC
GCAGCGATCG AAACGGAAGG TTTCGCCGAA TATTTCGACC CGGTCACCGG CCAGGGCTGC
GGTGGCCTCG GCTTTTCCTG GACGGCTGCC GCCTATCTAT GGCTTGAGCG AGGCGTCGTC
CTCGCCTGA
 
Protein sequence
MNMHTDQAKR ILAANDRGGY TVPTDRLYPF QWNWDSAFVA MGFALYDTDR AYRELERLVE 
GQWADGMIPH IVFHAPSDTY FPGPNVWRTR HAIPTSGITQ PPVFAIALRK LHEAAGKDGE
ARTLPLYVAA LKWHRWWYSA RDPEGTGLIA LLHPWESGSD NSPAWDIALA RVPTNTDTPV
VRKDTGHVDA DMRPRDEDYR RFIHLVDTYA ACGWDPARQW EKAAFKVAEI QTTAILLKAG
EDLEHLARLF GRTDDAIEIA AFNDRSRKAI MAQWRPELVR FVSRDLISGE DVEAATQAGF
IPLLSLDLDK QVADALVSEM KAWSKDLKVA FPTTKPGIAS WEPKRYWRGP AWAIINWLLI
DGLKRNRYAD VAEELRQSTI AAIETEGFAE YFDPVTGQGC GGLGFSWTAA AYLWLERGVV
LA