Gene Rleg_5054 details


Gene Information

Locus tag: Rleg_5054
Symbol:
ID: 8007647
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012848
Strand:
Start bp: 437508
End bp: 439304
Gene Length: 1797 bp
Protein Length: 598 aa
Translation table: 11
GC content: 63%
IMG OID: 644821969
Product: Heparinase II/III family protein
Protein accession: YP_002973229
Protein GI: 241113394
COG category:
COG ID:
TIGRFAM ID:
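
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon; both assumptions are inferred from the numbers in this record, not stated by it. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length_bp(437508, 439304)
print(length)                     # 1797
print(protein_length_aa(length))  # 598
```

Both values match the Gene Length and Protein Length fields of the record.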


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.124625
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTCAGCG AGATTTCAGG GGAGCTGCCG GATGTGCTGG GCGATTTTAC GCCCGGGGCG 
GTCGGCTCCG ATCGCCTCAG ATGGAGCACT GTGCCGCAAG CGCTACGGGA ACTGGTCATT
GGCGAGGCCG AAGAGACGGT TGCGCGCGCC TGGCCGCTGA TTGCGGCCTC GGATTACCGG
GAGTTCACCG AAACCGGCAA CCGCGCGCGC TTCGAGGAGC TTTATTTCAC ACGCCGCCGG
ATGCTCAACA ATCTGGTGCT CGGCGAACTC GTCGAAGGCG GCGCGCGCTT CCTGCGGAAG
ATCGTCGACG GCATCTTCCT GATCGTGGAG GAGAGCGGCT GGCAGCTGCC GGCGCATAAC
GCCTATGAAC GAAGCGGCGC GCGCCTGCCG CTTCCCGACA ATTCGCAGCC TGTCGTCGAT
CTCTTCGCCG CCGAAACAGC CGCCCTGCTT GCCACCGTCG TCGCATTGTT TCGCGACGAG
CTCGACGCCA TCAGCCCCGA GATTACGGCA CGCGTCGAGC GCGAGATCGA GATCCGCATC
CTCTCGCCCT ATCTCGGCCG GCATTTCTGG TGGATGGGCC GCGGCGAAGA GCGGATGAAC
AACTGGACGG CATGGATAAG CCAGAACGTC CTGTTGACGG TTTTCTCCCT GAAAACAGAT
CAGCCCACGC GTCACGCCGT CGTCAAAAAC GCGCTCGGCA GCCTCGACGC CTTCCTGAAG
GACTATGCCG AGGACGGCGC CTGCGAGGAG GGCGTGCTCT ATTACCGCCA CGCCGCGCTC
TGCCTGCATG GTGCGTTGAC CATCCTGGAC GCCGTGGCGG CCGGCCTGTT CGCCGGGGTC
TGGCAGCAGC CGAAGATCCG CAACATGGCC GAATATATTG CCCATATGCA TGTGGCCGGC
CGCTACTATT TCAATTTCGC GGATTCCTCC GCGGTGGTCG AACCCTGCAG CGCGCGGGAA
TACCTGTTCG GACAGGCGGT CGGCTCTAAG ATGCTGGCTG AGTTCGCCGC AGCCGACAGA
GCCGCTTCCA ACAATTCGCA TATGCCCGGG GAATGGAACC TCTGGTATCG CGTGCAGGAA
CTGCTGGCCG GCCCGACGCT TCCCGCTGCC GCCCCGCCGC ATCCTGCATC TCAGCGCGAT
ATCTTCTATC CCGGCATCGG CCTGTTCATC GCCCGCGACG AGCAGTTTTC GCTTGCCGTC
AAGGGCGGTA ACAATGGCGA GGGCCACAAT CACAACGATG TCGGGAGCGT GACGCTCTAC
AAGAGGGGAC GTCCGTTCCT GATCGATGTC GGCGTCGAGA CCTATACCGC AAAAACCTTT
TCGGCGCGGC GCTACGAGAT ATGGACGATG CAGTCGGCGT TCCACAATCT GCCGACATTT
GCAGGCGTCA TGCAGTCGGC CGGCGAAGCT TTTGGCGCGC GCGATGTCGA GGTCGGGTTT
GACGAAGGGA GCGCGCGCAT AGCGCTCGAT ATTTCAGACG CCTATCCCCC CGAAGCGCAG
TTGCACAGCT ATCGGCGCGT CGTTTCCCTG CTGCGCGGCC GCCATGTCGA GATCGTCGAC
ACCTATGACG GCGGCAAGCC CGCGGTCCTG TCGCTGATGA CATGCCTGGC GCCGACCGTC
GGCCCGGACA GGATCGATCT CGCCGATCTC GGCAGCATTT TCGTCGAGGG CGCCGGCGAG
ATCGAAATCG ACGAGATCGT CGTGGAGGAC GCCCGGCTGA GATCGGCCTG GCCCGAGAAA
ATCTACCGGT TGCGCCTGCC GTTTGCCGGC AGGCTGCTGA GATTGCGGAT CGTCTAG
 
Protein sequence
MFSEISGELP DVLGDFTPGA VGSDRLRWST VPQALRELVI GEAEETVARA WPLIAASDYR 
EFTETGNRAR FEELYFTRRR MLNNLVLGEL VEGGARFLRK IVDGIFLIVE ESGWQLPAHN
AYERSGARLP LPDNSQPVVD LFAAETAALL ATVVALFRDE LDAISPEITA RVEREIEIRI
LSPYLGRHFW WMGRGEERMN NWTAWISQNV LLTVFSLKTD QPTRHAVVKN ALGSLDAFLK
DYAEDGACEE GVLYYRHAAL CLHGALTILD AVAAGLFAGV WQQPKIRNMA EYIAHMHVAG
RYYFNFADSS AVVEPCSARE YLFGQAVGSK MLAEFAAADR AASNNSHMPG EWNLWYRVQE
LLAGPTLPAA APPHPASQRD IFYPGIGLFI ARDEQFSLAV KGGNNGEGHN HNDVGSVTLY
KRGRPFLIDV GVETYTAKTF SARRYEIWTM QSAFHNLPTF AGVMQSAGEA FGARDVEVGF
DEGSARIALD ISDAYPPEAQ LHSYRRVVSL LRGRHVEIVD TYDGGKPAVL SLMTCLAPTV
GPDRIDLADL GSIFVEGAGE IEIDEIVVED ARLRSAWPEK IYRLRLPFAG RLLRLRIV
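
The sequences above are printed in blocks of ten characters, so they need the whitespace removed before any computation. The following is a minimal sketch of how the gene sequence could be cleaned, translated, and checked for GC content. All helper names are hypothetical. The codon table is the standard one; as far as the amino acid assignments go, translation table 11 is assumed to match it, differing only in the permitted start codons (not relevant here, since the gene starts with ATG).

```python
# Sketch only: helper names are illustrative and not part of the source database.
BASES = "TCAG"
# Amino acids in the conventional TCAG ordering of the standard genetic code.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean_sequence(dna)
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean_sequence(dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(dna), 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First line of the gene sequence as printed in this record.
first_line = "ATGTTCAGCG AGATTTCAGG GGAGCTGCCG GATGTGCTGG GCGATTTTAC GCCCGGGGCG"
print(translate(first_line))  # MFSEISGELPDVLGDFTPGA
```

The translated fragment agrees with the first twenty residues of the protein sequence. Passing the full gene sequence to `gc_content` and `translate` would allow the GC content and Protein Length fields to be checked in the same way.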