Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_5054 |
Symbol | |
ID | 8007647 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012848 |
Strand | + |
Start bp | 437508 |
End bp | 439304 |
Gene Length | 1797 bp |
Protein Length | 598 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644821969 |
Product | Heparinase II/III family protein |
Protein accession | YP_002973229 |
Protein GI | 241113394 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.124625 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTCAGCG AGATTTCAGG GGAGCTGCCG GATGTGCTGG GCGATTTTAC GCCCGGGGCG GTCGGCTCCG ATCGCCTCAG ATGGAGCACT GTGCCGCAAG CGCTACGGGA ACTGGTCATT GGCGAGGCCG AAGAGACGGT TGCGCGCGCC TGGCCGCTGA TTGCGGCCTC GGATTACCGG GAGTTCACCG AAACCGGCAA CCGCGCGCGC TTCGAGGAGC TTTATTTCAC ACGCCGCCGG ATGCTCAACA ATCTGGTGCT CGGCGAACTC GTCGAAGGCG GCGCGCGCTT CCTGCGGAAG ATCGTCGACG GCATCTTCCT GATCGTGGAG GAGAGCGGCT GGCAGCTGCC GGCGCATAAC GCCTATGAAC GAAGCGGCGC GCGCCTGCCG CTTCCCGACA ATTCGCAGCC TGTCGTCGAT CTCTTCGCCG CCGAAACAGC CGCCCTGCTT GCCACCGTCG TCGCATTGTT TCGCGACGAG CTCGACGCCA TCAGCCCCGA GATTACGGCA CGCGTCGAGC GCGAGATCGA GATCCGCATC CTCTCGCCCT ATCTCGGCCG GCATTTCTGG TGGATGGGCC GCGGCGAAGA GCGGATGAAC AACTGGACGG CATGGATAAG CCAGAACGTC CTGTTGACGG TTTTCTCCCT GAAAACAGAT CAGCCCACGC GTCACGCCGT CGTCAAAAAC GCGCTCGGCA GCCTCGACGC CTTCCTGAAG GACTATGCCG AGGACGGCGC CTGCGAGGAG GGCGTGCTCT ATTACCGCCA CGCCGCGCTC TGCCTGCATG GTGCGTTGAC CATCCTGGAC GCCGTGGCGG CCGGCCTGTT CGCCGGGGTC TGGCAGCAGC CGAAGATCCG CAACATGGCC GAATATATTG CCCATATGCA TGTGGCCGGC CGCTACTATT TCAATTTCGC GGATTCCTCC GCGGTGGTCG AACCCTGCAG CGCGCGGGAA TACCTGTTCG GACAGGCGGT CGGCTCTAAG ATGCTGGCTG AGTTCGCCGC AGCCGACAGA GCCGCTTCCA ACAATTCGCA TATGCCCGGG GAATGGAACC TCTGGTATCG CGTGCAGGAA CTGCTGGCCG GCCCGACGCT TCCCGCTGCC GCCCCGCCGC ATCCTGCATC TCAGCGCGAT ATCTTCTATC CCGGCATCGG CCTGTTCATC GCCCGCGACG AGCAGTTTTC GCTTGCCGTC AAGGGCGGTA ACAATGGCGA GGGCCACAAT CACAACGATG TCGGGAGCGT GACGCTCTAC AAGAGGGGAC GTCCGTTCCT GATCGATGTC GGCGTCGAGA CCTATACCGC AAAAACCTTT TCGGCGCGGC GCTACGAGAT ATGGACGATG CAGTCGGCGT TCCACAATCT GCCGACATTT GCAGGCGTCA TGCAGTCGGC CGGCGAAGCT TTTGGCGCGC GCGATGTCGA GGTCGGGTTT GACGAAGGGA GCGCGCGCAT AGCGCTCGAT ATTTCAGACG CCTATCCCCC CGAAGCGCAG TTGCACAGCT ATCGGCGCGT CGTTTCCCTG CTGCGCGGCC GCCATGTCGA GATCGTCGAC ACCTATGACG GCGGCAAGCC CGCGGTCCTG TCGCTGATGA CATGCCTGGC GCCGACCGTC GGCCCGGACA GGATCGATCT CGCCGATCTC GGCAGCATTT TCGTCGAGGG CGCCGGCGAG ATCGAAATCG ACGAGATCGT CGTGGAGGAC GCCCGGCTGA GATCGGCCTG GCCCGAGAAA ATCTACCGGT TGCGCCTGCC GTTTGCCGGC AGGCTGCTGA GATTGCGGAT CGTCTAG
|
Protein sequence | MFSEISGELP DVLGDFTPGA VGSDRLRWST VPQALRELVI GEAEETVARA WPLIAASDYR EFTETGNRAR FEELYFTRRR MLNNLVLGEL VEGGARFLRK IVDGIFLIVE ESGWQLPAHN AYERSGARLP LPDNSQPVVD LFAAETAALL ATVVALFRDE LDAISPEITA RVEREIEIRI LSPYLGRHFW WMGRGEERMN NWTAWISQNV LLTVFSLKTD QPTRHAVVKN ALGSLDAFLK DYAEDGACEE GVLYYRHAAL CLHGALTILD AVAAGLFAGV WQQPKIRNMA EYIAHMHVAG RYYFNFADSS AVVEPCSARE YLFGQAVGSK MLAEFAAADR AASNNSHMPG EWNLWYRVQE LLAGPTLPAA APPHPASQRD IFYPGIGLFI ARDEQFSLAV KGGNNGEGHN HNDVGSVTLY KRGRPFLIDV GVETYTAKTF SARRYEIWTM QSAFHNLPTF AGVMQSAGEA FGARDVEVGF DEGSARIALD ISDAYPPEAQ LHSYRRVVSL LRGRHVEIVD TYDGGKPAVL SLMTCLAPTV GPDRIDLADL GSIFVEGAGE IEIDEIVVED ARLRSAWPEK IYRLRLPFAG RLLRLRIV
|
| |