Gene Information |
Locus tag | Rleg_1405 |
Symbol | |
ID | 8012495 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 1402186 |
End bp | 1403214 |
Gene Length | 1029 bp |
Protein Length | 342 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644823990 |
Product | hypothetical protein |
Protein accession | YP_002975236 |
Protein GI | 241204140 |
COG category | [R] General function prediction only |
COG ID | [COG4188] Predicted dienelactone hydrolase |
TIGRFAM ID | |
|
|
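The coordinates and lengths in the table above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the reported gene length includes the stop codon. Neither convention is stated in the record, although the numbers fit both. The variable names are illustrative only.

```python
# Values copied from the Gene Information table above.
start_bp = 1402186
end_bp = 1403214
gene_length_bp = 1029
protein_length_aa = 342

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the gene length includes the terminal stop codon,
# so the protein has one residue fewer than the number of codons.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp, codons, expected_protein_aa)
```

Under these assumptions the span matches the reported gene length, and the codon count minus the stop codon matches the reported protein length.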
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.301855 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACTGG GCTTTCGCGA CGGCGTCCTC TACGACGAGA AGAGGTCGGA CTGGGACGCC GCCGGACCCA GACCCATCAG TTGGTCCCTC TGGTACCCCG CCGCCGATGA CGCGCGGAAA AGGGATATAC CGGAAAGAAG CTGGTTCCAG AAAGCGGCCG TCGCCCGCGA TGCACCCATC CGGCCGGAGG CCAGGCCCTA TCCCCTTGTC CTGTTGTCGC ATGGCACCGG CGGATCCGCG GCCGGGCTGG AGTGGCTGGC GCGACGCCTG GTCGATCGCG GATTTGCCGC GCTCGGCGTC AGCCATCACG GCAATACCGG CATCGAGCCC TATCGCGCTG AAGGCTTTGC CTGCCTCTGG GAGCGGGCGC CTGATCTCAG CTACATGCTC GACCACCGGG ATGCGTGGCT CAGCGATCTC TCAGGCCATA TCGATACGAA CAGTGTCTTC GCAGCCGGAT TTTCGGCCGG AGCCTATGGC GTGATGCTGC TGCTTGGCGC TATCGCCCAG TTCTCGCAGT TCGAACCATC GAGGATGAAG CCGGGTGGCG CGCGCGGACC GAGAGAATTT CCCGACCTTG CCGATCATAT CCCGGCATTG CTGCGCACCA GCGATGTGTT TCGCGATTCG TGGTCCCGGA TGTCGAAGTC CTACCGAGAT GACAGAATCA GGGCCGCCCT CATCTGCGCG CCGGGTCGGT CCGTTCTCGG TTTCAGCGAG GAAAGCCTGA ACGCTGTCGA TGCGCCCGCC CTTATCCTGG TCGGTGATGC CGACAAGGCA GCACCGGCCG AAGAATGTTC GTCGTGGCTA CATGCGCGGC TGCGGCGCAG CGTCCTTAAA ATCTTCGGCG GCGGCCTTGG GCATTATGTC TTCGTGCCAG AGGGCACGGC GCTCGGCCTT GCCTTTGCGG CAGAACTCTT TACCGATCCC CCGGGCATCG AGCGCGCAGC CGTTCATGAA GAGATTGCCG ATCTGTCGGC AGCGCTGTTT CAAGACAGCG GCATCATCGC GGAAAAGACG ACGAATTGA
|
Protein sequence | MKLGFRDGVL YDEKRSDWDA AGPRPISWSL WYPAADDARK RDIPERSWFQ KAAVARDAPI RPEARPYPLV LLSHGTGGSA AGLEWLARRL VDRGFAALGV SHHGNTGIEP YRAEGFACLW ERAPDLSYML DHRDAWLSDL SGHIDTNSVF AAGFSAGAYG VMLLLGAIAQ FSQFEPSRMK PGGARGPREF PDLADHIPAL LRTSDVFRDS WSRMSKSYRD DRIRAALICA PGRSVLGFSE ESLNAVDAPA LILVGDADKA APAEECSSWL HARLRRSVLK IFGGGLGHYV FVPEGTALGL AFAAELFTDP PGIERAAVHE EIADLSAALF QDSGIIAEKT TN
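The sequences above are grouped into space-separated blocks of 10 characters, so they need cleaning before any computation such as GC content. The following is a minimal sketch of that step. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and how the database itself derives the reported GC content is not stated in the record. The snippet uses only the first two blocks of the gene sequence for illustration, so its result is not the whole-gene value.

```python
def clean_sequence(raw: str) -> str:
    """Remove the whitespace that groups the sequence into blocks."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two 10-base blocks of the gene sequence above, for illustration only.
snippet = clean_sequence("ATGAAACTGG GCTTTCGCGA")
print(len(snippet), gc_fraction(snippet))
```

Passing the full gene sequence string to `clean_sequence` and then to `gc_fraction` would give a whole-gene figure to compare with the GC content listed in the table.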