Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_3227 |
Symbol | |
ID | 8014120 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 3230261 |
End bp | 3231874 |
Gene Length | 1614 bp |
Protein Length | 537 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 644825788 |
Product | Carbohydrate-binding and sugar hydrolysis |
Protein accession | YP_002977015 |
Protein GI | 241205919 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.751454 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACCGTCT ACTATGTAAA TTCGGCAACA GGATCCAATC AGAACAGTGG AGTGAGCGAG CAGTCGGCTT TCGCGACGCT TTCCGCTGTC GAATCCCTGA GGCTAAAGCC GGGAGACAGC GTGCTGCTTG CCGCCGGAAG CGTGTTTAAC GAGCAATTCG ACCTGAAATA CTCCGGCACT GTCAGTTCTC CGATTACAAT TGGCAGTTAC GGCGTTGGCG ATGCGCCGGT TATTCACAGC AGCAATGACG GTATCCACGG CTCGAAGGCT TCAAACATCA TCGTGGAGAA CATCAAGATT GCTGATACCG GGGGTGCGGC GATATATGCG GGAAATGTCT CCAACTGGAC CGTCCGCAAC GTCGAGGTCG AAAATACCGG ACTTGCCGGC AAGCCCGGTT CCGTCAATTT CCAGAGCAGC CAGAACATTA CCATCGAGAA CAGCAAGATT TCCGGGGTAA ATGGAGACGG CATCTGGATG GATAAAGTGA TTGGCGTTAC CATCGTTAAT AATCTGGTCA TCAACAGCCA GGGCGCTGCG GCAGACGCCG TTCAGTTGAA CGACAGCAGC AATATCCTGA TCAAGGGTAA CCACCTAGAG CAGACGGAGA CCAATAGCGC AAAGGGCGTC CTGGTGCTTG TTCGGGCTGT GGATGCAGCG GTCGAGGACA ACACCGTGAT CGGCGGCGGT TTCGGGATAG GTGCCAACGC GGGCACGAAT ATCGCCATCC ACGACAACGA CATCTCGGGA TACGGCGGCT ATAGTTGGTC CTACGGTATC GGCCTTGGCG ATCAGGGCAA TGCGACAAAT TACGATATCA GTGGAAACTA TATCCATGAC GGCGTCTGGG GCGTGTCGAT CAGCGCTGCA GGTTACCCAA GCTATACACG CACCGATATC GATATTTATG GTAATGTCTT TGACGACCTG TCGTCCTCGG CGCTGAAGGT CGACAGGCCT GCATCCGGCT CTTTCTATGA TAATATCATC GACAGCGCGG TGTCGACGTT GACGATGCCG GTCGCAATTG TCCTCCAGAG CACCTTCTCG ATCAACGACA ATAAGACGCT GGAGCAAGCG CAGGCCGAAG TCGACGCTGC GACTGGCAAC ACCCAGTCCA ACAATGAGCA GACGCCTACC ACGCCGGTTG TGGAGCCGTA TGTCGAACCG TCAACACCGA CGCAGACAAG CACTCCGGCG CCCGCGGCCG AAGCGCAGGT TCCGACCGCA CCCACCGTCG CCGTTCCCAG GATTGTCGCA GCCCATGACA GCCTGAAAAT CTCCACGGAC ACGGGCAGCG CCTATCATGG CAACCTTCTC GAAAACGATA GCGCAATCAA TGGAACCGTG CTGCTTCGTC GCTTCGGTGA CAGTGCAGTG GATAAGCATG GCCTGACGCT GACCGGGAAA TACGGCGTCA TCCACGTGGA GAGCGATGGT GATTACACCT ATACCGTCGA TGCGGTGAAA ATCGCCGGTC TCAGCGGAAA GGTCAGTGAA TCCTTCCAAT ACAAGATTTC TGACGGCGCT TCTCACATCG ATACCGACTC GCTCGGCGTG TATATCAATG TGGACGCGTT CCATAGCAGC CAAGCGTCGC ACCTGCTGGT TTGA
|
Protein sequence | MTVYYVNSAT GSNQNSGVSE QSAFATLSAV ESLRLKPGDS VLLAAGSVFN EQFDLKYSGT VSSPITIGSY GVGDAPVIHS SNDGIHGSKA SNIIVENIKI ADTGGAAIYA GNVSNWTVRN VEVENTGLAG KPGSVNFQSS QNITIENSKI SGVNGDGIWM DKVIGVTIVN NLVINSQGAA ADAVQLNDSS NILIKGNHLE QTETNSAKGV LVLVRAVDAA VEDNTVIGGG FGIGANAGTN IAIHDNDISG YGGYSWSYGI GLGDQGNATN YDISGNYIHD GVWGVSISAA GYPSYTRTDI DIYGNVFDDL SSSALKVDRP ASGSFYDNII DSAVSTLTMP VAIVLQSTFS INDNKTLEQA QAEVDAATGN TQSNNEQTPT TPVVEPYVEP STPTQTSTPA PAAEAQVPTA PTVAVPRIVA AHDSLKISTD TGSAYHGNLL ENDSAINGTV LLRRFGDSAV DKHGLTLTGK YGVIHVESDG DYTYTVDAVK IAGLSGKVSE SFQYKISDGA SHIDTDSLGV YINVDAFHSS QASHLLV
|
| |