Gene Information |
Locus tag | Rleg2_4602 |
Symbol | |
ID | 6977696 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011368 |
Strand | + |
Start bp | 237124 |
End bp | 238125 |
Gene Length | 1002 bp |
Protein Length | 333 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 643393778 |
Product | Integrase catalytic region |
Protein accession | YP_002278596 |
Protein GI | 209546678 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG2826] Transposase and inactivated derivatives, IS30 family |
TIGRFAM ID | |
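The coordinate and length fields above are consistent with each other. A minimal sketch of the arithmetic, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length (the function names are hypothetical, introduced only for illustration):

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(237124, 238125)
print(length_bp)                  # 1002
print(protein_length(length_bp))  # 333
```

Both results match the Gene Length and Protein Length fields of this record.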
|
|
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.792581 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTCGAT GCTATTTGCA ATTGACGCTT GCCGACCGAC GCCGTCTGCA CCAGCTTGTC GAACGTAAGG TGCCGGTCAA TGAGATGGCG CGCCGGCTCG GACGACACCG TTCGACGATC TATCGCGAGA TCCGGCGCAA TACGTTTCAT GATCGCGAGC TTCCCGAATA TAGCGGCTAT TACTCGACGG TTGCCAACGA CATCGCCAAA GACCGGCGGC AACGCCTGCG GAAGCTCAGA CGGCATCCGC AATTGCGCGA CCTGGTCATC GACAGATTGC AGGCCTCCTG GTCGCCCGAG CAGATCGCTG GCCGCCTGTG GGCGGACGGT CTCACTCTCG TTCGGATTTG CGCCGAAACG ATCTATCGCT TCGTCTATGG CAAGGAAGAT TACGGGTTGG GCCTCTATCG CTACTTGCCC GAAGCGCGCC GCAAACGCCG TCCTCGCGGC TCCAGGAAGC CACGCGACAG CGTGTTTCCT GGAGCCTACA AGGTATCGCA ACGGCCTGAT TTTATTGAGG ATCGATCGCA ATTCGGCCAT TGGGAGGGCG ACCTGCTGAT CTTCCGACGT GATATGGGTC CGGCCAACCT CACCTCATTG GTCGAACGCA AAAGCCGTTA CACCGTGATG ATCAAGAACC AGAGCCGGCA CTCGCGGCCG ATCATGGACA AGATCATCGA AGCATTCTCT CCTTTGCCGG CATTCGCACG CCAGAGCTTC ACCTTCGATC GCGGCACGGA GTTTGCCGGA TTCAGGGCTT TGGAAGATGG CATCGGCGCA CGCAGCTGGT TCTGCGATCC CAGCGCACCG TGGCAGAAAG GCGCGGTGGA AAACACCAAC AAGCGCATCC GGCGATTCAT GCCAGGCGAG ACGGACCTGA CGGCCGTCTC GCAACATGAC CTGATCCAGC TCGCCCGTCA GCTCAACGAC CAGCCGAGAA AGTGCCTCGG TTATCGAACG CCGGCCGAGG TCTTCCTCAC ACATTTGCAA GACGGGGCAT GA
|
Protein sequence | MSRCYLQLTL ADRRRLHQLV ERKVPVNEMA RRLGRHRSTI YREIRRNTFH DRELPEYSGY YSTVANDIAK DRRQRLRKLR RHPQLRDLVI DRLQASWSPE QIAGRLWADG LTLVRICAET IYRFVYGKED YGLGLYRYLP EARRKRRPRG SRKPRDSVFP GAYKVSQRPD FIEDRSQFGH WEGDLLIFRR DMGPANLTSL VERKSRYTVM IKNQSRHSRP IMDKIIEAFS PLPAFARQSF TFDRGTEFAG FRALEDGIGA RSWFCDPSAP WQKGAVENTN KRIRRFMPGE TDLTAVSQHD LIQLARQLND QPRKCLGYRT PAEVFLTHLQ DGA
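The protein sequence can be re-derived from the gene sequence with translation table 11. The sketch below is an illustration, not the database's own pipeline. It builds the codon table from the standard NCBI amino-acid string for table 11 (bases in T, C, A, G order) and assumes the sequence is on the + strand, starts in frame with ATG, and contains only A/C/G/T. Function names are hypothetical. A GC-fraction helper is included for comparison with the GC content field.

```python
BASES = "TCAG"
# Amino acids for NCBI translation table 11, codons ordered TTT, TTC, TTA, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces that separate the 10-base blocks and upper-case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three 10-base blocks of the gene sequence in this record.
print(translate("ATGTCTCGAT GCTATTTGCA ATTGACGCTT"))  # MSRCYLQLTL
```

Passing the full Gene sequence value to `translate` and `gc_fraction` allows a direct comparison with the Protein sequence and GC content fields.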