Gene Information |
Locus tag | Rleg2_4247 |
Symbol | |
ID | 6983020 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | - |
Start bp | 4422564 |
End bp | 4424267 |
Gene Length | 1704 bp |
Protein Length | 567 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 643398975 |
Product | integrase family protein |
Protein accession | YP_002283732 |
Protein GI | 209551815 |
COG category | |
COG ID | |
TIGRFAM ID | |
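The coordinate and length fields above agree with one another, which the following minimal Python sketch checks. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single untranslated stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 4422564
end_bp = 4424267

# Assumption: coordinates are inclusive at both ends, so add 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)
```

Under those assumptions the computed values match the Gene Length and Protein Length fields.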
Plasmid Coverage information |
Num covering plasmid clones | 39 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGTAATGC GGATCTCCCG ACCCATGAAG CGCGCTGGCA CGAAGAACGA GCAATTTAAG AAGCGCGTGC CCACTGCTCT CCTGCCGGTT TTACGCGGCA AGAAGTTCGC AGTTGACCTC CCTCAGACAG TCGCTCCTGA CAGCCCGACA TATACAGTCA CGGTCACGCT TTCCGACATG ATCACCTTTT CGCTGGGCGC TCCAGCCTCG CGTCTGGCTG CGATCCGTTA CGCCGCAGCC TTAGGCCATG TTGAAGCCTT CCTAGAGGCG GCGCAGCGCG GGCCTGAAGA CCTTTCGCAC TTGCAGATCA CCGCACTGTT AGGAGACGTC CACAAGGCAC TCCTTGCCCA GTATGAGGCC AATCCCCCAG CGCGGCACAA GGTCGTCGTT GATGGTGGCG TCGAAGAGTG GAGCGAGCTT TCAGAGTGGC AGGACATGAT CGCTGACGCC GAGGTGGCCG TCCTAACGAT GACACCGCAG GGCAGAGCCG CCGCTATCTC ACGGGTTAGG CGCATCATCG ACATGGACGC CTTTCTGTCC GCACGCGCAT TGCTGCTGTC GGATAAGTCC TATCGCGATC TGGTCGAAGG ACTTCCGGCA ATCTTAAGGC GGGTCGTGGA AACCCTCTCA AATCGCAGCC AGGGCAACTA CTCGGCCGAC CCTTACGCAC CGCAATACCC CCAGTGGAAA GCCAAGACCG CTGCTAAACA GCGTGCGACT GATGGCGATG TGAAGACCTT TGACGATCTC TTCGACCGCT GGAAGGCGGC AGATAAGCGC GCGGCTTCTA CGCTCTCGAC ATGGCGCGGC TACTTGGCGC GGTTTAAGCA GTTCGTGGGC CATGATGATC CGCACCGGGT CGAGCGTGTC GACGCCCTGC GATGGAAAGA CGCGCTGGTA GCCGAAGGTC TCAAGAAGAT CTCAACAACC TACCTCGCAG CGCTCAACAC GCTTTACCGC TTCGGCCTCA ATAACAGCGA GACGACTGGC ATCACCCGCA ATCCCTTCGA TGGCGTGAAG GCACCGCAGA AGGCTACTGC AGGGACAAAG CGTTTGCCTT TCACCCGCGT TGAGGTCGCG GTGATACTTA ATGCTGGGCG CAAGGAAAAG CTTGCGCACC TCCGTTGGAT ACCGTGGCTC CAAGCCCAGA CAGGATCGCG TGTCGCAGAG ATAGCACAGC TGTGGGCAAC CATGGTTATC ACCGATGATG CCGGTCACCC GTGCCTGCAC ATTACAACGG CACCGGATGG CGGGTCTCTC AAGAACGAAG GATCTGAGAG GGTCGTACCG ATCCACCCTG ATCTCATTGA GGATGGCTTC CTTGAGTTCG TCAAGACACG CGGCAAGGGG CCGTTGTTCT ACGGCGGCAG CAAGGGTAAG GCCGCAGTTC GGTTGCGAGA TGACCAGAAG CACCCGTCGA AGGGTGTCAG CAATCGGGTT GGAACTTGGG TGCGGGGACT GGGCATTACG GACCCGCGCA AGGGGCCAAC GCATTCATTC CGGCATTGGT TTAAGTCAGA GCTTCCGCGC CGGTCGGGGT GCAACATCCG CTTGGTTGAT GCGATCCAAG GCCACTCTGC TGAAAGTGAT GCGGCTGGAT ATCATCATTC GGAAACATCG GAGATGCTTG AGGCTATCTC AAAGCTCGAC CTCAAAGGGC TGGCCGACAG CGTGCCTATG GCCGAGCAAC CGGCAGACGC ATAG
Protein sequence | MVMRISRPMK RAGTKNEQFK KRVPTALLPV LRGKKFAVDL PQTVAPDSPT YTVTVTLSDM ITFSLGAPAS RLAAIRYAAA LGHVEAFLEA AQRGPEDLSH LQITALLGDV HKALLAQYEA NPPARHKVVV DGGVEEWSEL SEWQDMIADA EVAVLTMTPQ GRAAAISRVR RIIDMDAFLS ARALLLSDKS YRDLVEGLPA ILRRVVETLS NRSQGNYSAD PYAPQYPQWK AKTAAKQRAT DGDVKTFDDL FDRWKAADKR AASTLSTWRG YLARFKQFVG HDDPHRVERV DALRWKDALV AEGLKKISTT YLAALNTLYR FGLNNSETTG ITRNPFDGVK APQKATAGTK RLPFTRVEVA VILNAGRKEK LAHLRWIPWL QAQTGSRVAE IAQLWATMVI TDDAGHPCLH ITTAPDGGSL KNEGSERVVP IHPDLIEDGF LEFVKTRGKG PLFYGGSKGK AAVRLRDDQK HPSKGVSNRV GTWVRGLGIT DPRKGPTHSF RHWFKSELPR RSGCNIRLVD AIQGHSAESD AAGYHHSETS EMLEAISKLD LKGLADSVPM AEQPADA
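The sequences above are displayed in blocks of 10 characters. A minimal sketch, assuming only that the spaces are display grouping: the hypothetical helpers `strip_blocks` and `gc_fraction` remove the grouping and compute the G+C fraction, which for the full gene sequence could be compared against the GC content field. The example input is just the first two blocks of the gene sequence.

```python
def strip_blocks(grouped: str) -> str:
    """Remove the whitespace used to display a sequence in blocks."""
    return "".join(grouped.split())


def gc_fraction(dna: str) -> float:
    """Return the fraction of bases that are G or C (case-insensitive)."""
    dna = dna.upper()
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


# First two display blocks of the gene sequence from the record.
first_blocks = "ATGGTAATGC GGATCTCCCG"
dna = strip_blocks(first_blocks)
print(len(dna), round(gc_fraction(dna), 2))
```

Passing the complete Gene sequence value to `strip_blocks` in the same way gives a contiguous string suitable for other tools.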