Gene Information |
Locus tag | Rsph17029_3210 |
Symbol | |
ID | 4898159 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009050 |
Strand | + |
Start bp | 262055 |
End bp | 264403 |
Gene Length | 2349 bp |
Protein Length | 782 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640113809 |
Product | integrase catalytic subunit |
Protein accession | YP_001045079 |
Protein GI | 126463966 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
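The length fields in the table above can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not state this, but it is consistent with the listed 2349 bp), and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical helpers, not part of the source database.

```python
# Hypothetical helpers (not part of the source record) that re-derive the
# length fields from the coordinates, assuming 1-based inclusive positions.

def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end positions are both included."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: one residue per codon, minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


gene_len = gene_length_bp(262055, 264403)
print(gene_len)                     # 2349
print(protein_length_aa(gene_len))  # 782
```

Both values agree with the Gene Length and Protein Length rows: 2349 bp is 783 codons, and dropping the terminal stop codon leaves 782 residues.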
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACCGCG ACGCCATCAA CTGGATCCCG TTCCAGGTCT CCGAGCACTG CAGGCTGACC ATCGGCGGCA AGGCCTTCCG CCTCGCCTAC CGCTCGGGCG ACGCGCTGGT GCTGCGGCCG GCAGAGGGTG AGGGCCCGTC CGAAACCTTC GACATCAGCC GCCTGAGCCG GCTGAATGCC CTGGGCAAGA TCCATCACGA GCGCGAGCAC TTCCTGCCGC CGCACCTGCG GATGCAAGCG GCCCCGCCCC GCTCCGGCCT CACGATGGCG GATCTGACGC CTCGGCACGC CGGGCGGATC AACGAACGCT ATGCGATGAC CATGGCGCTG ATCGATCTGC ATAGCCGCGG CGAGGTCAGC CTCACCTATC CGTCGGTCAC GGCGGCAATG ACGACGATCC GGGTCGAGGC GCTGAAATAC CTCGCGAATC TTCCGGACGC CGCCCGGCTC GAGGAGGTAC ACCGCGCGCT CGCGGCCGGC GGCAAGGTCC CGCTGCCCGC CGGCGTGGCA CTGCCGAAGC ATCGGAGTGC GAAAACCTTG CTCGGCTGGC GCAAGACCTT TCTCGCGGCC GGCACCAAGC TTGCGCTGGC CGACGACATC GACCAGCGCG GCGATCGGTC CACCTCGTTC GGACCCGAGG AAGATGCGCT GCTCCTCGAA ACCGTCCGCG ATAACTACCT TACCCGCGAG CAGAAATCGA AGGTGGCGGT CACCGCCGAC GTCAAGCGGG CCTTCGCCAA GGCCAACGAG GCCCGTCGAG CCGCGGGACT GTCGGACCTG ACGGTGCCGG GCCGCGACGC TGTCCGCGCG ACGATCGCTC GTTTCGACGC GCTGGAGGTC GTGAGGGCCC GCAAGGGCGA GGAGGCCGCG AAGAAGATGT TCCGCACCAC GACCACGGGA CTCGAGGTCA GCCGGCCGCT CGAGCGCGTC GAGATCGACG AGTGCCGGAT CGATCTTCGG ACGATCCTTT CGCGAGAGGG CCTGCTCAAG CTCTTCACGC CGGAGGAGAT CGAGGCCTTC GGCCTCGACA AGAAGAAGAC GCGGTGGTGG GCGGTGATGG CAATCGACTG CCGCACGCGG GTCATCCTCG CACTCAAGCT CACGCCGAAC CCGCGGACAA GTGCTGCCGT CGAATGCGTG CGCATGATCA TGAGCGACAA GGGCAACTTC TCCGACAAGG TGGGGGCGCT GACACGCTGG TCGCAGCGCG GAACCCCGGA GTCGGTGGTG ACGGACGGCG GCTCGGGGCT GACCTCGATC GCGTTCAGCA ATGCCTGCAC CGACCTGCGC ATCACCGATG TCACGGCGAT CGCCGGTGCC GCCTCGGCCC GCGCCCGCAT CGAGCGGCTG TTCCAGACTA TCTCGAAGAA CCTCCTCTGC CGCCTCTCGG GGCGGACCTT CTCGAGCATC GTCAAGCGCG GCGACTACGA CGCCGATGCC CGCGCCTGTC TCGGCACCGA GGAGTTGTGC CAGGTCCTCG TCCGCTGGAT CGTCGACATC TATCACAACA GCTGGCACAG CGGGCTCGGC TGCACGCCGC TCCAGCAGTG GAACGCGGAC ATGGAGGCCG GGAACTTTCC TCTGAAGGCG CTCCCCTCCC TCGAGCGGCA GCGCGTCGCC TTCGGTATTC CGGGCCGCTA CCGGCTGACG AAACAGGGCG TGGTCATCCT GGGCATCGCC TACACCAGCG AGCGGCTCGC CCGCCACTTC GCCGTCGAAG GCGCGATCAT GGTCGAGACG CGTTGGGATC ACGCCGATCT CGGGGCCATC AGCGTGAAGA TCGGCGATGT CTGGGTCGAG 
GTACCGGCGG TGCACGACCG GTTCCAGAGT GTGTCGGCGC AGGTCTGGCT CGCCGCGCGG AAGGCCCTGC GCGCCGAGGC CGAGGCGCGG AAGGCCTGGT CGGAGGCCGT CATCTTCAAG GCGATCGACT ACATTGAGGA AACCAACGCC AGGGCCAAGC TTCATCACAG AATCCTCGAC CAGGCCTGGA CGCCGGAGCG GCTGCGAGCC TTCGAGGAAG AACTGTTCCC GACGGGCTTC CGCATCACCG CAGATACGCC GGCCACGCGG GCGGCTGCCG AGGGAATCGG CCGCTCGATC GTGCCCGCAG CGCCGATGGA TGCCTGCGAT GACGCCTTCG AGACCACCGA GGCATCCGGC TCTGTCGAGC CATTAAGGGC CCATACTCGT CTCACCGCCG CGCCCGCGCC GGAGGGTCCG AGCCGCCGGT CCCGGCGCCG GGCGGCAGGC GCCTCCTTGG AGGGCACCCG CCCGACGGCG GAGCCGCGCT CCGACACTGG TGACGCGACT GATGAGGAGC CGGCCTGCTT CTGGAAGACA TCCGACTGA
|
Protein sequence | MNRDAINWIP FQVSEHCRLT IGGKAFRLAY RSGDALVLRP AEGEGPSETF DISRLSRLNA LGKIHHEREH FLPPHLRMQA APPRSGLTMA DLTPRHAGRI NERYAMTMAL IDLHSRGEVS LTYPSVTAAM TTIRVEALKY LANLPDAARL EEVHRALAAG GKVPLPAGVA LPKHRSAKTL LGWRKTFLAA GTKLALADDI DQRGDRSTSF GPEEDALLLE TVRDNYLTRE QKSKVAVTAD VKRAFAKANE ARRAAGLSDL TVPGRDAVRA TIARFDALEV VRARKGEEAA KKMFRTTTTG LEVSRPLERV EIDECRIDLR TILSREGLLK LFTPEEIEAF GLDKKKTRWW AVMAIDCRTR VILALKLTPN PRTSAAVECV RMIMSDKGNF SDKVGALTRW SQRGTPESVV TDGGSGLTSI AFSNACTDLR ITDVTAIAGA ASARARIERL FQTISKNLLC RLSGRTFSSI VKRGDYDADA RACLGTEELC QVLVRWIVDI YHNSWHSGLG CTPLQQWNAD MEAGNFPLKA LPSLERQRVA FGIPGRYRLT KQGVVILGIA YTSERLARHF AVEGAIMVET RWDHADLGAI SVKIGDVWVE VPAVHDRFQS VSAQVWLAAR KALRAEAEAR KAWSEAVIFK AIDYIEETNA RAKLHHRILD QAWTPERLRA FEEELFPTGF RITADTPATR AAAEGIGRSI VPAAPMDACD DAFETTEASG SVEPLRAHTR LTAAPAPEGP SRRSRRRAAG ASLEGTRPTA EPRSDTGDAT DEEPACFWKT SD
|
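The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any computation. The following is a minimal sketch, not the database's own tooling: `clean`, `gc_content`, and `translate` are hypothetical names. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard table. Alternative start codons are not handled, which is adequate here because this gene begins with ATG.

```python
from itertools import product

# NCBI-style amino-acid string for codons enumerated in T, C, A, G order at
# each position. Translation table 11 uses the same codon-to-residue mapping
# as the standard table; "*" marks a stop codon.
_AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), _AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the record."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1 up to, but not including, the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence, and its final nine bases.
print(translate("ATGAACCGCG ACGCCATCAA CTGGATCCCG"))  # MNRDAINWIP
print(translate("TCCGACTGA"))                          # SD
```

The two translated fragments match the first ten and last two residues of the protein sequence in the record. Passing the full gene sequence to `gc_content` is one way to check the GC content row.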