Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17025_0725 |
Symbol | |
ID | 5084084 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17025 |
Kingdom | Bacteria |
Replicon accession | NC_009428 |
Strand | - |
Start bp | 734889 |
End bp | 737888 |
Gene Length | 3000 bp |
Protein Length | 999 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640482283 |
Product | phage integrase family protein |
Protein accession | YP_001166936 |
Protein GI | 146276777 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.710298 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATTCCG TTGAAACCCG AGACAACGAT TGGATCATCG ATCACGTGAC GGAAATCCAT GGCGTTAGGG TCAACACAGA ACCAAGCCGG CTTGCGCTGC TGTCCCATAG CACGAGCGAA TGGCAGAGGG CCACCCGAGA AGCTTTGAAA TTTTCTCCTG TTATGGGAGA TGAGACCGCT AAGCTCGATC TAATCGCAAC CGCCACCAAC ATGGCGACGG TCAGGCAAAT ACTGAACGAA TGCCTTCTCG AAGTCGCTCA GCAAGGCGGC GGAAGTGAGA GCACTCGCGC CGCGCTGCAA GTGCTCTTCA TGATCACGGT GCCGAAGCGT ATGGTCCTCT TTCCTTTTCG CCCAGCCACG AACATAAAGT GGCTCAATAA TCATCTCTCT ATGCCGCCGA GTATGCGGGC TGAGCTTGGG CTGCCGGCGC ATGACGCCTG CCCGAAGTAC TTCAACGCAG CGGCAGCGAT ATTGTATTGC GCAACCGACT TGCACGGCCC CGAAGATTTC ACTGACGAGT TTCTTCGAGA AGTCTACCAG AAGGTCATAT TCTTTCTGAA GCCCAACTAC ATGGCCGCAC TTGGTAACCT TCTGCGGAAG ATTGCCGGCG CGAGCAAGGT TGAAAGCGCC ATGCGCGAAG CGCGTCGTCT AAGACCCAGC CCGCGCTTCG CGCGGCAGCG CCCAGATCAG CCTTGGATCA TTGACCCCGA AACTGGCGAT CCGCGATTCT CAGAATGGCA AACGGCGATA GCGGATTGGC TTCAAAATCT GACACACAAG AGTTCGCGAA AATCTGAAAG CGCCGCCAAA CGCTTCGTTG CCTATATAAG GCAATGCGAC GACCCTCCCC TTTCCCCCCA ACTTCTTCGT CGCTGCCACG TCGTTAAAAC ATCGGAAACT CGCGAAGTTC CGTTCATTGA CTGGCTGATG ATTGAGAAGA GCATCAACGC CGCGCGTGAA TCCTGCGGTC GTCTGCGGGA TCTGATGGCG CATTGGCATG ACACTCATCC CATTCTGGGG AGCGATGAAG TGTGGAATAA TCCTTTCCAA ACCTCCGATC TAGAAGGTCT GCGCGGTTCC AGCACCCACA ACAAATCCGT CAAATACGTC TTGCCTCAGC AGATTATCGA AATCGCAAAG CAGGTCCTGC TTGAAGACGA CTACGCCTGG CCGCGCCGGC AGAAAATGTG TCGCGCCCGC GCTCTTGGCC GTCCTGATCT GTTCTCGCCG GTCCTTCCCG TCGCTATCTT CACTCTCCTA GTCCTGCCGC TCCGATCCAT TCAGTTGCGC CTGCTCGATT CCGGAGAGGC TGACGAACTG CAACTCGACG AACGCTTGGA GCGCGCGGTC AACACGTCAA CCCACGCGAT CCGCGGCCGA CAGGAGGGTT TCCTCAAACG CTTCGAACCC TTTGGAGGTC CCGGCAGCGA ATTCGTGGGC TTTCACATCA CGACCAACAA GACCGACGCC GCGCAAAAAG GGGAGGTTGA GATCCCGTTC GACATCCCAT GGCAGGCGGC AGAACTTATA CCGCACATAA GGCGCCTCCA GCAATGGCAG GCCGAAGTAA AACCACTCGA CCGGTTGCGA ACTCGGAAGG AGCTGCACGA GAGGGAAACG CGTATTGATG GAAAGATCGC AAATCTACCG AAATACGCGT TCCTGTTCCG TGATCCGGGA GGGCAATATC CAAAGGAACC GGTGACCTAC AATAAGTTGG CGAACTTCTG GGCGTGCGTT CTGGAAGAAG TTGAGAACCG CCTTCGGCGG GCGGGCACGC CGGTCAGACT TATCGAGGGC CGGACCCGAC AAGGAAAATC GGCGCAACCT GTCGCGATCT ACACGTTGCA TGCTATGCGA GTGAGCGGCA TTACCTATTT TATCGAGTCA GGAGTACCGA TACATATCGT ATCTGAATTT CTGGCCGGTC ACGCCAACAT CATCATTACT CTCTACTACA CGAAGATCGC GCCGGCCAAA GTCAACGAAA TTATTGAACG CGCAGCAGAT CAAAGCCGAG AACTCAGTGA AAGGGGCTAC TTCGCGGCGC TTTCCGAGTT GTCCAAGGCC GAATTAAAAG GGCAACTCGT GGTGTCCGAT GGGGCTGCGG CCCACGTCAG CGGCGGCGAT CCTGGAGTTT GGCATCTCGA CGTTGATGGG TTCTGCGTCT GCGGTCGCAC GCAATGCGCA GAGGGAGGAG CAAAGCTGAA GAATGCACAG GGAAAAGAGT ATTTTGAAGG CATAACTTTC GACAGGTTCA ACTGCGGCGC GTGTCGGTTT CACGCGACCG GACCCGCCTT CCTTGCCGGC CAGGTTCTAG TCTTTAATAG CTTGCTCCAC GAGATTCATC TCCTCGCGAA GAAGCGTGAC GATTTGCAAA ATCGGTTCGA CTTGCGGCAG GATGACTGCG TCAGGGCAAA ATCCGACATG ATCCACCGCC TGGAAAGGAT CGATCAGGAA ATGGACAGCA AGGTACGTGT CCTTGCGTCT CGCTTCACGA GGTTGCAGCA ATCCTTGGAA CTTCATGAGC GCAAACGCGC GGGCGGGAAA CAGGCCCTCA TCACCGGACT TGACCGCCGC GAGGTCTCCG TGGCAATCGA GAGCGCGCGG GACTCGGACT TGCTGGATTT CGCGGCCCAT GTGGCAGAAT TCTTTCCGGA ACTCACAGAC GACTCGGCGC GGATGAGGAA GGGGCTGCTT CTCGAACGCA TGGCGAGCCG CGATGGATTG GGCTCTCTGT TCTTACATTT ACCCGACGCC ATCGCGCTTT CTGCGGCCAA TCGTTTTACT GAACTCGTCA AAGACTTGAT TGGCAGTAAT GCACTTACAA ACTTGATGGA GGGCGCGGCC ACATTTCGAG AACTGGGTTT AATCGACCAA GTACCACGAC TCAAGGATAG GGCGATAGAA CTTGCTCGAG AGGCTCATCG CAAGCTGGCC GACTCGGAGG GTCTGGCTCA GATCGAGGCC CGTCGCGACA GCACCCAAGT TCGCGGCTGA
|
Protein sequence | MDSVETRDND WIIDHVTEIH GVRVNTEPSR LALLSHSTSE WQRATREALK FSPVMGDETA KLDLIATATN MATVRQILNE CLLEVAQQGG GSESTRAALQ VLFMITVPKR MVLFPFRPAT NIKWLNNHLS MPPSMRAELG LPAHDACPKY FNAAAAILYC ATDLHGPEDF TDEFLREVYQ KVIFFLKPNY MAALGNLLRK IAGASKVESA MREARRLRPS PRFARQRPDQ PWIIDPETGD PRFSEWQTAI ADWLQNLTHK SSRKSESAAK RFVAYIRQCD DPPLSPQLLR RCHVVKTSET REVPFIDWLM IEKSINAARE SCGRLRDLMA HWHDTHPILG SDEVWNNPFQ TSDLEGLRGS STHNKSVKYV LPQQIIEIAK QVLLEDDYAW PRRQKMCRAR ALGRPDLFSP VLPVAIFTLL VLPLRSIQLR LLDSGEADEL QLDERLERAV NTSTHAIRGR QEGFLKRFEP FGGPGSEFVG FHITTNKTDA AQKGEVEIPF DIPWQAAELI PHIRRLQQWQ AEVKPLDRLR TRKELHERET RIDGKIANLP KYAFLFRDPG GQYPKEPVTY NKLANFWACV LEEVENRLRR AGTPVRLIEG RTRQGKSAQP VAIYTLHAMR VSGITYFIES GVPIHIVSEF LAGHANIIIT LYYTKIAPAK VNEIIERAAD QSRELSERGY FAALSELSKA ELKGQLVVSD GAAAHVSGGD PGVWHLDVDG FCVCGRTQCA EGGAKLKNAQ GKEYFEGITF DRFNCGACRF HATGPAFLAG QVLVFNSLLH EIHLLAKKRD DLQNRFDLRQ DDCVRAKSDM IHRLERIDQE MDSKVRVLAS RFTRLQQSLE LHERKRAGGK QALITGLDRR EVSVAIESAR DSDLLDFAAH VAEFFPELTD DSARMRKGLL LERMASRDGL GSLFLHLPDA IALSAANRFT ELVKDLIGSN ALTNLMEGAA TFRELGLIDQ VPRLKDRAIE LAREAHRKLA DSEGLAQIEA RRDSTQVRG
|
| |