Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17029_0177 |
Symbol | |
ID | 4898049 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009049 |
Strand | + |
Start bp | 191627 |
End bp | 194620 |
Gene Length | 2994 bp |
Protein Length | 997 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 640110760 |
Product | putative helicase/exonuclease |
Protein accession | YP_001042068 |
Protein GI | 126460954 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3893] Inactivated superfamily I helicase |
TIGRFAM ID | [TIGR02786] double-strand break repair protein AddB, alphaproteobacterial type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.169187 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTGACA CTCCGGGCCC CCGCCTCTTC GGCCTGCCGC CGGGCGTGGA CTTTCCCGCG GCACTGGTGC GTGGCCTGCG CGCGCGCATG GCAGGGGCCG CGCCCGAGGC CATGGCGCGG GTCGAGCTTT ACGTCAACAC GCAGCGGATG CGGCGGCGGA TCACCGAGCT GATGACGGCC GAGGGCACGG GGTTCCTGCC GCGCATCCGC CTCGTGACCG AGCTGCCGCC GGTGCCGGGC CTGCCCGCCC CGGTCCCGCC GCTGCGCCGC AGGCTCGAGC TTGCGCAGCT CGTGGCCCGG CTGATCGAGG CCCAGCCCGA CATCGCGCCG CGCTCGGCGC TCTTCGATCT GTCGGACAGC CTCGCCGAGC TTATCGACGA GATGCAGGGC GAGGGCGTGC CGCCCGAGGC CATCGCGCGG CTCGATGTGG CCGACCATTC GGCGCACTGG CAGCGGACGC AGGCCTTCAT GGCGATCGTG GCGCCGATGT TCGGCGCGGA CGCGCCCGAT GCGCAGGCGC TGGCGCGGAT GGCGGTCGAA CGGATCGCGG CGCGCTGGGC CGAGGCGCCG CCCGATCATC CGGTGATCGT CGCGGGCTCG ACCGGTTCGC GCGGCACGAC CAGCCTCTTC ATGCAGGCGG TGGCGCGGCT GCCGCAGGGC GCGCTCGTGC TGCCCGGCTT CGACTTCGAC CTGCCCGCCG CGGTCTGGGA GGGGATGGAC GACGCGCTGA CCGCCGAGGA TCATCCGCAG TTCCGCTTTC ACCGGCTGAT GGGCCTCGTG GGCGCAGGCC CCGCCGAGGT CGGCCGCTGG ACCGACGAGA TCCCGCCCAG CCCGGACCGA AACCGGCTGA TCTCGCTGTC GTTGCGGCCC GCGCCCGTCA CCGACCAGTG GCTGACCGAG GGCCGGCACC TCACCGGACT TGCCGAGGCC GCGCGCGGCA TGACGCTCAT CGAGGCGCCG GGGCCGCGGG CCGAGGCGCT GGCGGTGGCG ATGATCCTGA GGAAGGCCGC CGAGGAGGGA CGCCGCGCGG CGCTCATCAC CTCGGACCGG GGGCTCACGC GGCAGGTGGC GGCGGCGCTC GACCGCTGGG GGATCGTGCC GGACGATTCC GCGGGCAAGC CGCTCGCGCT CTCGGCGCCG GGGCGCTTCC TGCGCCATGT GGCCCGGCTC TTCGGCCGGC GGCTGACCGG GGAGACGCTG CTCACGCTGC TGAAACATCC GCTGACGGCG ACCGGCGCGG ACCGCGGCAA CCACCTGCGC TGGACGCGGG ATCTGGAGCT GCACCTGCGG CGGAAGGGGC CGCCCTTCCC GGCGGCGGCC GATCTGGCGC TCTGGGCCGG AACACGGCCT GCGGACGGGG TGGCGGACTG GGCGCGCTGG CTGGGCGGAC TGATCGAGGG GCTGGACGCG GCAGGCCCGC GCCCGCTCGC CGATCATGTG GCCGCGCATC TCGCGCTGGC CGAGGCGCTG GCCACGGGGC CCGCGGGCGA CACCACAGGC GAGCTCTGGC TGAAGGAGGC CGGCGAGGCC GCGCGCGCCG CCGTCGAGGA GCTCCGCCGC GAGGCGCCGC ACGGGGGCGA GCTGACGCCG GCCGATTACA CCGACCTCTT CGACGCGATC CTCGCGCGTG GCGAGGTGCG CGAGGCGGTG CAGGCCCGGC CCGATCTGAT GATCTGGGGC ACGCTCGAGG CGCGCGTGCA GGGGGCGGAT CTCGTGATCC TGGGCGGGCT GAACGACGGG ACATGGCCGC AGCTGCCGCC GCCCGACCCG TGGCTCAACC GGCAGATGCG GCTGAAGGCC GGGCTTCTCC TGCCTGAGCG GCGGATCGGC CTGTCGGCGC ACGATTACAG TCAGGCGGTG GCCGCGCCCG AGGTGGTGCT GACCCGCGCC ACCCGAGATG CCGAGGCCGA GACGGTGCCC TCGCGCTGGC TGAACCGCCT GATGAACCTG ATGAGCGGGC TGAAGGCGGG CGGAGGGCCC GAGGCTCTGG CCGGGATGCG CGCGCGCGGC CGCGACTGGC TGGACCTCGC CGCGGCGCTG GAGCAACCCG CGACGCCGGT GCCGCTCGCC ACGCGGCCCG CGCCGCAGCC CCCCGTCCCG GCCCGACCCG AGCGGCTGGC CGTGACGGGC ATCCGCACGC TGATCCGGGA TCCCTATGCT GTCTATGCCC GCCACATCCT GCGGCTCTAT CCGCTCGACC CGCTGCACCG GGCGCCGGAT GCGCGGCTGC GCGGCTCGAT CCTGCACCGC ATCCTCGAGG CCTTCGTGAA GGACCGCGCG CCGGGGGGCG ACCGCCCCGC CGAGCGCGCG CGCCTGATGC GGATCGCGGA GGCGGTGCTG GCCGAGGAGG TGCCCTGGCC CGCGGCCCGG GCGCTCTGGC TCGCGCGGCT CGACCGGGCC GCCGACTTCT TCCTCGAGAC CGAGGCCGCC CATGCCGGCA CCCCGGTCGT GCTGGAGGAG GAGGGGCGCG TGGATCTGAC CCCCCTGCGC TTTACGCTGA CGGCCAAGCC CGACCGGATC GACGTCCTGC CGGACGGGCG GCTCCATATC CTCGACTACA AGACCGGCAC GCCGCCTACG AAGAAGCAGC AGGAGCAGTT CGACAAGCAG CTCCTCCTCG AGGCGGCGAT GGCCGAACGC GGCGGCTTCC GCGGCCTCGG ACCCGCCGAG GTGGCGCGGA TCAGCTACAT CGGCCTCGGC ACCAGCCCCA AGGTCGAGAG TGTCGAGACC GATGCGGCGC TCCTCGGGCA GGTCTGGGAG GGGCTTCACG CCCTCATCGG CCGCTACATG CGGCGTGAGC AGGGCTATGT CTCGCGCCGC GCCATGTTCG GCGAACGCTT CCCGGGCGAT TACGACCATC TCGCGCGGTT CGGCGAGTGG GAGATGAGCG ACGCGCCCGT GCCCGCGCCG GTGGGGGCCG AGGCGCCGGG TGCAGGCTCC GCGGAGGCGG GGACGGATCG CGACAGGGGA AGCTGCCCGG AGGATGCCGC GTGA
|
Protein sequence | MFDTPGPRLF GLPPGVDFPA ALVRGLRARM AGAAPEAMAR VELYVNTQRM RRRITELMTA EGTGFLPRIR LVTELPPVPG LPAPVPPLRR RLELAQLVAR LIEAQPDIAP RSALFDLSDS LAELIDEMQG EGVPPEAIAR LDVADHSAHW QRTQAFMAIV APMFGADAPD AQALARMAVE RIAARWAEAP PDHPVIVAGS TGSRGTTSLF MQAVARLPQG ALVLPGFDFD LPAAVWEGMD DALTAEDHPQ FRFHRLMGLV GAGPAEVGRW TDEIPPSPDR NRLISLSLRP APVTDQWLTE GRHLTGLAEA ARGMTLIEAP GPRAEALAVA MILRKAAEEG RRAALITSDR GLTRQVAAAL DRWGIVPDDS AGKPLALSAP GRFLRHVARL FGRRLTGETL LTLLKHPLTA TGADRGNHLR WTRDLELHLR RKGPPFPAAA DLALWAGTRP ADGVADWARW LGGLIEGLDA AGPRPLADHV AAHLALAEAL ATGPAGDTTG ELWLKEAGEA ARAAVEELRR EAPHGGELTP ADYTDLFDAI LARGEVREAV QARPDLMIWG TLEARVQGAD LVILGGLNDG TWPQLPPPDP WLNRQMRLKA GLLLPERRIG LSAHDYSQAV AAPEVVLTRA TRDAEAETVP SRWLNRLMNL MSGLKAGGGP EALAGMRARG RDWLDLAAAL EQPATPVPLA TRPAPQPPVP ARPERLAVTG IRTLIRDPYA VYARHILRLY PLDPLHRAPD ARLRGSILHR ILEAFVKDRA PGGDRPAERA RLMRIAEAVL AEEVPWPAAR ALWLARLDRA ADFFLETEAA HAGTPVVLEE EGRVDLTPLR FTLTAKPDRI DVLPDGRLHI LDYKTGTPPT KKQQEQFDKQ LLLEAAMAER GGFRGLGPAE VARISYIGLG TSPKVESVET DAALLGQVWE GLHALIGRYM RREQGYVSRR AMFGERFPGD YDHLARFGEW EMSDAPVPAP VGAEAPGAGS AEAGTDRDRG SCPEDAA
|
| |