Gene Information |
Locus tag | Rsph17029_3540 |
Symbol | |
ID | 4899099 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009050 |
Strand | + |
Start bp | 624712 |
End bp | 627891 |
Gene Length | 3180 bp |
Protein Length | 1059 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 640114149 |
Product | DEAD/DEAH box helicase domain-containing protein |
Protein accession | YP_001045403 |
Protein GI | 126464290 |
COG category | [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG1197] Transcription-repair coupling factor (superfamily II helicase) |
TIGRFAM ID | [TIGR00580] transcription-repair coupling factor (mfd) |
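The coordinate and length fields above agree with each other if the coordinates are read as 1-based and inclusive, and if the final codon is a stop codon that is not counted in the protein. Both are assumptions inferred from the values, not stated in the record. A minimal Python sketch of that arithmetic follows; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp / End bp rows of this record.
length = gene_length_bp(624712, 627891)
print(length)                     # 3180
print(protein_length_aa(length))  # 1059
```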
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.468941 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACCAC CGATCGGCCT CGAGGTCCTG CGGCTTCTGG ATCTCGGCAT CTCGAAGGCA GCACACCTGC ACATTACCTC GGACGATGCC CGCGCGGAGG AGATCGTCCG GTTCCTGCGG GAGATCGCGC CCGAACTCCG GCCGGATGTC TTTCCGTGCT GGGACTGCCT TCCCTACGAT GGCGCCTCGC CCACGCCCGA TGCGATGGGG CGTCGCATGG CGCTGCTCGA TGCGGTGCGG GCAGGCGACG TCCGGCTCGT CATCGTGGGT CCGTCGGGTC TCCTGCAGCG AGTCCCACCG GTCGAGGCGA TGCGCCGCTT CGCGGTGCGC GCGGGCGAGG CCTCGGATCT CTGCGCCCTG CGCACCTTCG CCGCCAGAGC GGGATATGCG GAGGACGACC GTATCGACGA GCCGGGAGAG ATCGCGTTCC GCGCCGAGGT GGTCGAGGTG TTTGCGGCGG GTGCGGAGCG ACCGGTGCGG ATCGGCCTCG AGAAAGACCG CGTGACCGGG ATCCGCCGCT ACGATCCGGT GTCGCTGCGC TCGATGGAGG ACGTGCCCGA AATCGATCTG CTGCCCGTGA CGGAACTGCC CGAAAGCGCC TTCGCCGAGG GGCCTCCGCC CCGGGGAGCC GAACATCGTC TGCCCCGGGC TTGGCCCTCG CTTGCAACGC TGCCGGATCA TCTCGGGCAG GGCGGGATTT CCGTGACCCC CGGCGCCCTC GCGGCGCTCC GCCAGGCCCG CGCGCAGATC GAGGAAGCGC ACCGCGACCG CGAGAGTCTG GGTGAGGGCG AGACGCTGCC GCCCGATGCG CTCTACCTCG GCGAGGAGGA TCTTGCCGGG ATCCTCGCGC AGGCCGAGAC GCTCGACCTG TCGGGCTGGG AGCCCGTTCC CGCCTTCGCG GAAGACCGGC GACCGCGTGC GGCGCTCGCC CGGTTTGCGC GCGGCCAGAT CGGGGAGGGC CGCCGGGTGG TTCTCCTTGC CGCGACCCCG CGCGACCTCC GGGCGCTGGG GCGCGCCACC GGGGCCGCGG ACCCGGCGCA GGACTGGCAG GACGCGCGCG CCACGCCGGA GGATGTCCCG GCGGCGATCC TCTCTACTCT GGCCCGCGGC TTCGTCGACC TTCCGGGGCG CACGGCCGTC GTCACGGCGC GGGATGTGCT GGGCAGCCGG GCCGCGGAAG ACGCGGCCGT GTCGGCGGTA TCCGCATGGC AGGTGACGCC GGACGCCCTC GCCGAAGGGG ACTTCGTCGT GCATGAGGAC AGGGGTCTCG GGCGCCTCCA GGGCCTCGAG CCGCTGCCCG GCGCGGACGG CCGCGAGGCG ATCCGCCTTG GCTATGCGGC AGACCAGCAT CTCCTCGTGC CGGTCGAGGA GGCCGGCCGG ATCTGGCGGT ATGGCACCGG CGCCGATGTG TCCCTCGATC GGCTCAACGG TGCGGCATGG ACGAACCGCA GGAAGAAGCT CGACGAAGGA ATCGCCGAGG CGGCGCGCGC GCTCGTCGCC GCGGCGAAGG AGCGGGCGGC CAAATCGGCG CGGGCCTTTG AGCCGCCGTC CGACATCTAC GAGCGCTTCG CGGGGCGCTT CCCCTTCACC CTTTCTCCCG ACCAGCGTCG CGCCATTGCC GAGGTTCGAG ACGATCTCGT GGCGGGGCGG CCGATGAACC GGCTGGTCCT CGGCGATGTG GGGTTCGGGA AGACCGAGAT CGCGCTGCGG GCCGCGGCCG CCGTGGCGCT CTCCGGCGCG CAGGTTGCGC TGGTGGCGCC CAGCACCGTC CTCGCGCGGC AGCATGCCGA GACCTTCCGC 
CGCCGCTTCG AGGGCTTCGG CGTGACGGTG GCCCATCTTT CGCGTCTCGT GCCGTCCAAG GAGGCGAAGG CCGCGCGCGA CGGGCTCCGC GACGGGTCGA TCCGCATCGT CGTGGGCACC CATGCGCTGC TCGGCAAGGG CGTGGCCTTC GCCGATCTCG GCCTCCTGAT CGTGGATGAG GAGCAACGGT TCGGTGCGGC CCACAAGGCG CGGCTCCGGG CGCTCGGCGC CGACCTGCAT GTGCTGACGA TGACGGCCAC GCCGATCCCG CGGACATTGC AGACTGCCCT CGTGGGCCTG CAGGACCTGA GCGAGATCGC GACGCCGCCC GCCCGGCGGC GCGCCGTGCG CACCCTGACC GCCGAGGAGG ATGCGGCCGT CCTGCGGCAG GCGCTGCTGC GCGAGCGGCG GCGGGGGGGC CAGAGCTTCG TCATCGTGCC GCGGATTGCG GAGATCGATG CCACCGAGGC GCGCCTCCGC GACCTCCTCC CGGAGGCGCA GCTCCGCGTG GCGCATGGCG ACCTCGCGCC GGAAGAGCTC GACCGCGCGA TGGTGGACTT CGCCGCCGGT CGGGGCGACA TCCTTCTTGC CACGAGCATC GTCGAGGCCG GGCTCGACGT CCCGCGCGCG AACACGATGA TCGTCATGCA GCCGCAGCTT TTCGGCCTCG CGCAGCTCCA CCAGTTGCGC GGTCGCGTGG GTCGCGGGGC GCGACAGGCC TACTGCTATC TGATGCACGG GCCGGGAGAT GATCTTGACG AGGCGGCCCT GCGGCGGCTC GGCACGCTTC AGGCTTTCGA CCGGCTGGGC GCGGGGGCCG CGATCGCGGC CGAGGATCTC GATCAGCGGG GGGCGGGAGA GCTGTTCGGA GAGCGGCAGT CGGGGCACGT CAGGCTCATC GGGCTGCCGC TCTACCAGCA TCTCCTTGCG CAGGCGGTGC GCGCCGCGCG AGGGGAGCCG CCGACGGCCC AGCACGTCTC CCTCGCGATG GAGGCGGAGG GCGCGCTGCC GGCCGACTAC ATTCCCGAGG CGGGCCTCCG CCTCGGCCTC TATCGCCGCC TTGCCCGGGC GGCCGATCCG CGCGAGGTCG CGCTGCTGGC CGACGAGATC GAGGATCGCT TCGGCCCGCC GCCGGCCGCG GCTGCCGGTC TGCTCGTCGC GGCGGAGATC CGGGCGCTGG CGCGGAGCCT GGGGATCGAA CGGGTCAGCG CGGGGCCGTC CGGTGTGGCG CTCGACCTGG CGCCGGATGC CTGCGTGGAG CGTTTCGCGG AGGATCTGCC CGAGGGCGTC ACGCTCGAAG GCCGGCGGCT TCTGCGGCAG GAGGAGACGG ACGAGGATGC CGCACGCCGG GCGCTGGATC TTCTGCGCGA TCTCGGCTGA
|
Protein sequence | MKPPIGLEVL RLLDLGISKA AHLHITSDDA RAEEIVRFLR EIAPELRPDV FPCWDCLPYD GASPTPDAMG RRMALLDAVR AGDVRLVIVG PSGLLQRVPP VEAMRRFAVR AGEASDLCAL RTFAARAGYA EDDRIDEPGE IAFRAEVVEV FAAGAERPVR IGLEKDRVTG IRRYDPVSLR SMEDVPEIDL LPVTELPESA FAEGPPPRGA EHRLPRAWPS LATLPDHLGQ GGISVTPGAL AALRQARAQI EEAHRDRESL GEGETLPPDA LYLGEEDLAG ILAQAETLDL SGWEPVPAFA EDRRPRAALA RFARGQIGEG RRVVLLAATP RDLRALGRAT GAADPAQDWQ DARATPEDVP AAILSTLARG FVDLPGRTAV VTARDVLGSR AAEDAAVSAV SAWQVTPDAL AEGDFVVHED RGLGRLQGLE PLPGADGREA IRLGYAADQH LLVPVEEAGR IWRYGTGADV SLDRLNGAAW TNRRKKLDEG IAEAARALVA AAKERAAKSA RAFEPPSDIY ERFAGRFPFT LSPDQRRAIA EVRDDLVAGR PMNRLVLGDV GFGKTEIALR AAAAVALSGA QVALVAPSTV LARQHAETFR RRFEGFGVTV AHLSRLVPSK EAKAARDGLR DGSIRIVVGT HALLGKGVAF ADLGLLIVDE EQRFGAAHKA RLRALGADLH VLTMTATPIP RTLQTALVGL QDLSEIATPP ARRRAVRTLT AEEDAAVLRQ ALLRERRRGG QSFVIVPRIA EIDATEARLR DLLPEAQLRV AHGDLAPEEL DRAMVDFAAG RGDILLATSI VEAGLDVPRA NTMIVMQPQL FGLAQLHQLR GRVGRGARQA YCYLMHGPGD DLDEAALRRL GTLQAFDRLG AGAAIAAEDL DQRGAGELFG ERQSGHVRLI GLPLYQHLLA QAVRAARGEP PTAQHVSLAM EAEGALPADY IPEAGLRLGL YRRLARAADP REVALLADEI EDRFGPPPAA AAGLLVAAEI RALARSLGIE RVSAGPSGVA LDLAPDACVE RFAEDLPEGV TLEGRRLLRQ EETDEDAARR ALDLLRDLG
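The sequences above are printed in space-separated blocks of ten characters. Before checking a length or a GC fraction, the whitespace has to be removed. The sketch below shows one way to do that in plain Python. The helper names are hypothetical, and the example uses only the first three blocks of the gene sequence, so it does not reproduce the whole-gene GC content of 73% reported in the record.

```python
def clean_sequence(field: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(field.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 for empty input)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First three 10-base blocks of the gene sequence from this record.
blocks = "ATGAAACCAC CGATCGGCCT CGAGGTCCTG"
seq = clean_sequence(blocks)
print(len(seq))          # 30
print(seq[:3])           # ATG
print(gc_fraction(seq))  # 0.6
```

Applying `clean_sequence` to the full Gene sequence field, including the part that continues on the following line, should give a string whose length matches the Gene Length row.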