Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17029_3954 |
Symbol | |
ID | 4899128 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009050 |
Strand | - |
Start bp | 1088884 |
End bp | 1092264 |
Gene Length | 3381 bp |
Protein Length | 1126 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 640114557 |
Product | hypothetical protein |
Protein accession | YP_001045804 |
Protein GI | 126464691 |
COG category | [S] Function unknown |
COG ID | [COG4717] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.573007 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCCTCG ACCGGCTCGA CCTCACGCGC TACGGCCATT TCACGGACCG GCGGCTCGCA TTTCCCGCGC CCACGCCGGG CGAGGCGGAC CTGCATGTGG TCTACGGGCC GAACGAGGCG GGCAAGTCCA CGCTCTTCTC GGCCTGGCTC GACCTCCTCT TCGGCATCCC GCTCCGCACC CGCTACGACT TCCGCCACCC CGGCCCCACG ATGCGCGTGG GCGCGCGGCT CAGCCATGCG GGCGGCGCGC TCGATCTGGC CCGGGTGAAG CGGAACACCG GCAGCCTCCT CGACGCGCAC GACCAGCCGG TGCCCGAGGC GCTGCTGCAA TCGGCGCTGG GCGGCCTCAC GCGCGAGGGC TATTCGGCCA TGTTCTCGCT CGACGACGAC ACGCTGGAAA AGGGCGGCGA CAGCATCCTC GCGAGCCGCG GCGATCTGGG CGAGATGCTC TTCTCGGCGA GCGCGGGGCT GGCCCAGCTG AGCCCCCGGC TAGAAACGAT CCGCACGGCC CTCGACGGCT TTCACCGCAG CGGCAAGCGC AGCGGCTGGC TTTGGGACAC GAAGAAGCGG CTGGCCGAAC TCGATGCGGA GCGGCGGCGG CTCGACGTGT CTGCCGGCAC GATCCACAGG CTCGCGCGCG AGGCGGAGGC GGCCGAGGCC GCCTGGCGGG CGGCGCGCGG TGCAGAGGAT GCGGCGCAGG CGGATCTGGG GCGGCTGCAG GATCTGGCGG CGACCCTTCC GATCCGGGCG CGGCTCGAAG GGCTGCGGGC GCGGCTCGCG CCGCTGGCCC ATCTGCCCGA AGCCGGGGGG GCCGAGCGCG ACCGGCTTGA CCGGCTCGAC CGCGAGACCG AAGCGCTGCG CGCGCGTCGC GCCGACCGGG CCCGCCGCCT TGCGGATCTC GCGGAAGAGG CCGATGCCTT GCCGCTCGAT CCGGCGGTGC TGGCTGCCGC CGGCGACATC GAGGCGGCCG AGGCGCTGCG CCCCGAGCAC GAGACGGCGC AGAAGGACCT GCCCCGCCGC GAAGCCGAGG CGTTCGAGGC CCGCGCCGAG GTGACGGCGC TTCTGGCCGA ACTCGGCCAT CCGGGGGCAA AGCCGGAGGG GCTGGTGCTT CCGGCAACCA CGCTCGCACG GCTCCGCGCC CTTGCCGCAG AGCGCTCCGG CCTCGAAGCG ACCGCGGCCG CCTCGGAGGC CGAGCGGCAC GCGGCAGCCG AACGGCTCGC GCGCGAGCGC GACCGTCTGG GCGATCCGGG CCCCGAGGGC GAGGAGGCCA CGCTGGTCGC GCTGCTCGCG CGGCTGCGGG CGCAGGATCC GGCCGAGGCC CATGCCCGGG CGCGGCTCGA CCGCAACCTG CATCAGGCGC GCCTCACCGC CGCGCTCGAG GCGCTTGCCC CCTGGCAGGG CGATGCCTCG GCCCTCGCGG CCCTGCCCGT GCCCTCCGCC GCCCTGCTCG ACGGCTGGGA GCGCGGCCTC GAAGAGACCC GCCAGCGCGC GGCCGACGCC CAACGGACGG CCGAGGCAAT CCGGGCCGAT CTCGACAGGC TGCGCCACGA TGCAGCGGCA GAGCGGGGCG CCGCCTCGGC CACCGGCCTC ACCCTCACCG AGGCCGCCGC CGCCCGCAGC CGGCGCGAGG CCGCCTGGGC GCGCCATCGC CGGAGCCTCG ATGCGGCCAG TGCCACCGAG TTCGAACAGG CGCTCCGCGA AGACGACCGC ATCTCCGCCC TCCTGGCCGA GGCGCTGGCC GAGGCCCGCC GCGCCGCCGG AGCCGAAGCC GAGGAGGCGC GGCTTGCCCG GGCGCTGGCC GAGGCGGAGG CGGCCCGTGA CGCGGCCCGG ACCGGTCAGG CGCAGATCCG CGCCGCCCTT GCCGAGGCGG GAGGGGCGCT CGGCCTCTGC GATGCCGACC TCTCCGGCCT GCGGCACTGG CTGGCGCTGC GCGACGAGGC GGCGGCCCGG CAGGCGGCGC TCCGCGAGGC CGAAGCCCAC TGCACCCGCC AGTCGGAGGC GCTGGACGCG GCCAGCCTTG CGCTCGCCGC AGCCCTCGGC GCGCCGGAGG GCACGCCCTT CGAAACCCTC CTCTCGACCG CCATCGCCCG CACCGAAGCC GCCGAGCGCC GGCGCGAGGC GCGGCGGCAG CTGGCCGGGC TGGCCGCCGA TCTCAAGGCC CGCGAGGCCG CCGAGGCGCA GGCACAGCAG GCGCTCGCGC GCTGGCGCGA AAGCTGGCAC GAGGCGAGCC GGGGCACGAT CCTCGCCGAC GGCCCTTCCG AGGGTCCGGT GCTCGATCTC CTCGATGCGC TCGGCGCGGC CGCCCGCAGC CTCGCCGCGC TCGAGGACCG GATCGCCAAG ATGGAGGCGA ACCGCGCCCG GTTCGAGGCC GCCCGGACAG CCCTCCTCAC GCGGCTCGGC CTCGATCCCG ACACGGGCTG GGAGGCGCTG CGGTCCCGGC TGCGCCGCGC GCAGGATGCG GCGCGGGACG CGGAACGGCT CGCCCAGCAG CGCACCACCG AAGACCGTCA GGAGGCCGAG GACCGCCGCA CCCTCGCCGC GCTGGACGAA GACCGGGCCG CGCTCGCCCA AGCGCTCGGC TGGTCCGAGG CGGACGGGCC GCTCGCGGCC CACCTCGCCT GCTGCCTCGA GGCGGCCGAG CTGCGCCGTC AGGTGGCAGC CCTTCTGTCG GACCTCTCAG GGCGGCCCGA ACCGCAGGAG ACCGACGATC CCGCGACGCT CACTTCCCGG ATCGAGAAAC TGCGCACCGA CCTCCAGCTC CTGCGGAGCG AGGCCGAAAG CGGCCTCACC GCCCATCTGG ACGCCCGGCG CAGGCTCGAG GCGGTCGGCG GCGACGATGC GCTGGCCCGG ATCGCCTCGG ACCGCGAGAC CCTGCTGGTG GAACTGCGCG ACCGCGCCCG CGCCCATCTC GCCGCCCGCT TCGGGCTGAT GGCCTTCGAG ACCGGGCTCC GGCGCTACCG CGACCGGCAC CGCAGCGCGA TGCTGGCCCG CGCCTCGGAC GCCTTCTGCC GCCTCAGCCG CGGCGCCTAT GGAGGCCTCA CCGCCCAGCC CGACGGCGCG CAGGAGGTGC TGGTGGCGCT GGCCGCCGAA GGCGGGGCGA AACTGGCGGC GGATCTCTCC AAGGGCACGC GGTTTCAGCT CTATCTCGCG CTGCGCATCG CGGGCTTCCA CGAGCTCGCC CAGAGCCGCC CGCCCGTGCC CTTCATCGCC GACGACATCA TGGAGACCTT CGACGACGAC CGCTCGGCCG AGGCCTTCGC CCTGCTGGCC GACATGTCCC GCGTGGGGCA GGTGATCTAT CTGACGCACC ACCGCCACCT CTGCGACATC GCCCGTGCCG CCTGCCCCGG CGCCTCGCTG ATCGACCTCA CGGCACCCTG A
|
Protein sequence | MRLDRLDLTR YGHFTDRRLA FPAPTPGEAD LHVVYGPNEA GKSTLFSAWL DLLFGIPLRT RYDFRHPGPT MRVGARLSHA GGALDLARVK RNTGSLLDAH DQPVPEALLQ SALGGLTREG YSAMFSLDDD TLEKGGDSIL ASRGDLGEML FSASAGLAQL SPRLETIRTA LDGFHRSGKR SGWLWDTKKR LAELDAERRR LDVSAGTIHR LAREAEAAEA AWRAARGAED AAQADLGRLQ DLAATLPIRA RLEGLRARLA PLAHLPEAGG AERDRLDRLD RETEALRARR ADRARRLADL AEEADALPLD PAVLAAAGDI EAAEALRPEH ETAQKDLPRR EAEAFEARAE VTALLAELGH PGAKPEGLVL PATTLARLRA LAAERSGLEA TAAASEAERH AAAERLARER DRLGDPGPEG EEATLVALLA RLRAQDPAEA HARARLDRNL HQARLTAALE ALAPWQGDAS ALAALPVPSA ALLDGWERGL EETRQRAADA QRTAEAIRAD LDRLRHDAAA ERGAASATGL TLTEAAAARS RREAAWARHR RSLDAASATE FEQALREDDR ISALLAEALA EARRAAGAEA EEARLARALA EAEAARDAAR TGQAQIRAAL AEAGGALGLC DADLSGLRHW LALRDEAAAR QAALREAEAH CTRQSEALDA ASLALAAALG APEGTPFETL LSTAIARTEA AERRREARRQ LAGLAADLKA REAAEAQAQQ ALARWRESWH EASRGTILAD GPSEGPVLDL LDALGAAARS LAALEDRIAK MEANRARFEA ARTALLTRLG LDPDTGWEAL RSRLRRAQDA ARDAERLAQQ RTTEDRQEAE DRRTLAALDE DRAALAQALG WSEADGPLAA HLACCLEAAE LRRQVAALLS DLSGRPEPQE TDDPATLTSR IEKLRTDLQL LRSEAESGLT AHLDARRRLE AVGGDDALAR IASDRETLLV ELRDRARAHL AARFGLMAFE TGLRRYRDRH RSAMLARASD AFCRLSRGAY GGLTAQPDGA QEVLVALAAE GGAKLAADLS KGTRFQLYLA LRIAGFHELA QSRPPVPFIA DDIMETFDDD RSAEAFALLA DMSRVGQVIY LTHHRHLCDI ARAACPGASL IDLTAP
|
| |