Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RSP_3215 |
Symbol | |
ID | 3721823 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides 2.4.1 |
Kingdom | Bacteria |
Replicon accession | NC_007494 |
Strand | - |
Start bp | 269476 |
End bp | 272856 |
Gene Length | 3381 bp |
Protein Length | 1126 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 640072892 |
Product | hypothetical protein |
Protein accession | YP_354732 |
Protein GI | 77465229 |
COG category | [S] Function unknown |
COG ID | [COG4717] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.656216 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCCTCG ACCGGCTCGA CCTCACGCGC TACGGCCATT TCACGGACCG GCGGCTCGCA TTTCCCGCGC CTGCGCCGGG AGAGGCGGAC CTGCATGTGG TCTACGGGCC GAACGAGGCA GGCAAGTCCA CGCTCTTCTC GGCCTGGCTC GACCTCCTCT TCGGCATCCC GCTCCGCACC CGCTACGACT TCCGCCACCC CGGCCCCACG ATGCGCGTGG GCGCGCGGCT CAGCCATGCG GGCGGCGCGC TCGATCTGGC CCGGGTGAAG CGGAACACCG GCAGCCTCCT CGACGCGCAC GACCAGCCGG TGCCCGAGGC GCTGCTGCAA TCGGCGCTGG GCGGCCTCAC GCGCGAGGGC TATTCGGCCA TGTTCTCGCT CGACGACGAC ACGCTGGAAA AGGGCGGCGA CAGCATCCTT GCGAGCCGCG GCGATCTGGG CGAGATGCTC TTCTCGGCGA GCGCGGGGCT GGCCCAGCTG AGCCCCCGGC TCGAGGCGAT CCGCGTGAAC CTCGACGGCT TTCACCGCAG CGGCAAGCGC AGCGGCTGGC TTTGGGACAC GAAGAAGCGG CTGACCGAAC TCGATGCGGA GCGGCGGCGG CTGGACGTGT CGGCCGGAAC GATCCAGAAG CTCGCACGCG AGGCGGAGGC GGCCGAGGCC GCCTGGCGGG CGGCGCGCGG TGCAGAGGAT GCGGCGCAGG CGGATCTGGG GCGGCTGCAG GATCTGGCGG CGACCCTTCC GATCCGGGCG CGGCTCGAAG GGCTGCGGGC GCGGCTCGCG CCGCTGGCCC ATCTGCCCGA AGCCGGGGTG GCCGAGCGCG ACCGGCTCGA CCAACTCGAC CGCGAGACCG AAGCGCTGCG CGCGCGTCGC GCCGACCGGG CCCGCCGCCT TGCGGATCTC GCGGCAGAGG CCGATGCCTT GCCGCTCGAT CCGGCGGTGC TGGCTGCCGC CGGCGACATC GAGGCGGCCG AGGCGCTGCG CCCCGAGCAC GAGACGGCGC AGAAGGACCT GCCCCGCCGC GAAGCCGAGG CGTTCGAGGC CCGCGCCGAG GTGACGGCGC TTCTGGCCGA ACTCGGCCAT CCGGGGGCAA AGCCGGAGGG GCTGGTGCTT CCGGCAACCA CGCTCGCACG GCTCCGCGCC CTTGCCGCAG AGCGCTCTGG CCTCGAAGCG ACCGCGGCCG CCTCGGAGGC GGAGCGACGC GCGGCGGCCG AGCGCCTCAC GCGCGAGCGC GACCGTCTGG GCGATCCGGG CCCCGAGGGC GAGGACGACA CGCTGGTCGC GCTGCTCGCG CGGCTGCGGG CGCAGGATCC GGCCGAGGCC CATGCCCGGG CGCGGCTCGA CCGCGATCTG CATCAGGCGC GCCTCACCGC CGCGCTCGAG GCGCTTGCCC CCTGGCGGGG CGATGCCCCG GCGCTCGCGG CCCTGCCCGT GCCCTCCGCC GCCCTGCTCG ACGGGTGGGA GCGCGGCCTC GAAGAGGCCC GTCAGCGCGT GGCCGACGCC CAGAGGACGG CAGAGGCGAT CCGCGCCGAT CTCGACCGGC TGCGCCACGA TGCGGCGGCA GAGCGAGGCG CCGCCTCGGC CACCGGCCTC ACCCTCGCCG AGGCCGCCGC CGCCCGCAGC CGACGTGAAG CCGCCTGGGC CAGCCATCGC CGCGCCCTCG ATGCGGCCAG TGCCACCGAG TTCGAACAGG CGCTCCGCGA GGACGACCGC ATCTCCGCCC TCCTCGCCGA GGCGCTGGCC GAGGCCCGCC GCGCGGCCGG AGCCGAAGCC GAGGAGGCGC GGCTTCTCCG CACGCTGGCC GAGGCGGACG CGGCCCGTGA GGCGGCGCGC GCCGGGCAGG CCCGGATCCG CACCGCCCTT GCCGAGGCGG GAGGGGCCTT CGGCCTCTGC GATGCCGACA TCTCCGCCCT GCGACACTGG CTCGGCCTCC GCGACGAGGC GCTGGCCCGG CAGGCGGCGC TCCGCGAGGC CGAAGCCCAT TGCACCCGCC AGTCGGAGGC GCTGGACGCG GCCACCCTTG CCCTCGCCGC GGCCCTCGGC GCGCCGGAGG GCACGCCCTT CGAGGCCCTC CTCGCGACCG CCATCGCCCG CACCGAAGCC GCCGAACGTC GGCGCGAGGC GCGGCGGCAG CTGGCCGGGC TGGCCGCCGA TCTCAAGGCC CGCGAGGCCG CCGAGGCGCA GGCACAGCAG GCGCTCGCGC GCTGGCGCGA AAGCTGGCAC GAGGCGAGCC GGGACACGAT CCTCGCCGAC GGCCCTTCCG AGGGTCCGGT GCTCGATCTC CTCGATGCGC TCGGCGCGGC GGCCCGCAGC CTCGCCGCGC TCGAGGACCG GATCGCCAAG ATGGAGGCGA ACCGCGCAAG GTTCGAGGCC GCCCGGACGG CGCTCCTCGC GCGGCTCGAC CTCGATCCCG ACACGGGCTG GGAGGCGCTA CGGTCCCGGC TGCGCCGCGC GCAGGATGCG GCGCGGGACG CGGAACGGCT CGCCCAGCAG CGCACCACCG AGGAGCGTCA GGAGGCCGAG GACCGCCGCA CCCTCGCCGC GCTGGAAGAA GACCGGGCCG CGCTCGCCCA AGCGCTCGGC TGGTCCGAGG CGGACGGGCC GCTCGCGGCC CACCTCGCCT GCTGCCTCGA GGCGGCCGAG CTGCGCCGTC AGGTGGCAGC CCTTCTGTCG GACCTCTCAG GGCGGCCCGA ACCGCAGGAG ACCGACGATC CCGCGACGCT CACTTCCCGG ATCGAGAAAC TGCGCACCGA CCTCCAGCTC CTGCGGAGCG AGGCCGAGAG CTGCCTCACC GCCCATCTGG ACGCCGGGCG CAGGCTCGAG GCGGTCGGCG GCGACGATGC GCTGGCCCGG ATCGCCTCGG ACCGCGAGAC CCTGCTGGTG GAACTGCGCG ACCGCGCCCG CGCCCATCTC GCCGCCCGCT TCGGGCTGAT GGCCTTCGAG GCGGGGCTCC GGCGCTACCG CGACCGGCAC CGCAGCGCGA TGCTGGCCCG CGCCTCGGAC GCCTTCTGCC GCCTCAGCCG TGGAGCCTAT GCCGGCCTCG CGGCCCAGCC CGACGGCGCG CAGGAGGTGC TGGTGGCGCT GGCCGCCGAA GGCGGGGCGA AACTGGCGGC GGATCTCTCC AAGGGCACGC GGTTTCAGCT CTATCTCGCG CTGCGCATCG CGGGCTTCCA CGAGCTCGCC CAGAGCCGCC CGCCCGTGCC CTTCATCGCC GACGACATCA TGGAGACCTT CGACGACGAC CGCTCGGCCG AGGCCTTCGC CTTGCTGGCC GACATGTCCC GCGTGGGGCA GGTGATCTAT CTGACGCACC ACCGCCACCT CTGCGACATC GCCCGCGCCG CCTGCCCCGG CGCCTCGCTG ATCGACCTCA CGGCGCCCTG A
|
Protein sequence | MRLDRLDLTR YGHFTDRRLA FPAPAPGEAD LHVVYGPNEA GKSTLFSAWL DLLFGIPLRT RYDFRHPGPT MRVGARLSHA GGALDLARVK RNTGSLLDAH DQPVPEALLQ SALGGLTREG YSAMFSLDDD TLEKGGDSIL ASRGDLGEML FSASAGLAQL SPRLEAIRVN LDGFHRSGKR SGWLWDTKKR LTELDAERRR LDVSAGTIQK LAREAEAAEA AWRAARGAED AAQADLGRLQ DLAATLPIRA RLEGLRARLA PLAHLPEAGV AERDRLDQLD RETEALRARR ADRARRLADL AAEADALPLD PAVLAAAGDI EAAEALRPEH ETAQKDLPRR EAEAFEARAE VTALLAELGH PGAKPEGLVL PATTLARLRA LAAERSGLEA TAAASEAERR AAAERLTRER DRLGDPGPEG EDDTLVALLA RLRAQDPAEA HARARLDRDL HQARLTAALE ALAPWRGDAP ALAALPVPSA ALLDGWERGL EEARQRVADA QRTAEAIRAD LDRLRHDAAA ERGAASATGL TLAEAAAARS RREAAWASHR RALDAASATE FEQALREDDR ISALLAEALA EARRAAGAEA EEARLLRTLA EADAAREAAR AGQARIRTAL AEAGGAFGLC DADISALRHW LGLRDEALAR QAALREAEAH CTRQSEALDA ATLALAAALG APEGTPFEAL LATAIARTEA AERRREARRQ LAGLAADLKA REAAEAQAQQ ALARWRESWH EASRDTILAD GPSEGPVLDL LDALGAAARS LAALEDRIAK MEANRARFEA ARTALLARLD LDPDTGWEAL RSRLRRAQDA ARDAERLAQQ RTTEERQEAE DRRTLAALEE DRAALAQALG WSEADGPLAA HLACCLEAAE LRRQVAALLS DLSGRPEPQE TDDPATLTSR IEKLRTDLQL LRSEAESCLT AHLDAGRRLE AVGGDDALAR IASDRETLLV ELRDRARAHL AARFGLMAFE AGLRRYRDRH RSAMLARASD AFCRLSRGAY AGLAAQPDGA QEVLVALAAE GGAKLAADLS KGTRFQLYLA LRIAGFHELA QSRPPVPFIA DDIMETFDDD RSAEAFALLA DMSRVGQVIY LTHHRHLCDI ARAACPGASL IDLTAP
|
| |