Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPC_3973 |
Symbol | |
ID | 3969396 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | - |
Start bp | 4422764 |
End bp | 4425397 |
Gene Length | 2634 bp |
Protein Length | 877 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637927077 |
Product | hypothetical protein |
Protein accession | YP_533818 |
Protein GI | 90425448 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0974285 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.211682 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATCAGA CGACGCCCAA TGCAGATGAC GACGAGCGTC CATGGCGCGA TAAATTCCAC GCCGTGTTCG CCAAGGAAAT CGAAGCCATC AACAGCCGCC GCGAAGCGAC GCGGAAGCTC GCAACGCCCG CCTCCGGCAA GACCCTCGAC GCCGTTGGTC TCGCTCTGTC CGGCGGCGGC ATCCGCTCGG CGTCGGTCTG TCTCGGCGTG GTGCAGGCGC TGAACAACCA GGGCCTGTTG TCGCGGATCG ACTATCTGTC GACGGTGTCG GGCGGCGGTT ATCTCGGCAG CTCGCTGTCG GCGACGCTGA CCTGCCATAA GGACTTCGTG TTCGGCGGCA GCGCCGCCCC GCTCGCCGGC GCCGCCCCCG CCACCGACAT CAGCGACACC CCGGCGGTCG GCCATCTGCG CAACTATTCG AATTATCTGA TCCCGCGCGG CTTTCGCGAC GTGCTGAGCG CCGCGGCGAT CGTGCTGCGC GGGCTGGTCG CGAACCTGTC CCTGGTGGTG CCGGTGCTGC TGCTGCTGGC GGCGGTCACC GCGTGGTCGA ACCCGGTGCG GACCTCGCTC GGCAAATCGG ATTTCTTCGG CGTCGACCTC AGCCAATTTC TGACGCGGAA TTTCGGCGTC ACGCTGGTGG CGGGGCTGAC GCTACTCGGG CTGTTCCTGC TGTGGGCGCT GGCGCGGTCG TCGCCATGGC TGGCGCGGCG GCTGGGCTGG ATGGACATTG TCGGCGGGGT CTTGCTGGTG GCGTTCGCGG CGATCGCGTT CTTCGAATTT CAGCCGTTCG TGATCGAAGG CATGTTCCAG ATCGCCGACC ACAACAACGG CAACCCCAGC GGGCTGGTGC TGGGACTGTT CACCAGTTGG GTGAAGACGC TGGCCGCCGC CGCGGCGCCG GTCACCGCTT TGGTGGTGAT GTTCCAGCGT CAGATCGGCG CGTTCCTCAA CAGCGCCACC GCGGGATCGA GCCTCGCCGC GCAGGCCTCC GCCATTGCGA TCAAGCTCGC GGTCTGGGTC GCCGGGCTGG CGCTGCCGCT GATCATCTGG GTAGCGTATC TCTATCTGAG CTATTGGGCG ATCGCCAATG ACGGCAAGCG TACCGCAGAA CAGGTGCGCT GCCCGCCGAC CGCGATCTCC GCCACCGTAA ACATCCAGCA GCAGGATGGC GCGTCCACCG CGACGCTGCA GGGCAGCCTG CAACCCGACG ACGCCAGCCG CTGCGCCGCG GCCGCGCCTG CGGATTCGTC GTCGCCGCAG GCGTGGGCAC ATACGCCAAA CTGGCTGATC TGGCTCTCCA AGTTCATGCC CGGCCGGGTG CCGGAGGGCC ACCTGATGCC GGCGTTATAT GTCGCGGTGG CGCTGCTGGT GTTCGTGCTG TCGTGGCCGC TGGCGCCGAA CGCCAACTCG CTGCACCGCC TGTATCGCGA CCGGCTCGGC AAAGCGTTTC TATTCGATCC GCGCCATCGC CGCGGCGCCC GGCCCAGCGC CAACGAGCCG AGCCGCGAGC AGGGCCGGGA TTTCGTCAAC GTCAGCGGTA TGCGGCTCAG CACGCTATCG ACGGCGCAGG CGCCCTATCA CCTGATCAAT GCGGCACTCA ACATCCAGGG CTCGGACTTC GCCAACCGCC GCGGCCGCAA CGCCGATTTC TTCGTGTTCT CGAAATACTC GACCGGCAGC CAGGCCACCG GCTACGCGCC CACCGACCGG CTCGAGGCGG CGGCCGCCGA ACTCGATCTC GCCACCGCGA TGGCGATCTC CGGCGCCGCG GCCTCGGCCA ACATGGGCTC CAAGACCATC CGGCCGCTGA CGCCGACGCT GGCGCTGCTG AACATCCGGC TGGGCTATTG GCTGACGAAC CCGGCGTTCT TTGCGGCGGC CGGATCGGGG ACGGCGGCGG GCGCGGACGT AAAATCCAGT TGGACGCCGA CCCACCGCAC CACGCGCTAT CTGTGGTCGG AGCTTTCCGG CCGGCTCTAC GAGAACAGCG ACGAAATCTA TCTCACCGAC GGCGGCCATA TCGAAAATCT CGGCATCTAC GAATTGCTGC GCCGGCAATG CCGGCTGATC ATCGCGGTCG ATGCCGAAGC CGACAGCGCG ATGCATTTTC CCTCGCTGGT GACGCTGCAG CGCTACGCGC GGATCGATCT CGGGATTCGG ATCTATCTGC CGTGGGCGCC GATCCAGGCC GCGACGCTGG GCTGCATGGC GGTCAACGCC GGCAAGCTGC CGCCGCCGCC GCCATCCAAA GAACCGCACG GCCCGCATGT GGCGATCGGC ATCATCGACT ACGGCGGCGG CGAAAAGGGC ACCCTCGTCT ACATCAAGTC GTCGCTGAGC GGCGACGAGA ACGATTACGT CCGCGATTAT GCCCGGCGCC ACGCCCAGTT TCCGCACGAG GCGACCGGCG ACCAGTTCTT CAGCGAGGAG CAGTTCGAGG TCTATCGCGC GCTCGGTTTC CACATCGCCC ACGGCTTGCT GTGTGGTGCC GACGACGTCA GCGTCGCGAG CGACGGCGGC GACCCGGTGG TGACGAAATT CAGCGATGCC GGCAACGCGA CCGTCGCGGC GGTGCGGGCC GCGCTCGGGC TGAGTGTGGA AGAAGAGGGG GAGGGGGCCG GGTCCGGCGT GTAA
|
Protein sequence | MDQTTPNADD DERPWRDKFH AVFAKEIEAI NSRREATRKL ATPASGKTLD AVGLALSGGG IRSASVCLGV VQALNNQGLL SRIDYLSTVS GGGYLGSSLS ATLTCHKDFV FGGSAAPLAG AAPATDISDT PAVGHLRNYS NYLIPRGFRD VLSAAAIVLR GLVANLSLVV PVLLLLAAVT AWSNPVRTSL GKSDFFGVDL SQFLTRNFGV TLVAGLTLLG LFLLWALARS SPWLARRLGW MDIVGGVLLV AFAAIAFFEF QPFVIEGMFQ IADHNNGNPS GLVLGLFTSW VKTLAAAAAP VTALVVMFQR QIGAFLNSAT AGSSLAAQAS AIAIKLAVWV AGLALPLIIW VAYLYLSYWA IANDGKRTAE QVRCPPTAIS ATVNIQQQDG ASTATLQGSL QPDDASRCAA AAPADSSSPQ AWAHTPNWLI WLSKFMPGRV PEGHLMPALY VAVALLVFVL SWPLAPNANS LHRLYRDRLG KAFLFDPRHR RGARPSANEP SREQGRDFVN VSGMRLSTLS TAQAPYHLIN AALNIQGSDF ANRRGRNADF FVFSKYSTGS QATGYAPTDR LEAAAAELDL ATAMAISGAA ASANMGSKTI RPLTPTLALL NIRLGYWLTN PAFFAAAGSG TAAGADVKSS WTPTHRTTRY LWSELSGRLY ENSDEIYLTD GGHIENLGIY ELLRRQCRLI IAVDAEADSA MHFPSLVTLQ RYARIDLGIR IYLPWAPIQA ATLGCMAVNA GKLPPPPPSK EPHGPHVAIG IIDYGGGEKG TLVYIKSSLS GDENDYVRDY ARRHAQFPHE ATGDQFFSEE QFEVYRALGF HIAHGLLCGA DDVSVASDGG DPVVTKFSDA GNATVAAVRA ALGLSVEEEG EGAGSGV
|
| |