Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPD_3945 |
Symbol | |
ID | 4024461 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | - |
Start bp | 4389999 |
End bp | 4391720 |
Gene Length | 1722 bp |
Protein Length | 573 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637964149 |
Product | hypothetical protein |
Protein accession | YP_571067 |
Protein GI | 91978408 |
COG category | [S] Function unknown |
COG ID | [COG1479] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCAACG CGGACCAACA TCCCGTCGCC GAAGTGCTGG GCGACAAGTT CATCCATGAA ATTCCGCCAT ATCAGCGCCC GTACGCCTGG ACATCGGATC AGGCCCTCCA ACTGATCGAA GATCTCAGGG AGGCGATGAG CTCAGGCGCC GACGAACCCT ATTTTCTCGG CAGCATAGTG CTGATACGGC CGCGCGGAGA ACCCGTCGGT CAGGTGGTCG ACGGCCAGCA GCGTCTGACG ACGCTGACGA TATTGGCCGC CGTGCTGCGC GACCTGGCAA CCGATCCCGA CGCGCGCGAG GCAATCTCCG GCGCGGTCTA CATCAAACCT AATCCCTACA AAAAGCAGGT CGAGTCGGTC AGGATTCTTC CTCATTCGGA GGACCGCATT TTCTTCCGCG AGGCCATCCA GTTTCCCGAC GCGACGTCGA AATCGTCTCC GCCCCATCAG CCGAAGACCG AAGCCCAGAA GCTGATGTGG GACAACGCCC TGGCGTTGCG TAAGCGCGTC GTTGAGATGA CGATCGAAGA CCGTCAGAGA CTCGTCGATT ACCTCCTGAA CAATTGCGTT CTGGTGGTGG TTTCCACGGA GTCGCGTGGC GCGGCGCTCA GGATCTTCAG GGTGCTGAAC GATCGCGGCC TGGATCTTTC GAATGCGGAT GTCATCAAGG CCGATCTGCT CGGGAAATTT AAAGACCATA CGGAGATGGC GCATCAGGCC GCCCGGTGGC GCGACTTCGA GACCGACCTC GGCCGCAATG ATTTCGAGGA TCTGCTGGAA AATCTCCGGT TCATCCGTGA GAAAGGCAAG AACCGAAGTT CGCTAAGTGA AGCGTACGAA TTACGCTTCA AGATGGCTAC ACCTCCGGAC GTCAGAAATT TTCTCGATCA CGAACTTGCG CCGGCCAAGC GCTGGTTTGC CGAGATCGTG GATGGAGACG GAGCGGATTT TCCGACCATT CTCCGAAGCG GGCTGTCGGA AGCATTGGCG GGTCTTCGCC TGGTTCCCAA CAAGGATTGG ATGCCGGTCG CTCTCGCCGC CGCGATGCAG TTCGGCGCCA CGGAGAAGCT GCTCTCGACG CTGGTCAAGC TTGAGGGGCT GGCTTGGATC ATGCAGTTGG GACGCCGCTA CGATACACAG CGGATGAACC GCTACGGCGA AATCATCGGA GCTCTTGGCG GTCCGGATGC GGAGCTCGAA AGCAAGCTTG TTCCCTCGGT CGAAGAGAAC GACGATGCTT GGTCAGCGCT GAGCGGAAAG CTCTACAGCA AGTTTCCCGT GCGAGTCGTC CGTGCTGTTC TCGAACGTTT GGACAGATTG CTATCCGAGC AAATCGTCGT CTGGGATGGG CAAAAGACCG TCGAACACAT CCTTCCGCAG AATCCCGAGG CCGGGGAATG GGTTGGTTTC GATTCAGAAC GGCGGGAGGC GGTCACGGAT ACACTCGGTA ACCTGGTTCT GCTGACTTCG CGTAAGAACT CGTCTGCCTC CAATCTGCCC TTCGCAAAGA AACGCCTAGT CTATTTTGGA CTCGCGGAAA CGAGCGCTGG AAAGAAGAGA GCGACGTATG CGAGCGCCCA AGAACTGGGC GAGCTCCGCG ACTGGGATGT CCCCGCATAT CGAAGTCGAC AAGAACGTCA CCTTGCGTTG CTCGCCAAGC GATGGGGCAT CACGCTTCAG CCCACCGTCC AGCCGCCCGC TGCGGACGCT CTTCGGTCTT GA
|
Protein sequence | MINADQHPVA EVLGDKFIHE IPPYQRPYAW TSDQALQLIE DLREAMSSGA DEPYFLGSIV LIRPRGEPVG QVVDGQQRLT TLTILAAVLR DLATDPDARE AISGAVYIKP NPYKKQVESV RILPHSEDRI FFREAIQFPD ATSKSSPPHQ PKTEAQKLMW DNALALRKRV VEMTIEDRQR LVDYLLNNCV LVVVSTESRG AALRIFRVLN DRGLDLSNAD VIKADLLGKF KDHTEMAHQA ARWRDFETDL GRNDFEDLLE NLRFIREKGK NRSSLSEAYE LRFKMATPPD VRNFLDHELA PAKRWFAEIV DGDGADFPTI LRSGLSEALA GLRLVPNKDW MPVALAAAMQ FGATEKLLST LVKLEGLAWI MQLGRRYDTQ RMNRYGEIIG ALGGPDAELE SKLVPSVEEN DDAWSALSGK LYSKFPVRVV RAVLERLDRL LSEQIVVWDG QKTVEHILPQ NPEAGEWVGF DSERREAVTD TLGNLVLLTS RKNSSASNLP FAKKRLVYFG LAETSAGKKR ATYASAQELG ELRDWDVPAY RSRQERHLAL LAKRWGITLQ PTVQPPAADA LRS
|
| |