Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPC_4047 |
Symbol | |
ID | 3969296 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | + |
Start bp | 4496181 |
End bp | 4497761 |
Gene Length | 1581 bp |
Protein Length | 526 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637927151 |
Product | hypothetical protein |
Protein accession | YP_533892 |
Protein GI | 90425522 |
COG category | [S] Function unknown |
COG ID | [COG2187] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.237674 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTACCGGT CCATGCCTTT TTCCTGCGCA CCCGACGATC TGCAACAGAA AGTCTTCGCC TTCTTGGCGA ATTCGGCGAA CCATGGCGAT CGACCGGTGC ATGTCGTCAC CACGCATGGC GCAGCGGTGT TTCTGGCGGG TGACCGCGCG CTGAAGATCA AGCGCGCGGT GCGGTTTCCC TATCTGGACT ATTCCACGCT GGACAAGCGC AAAGCCGCTT GTGACCAGGA AGTGAGCATC AATCGGCTGT TCGCGCCGCA GATCTATCGC GGCGTCGTTC CGATCACGCA GCGGGCCGAC GGCCGGTTCG AGATCGCCGG AGACGGCCGG GTGGTCGAAT GGGCGATCGA TATGACGCGG TTCGACGAGC GGCAAACCGT CGATCTTCTG GCCGAAGCCG CGCCGCCGGA AACCGCGCTG CTGCTCGACA TCGCCGAGGT CATCGCAGCC TCGCACGCGG CGGCGCCGAT CGTCGCACGC GGTCCTTGGA TCGATTCTAT CTTGCGGATC GTTGCCGGTA ACACCAAAGC GTTCCGCGCC GGCGGCTTCG ACGGAGCGGC GATCGCGGCG CTGGATGCCG CCAGCCGCGC CGGCTTTGCG CGGCTGCAAC CGCTGCTCGA TCGACGCGGC GAGCAGGGGT ACGTGCGACG CTGCCACGGC GACCTGCACC TGGCGAACAT CGTGGTGATC GACGGCAAGC CGGTGCTGTT CGACGCCATC GAATTCGATC CCTCGATCGC GTCGACCGAT GTGCTGTACG ATCTCGCCTT CGTGGTGATG GATTTCATCC ACTACGACCG CGCAGCGGCC GCCAGCGTTG TTCTCAACCG TTACCTCGCC ATCACCTCCG ACCAGCATCT CGACGCGCTG TCGGCGCTTC CCTTGCTGAT GTCGATGCGC GCGGCGATCC GCGCCAATGT GATGCTGTCG CGGCCCGCAC AGGACGCCGC GCATCTGGCC GAGATCCGAC GCACCGCCGA GAGCTATTTC ACGCTGGCCT GCCGGCTGAT CGCGCCGCCG CAGCCGCGCT TGATCGCGAT CGGCGGATTG TCGGGGACCG GCAAATCGGT GCTGGCGCGC AGTCTGGCGA GCACCATCGC GCCGCTGCCG GGCGCGATCG TGCTGCGCTC CGACGTGACG CGCAAGCAGC AGTTCAACGT CAAGGACACC GATCGATTGC CGGCCGAGGC CTATCGGCCG CAAGTGACCG CTGAGGTTTA TCGGACGCTC TGCCAACGCG CGGCAAGAAT CCTCGCCCAG GGACACTCGG CGATGGTCGA TGCGGTCTTC GCACGTGAGG ACGAGCGTCG CGCGATCAGT GAGGTTGCCG AGCGGGCCCA GGTTCCGTTC GATGGCTTGT TTCTGGTCGC TGACTTGGCG ACCAGAATCG CGCGAGTCAG CAGCAGGATC GGCGACGCCT CCGATGCAAC GGCCGAGATC GCCAAAGCGC AGCAGGCCTA CGACGCCGGT GTCATCGATT GGACGATCGT CGATGCCGCG GGCACACCCG ACCAGACGCT GCTGCGGGCA ACCGAGGCGC TCACCACTCC TCGCGCAATA CCCCGCGGCG CCGGGGCATA A
|
Protein sequence | MYRSMPFSCA PDDLQQKVFA FLANSANHGD RPVHVVTTHG AAVFLAGDRA LKIKRAVRFP YLDYSTLDKR KAACDQEVSI NRLFAPQIYR GVVPITQRAD GRFEIAGDGR VVEWAIDMTR FDERQTVDLL AEAAPPETAL LLDIAEVIAA SHAAAPIVAR GPWIDSILRI VAGNTKAFRA GGFDGAAIAA LDAASRAGFA RLQPLLDRRG EQGYVRRCHG DLHLANIVVI DGKPVLFDAI EFDPSIASTD VLYDLAFVVM DFIHYDRAAA ASVVLNRYLA ITSDQHLDAL SALPLLMSMR AAIRANVMLS RPAQDAAHLA EIRRTAESYF TLACRLIAPP QPRLIAIGGL SGTGKSVLAR SLASTIAPLP GAIVLRSDVT RKQQFNVKDT DRLPAEAYRP QVTAEVYRTL CQRAARILAQ GHSAMVDAVF AREDERRAIS EVAERAQVPF DGLFLVADLA TRIARVSSRI GDASDATAEI AKAQQAYDAG VIDWTIVDAA GTPDQTLLRA TEALTTPRAI PRGAGA
|
| |