Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_3838 |
Symbol | |
ID | 3911641 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 4383488 |
End bp | 4384798 |
Gene Length | 1311 bp |
Protein Length | 436 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637885738 |
Product | hypothetical protein |
Protein accession | YP_487442 |
Protein GI | 86750946 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.44323 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTATCCG TCGTCGCCCG ATCGTCGCGC GCCCGGCAGA CTCTGGCGCA GACCCGCGAC TGGCTGATAC GCATGCGGGC CGGCGAGAAG CTGTTCATCC TGTCGATGTG CCTGATCTAT GCGCTGCACA GCATCTGGCT GGCGCGCACC GTCTTGTGGT TCCTGGTGCT GCCGGTGATG CTGATCACGG CGGCGCCGTA TCGCAACCTG CTGCCGATCG CGAAATCGGG CGTGTTCATC GCTTCGGCGG TGTTTCTGTT GCTGATCATC GGCACCTCGG CGCTCGGCGG CGAGACCCCG TGGCCGATGC TGCTGCGAAA CCTGCGCTAC TTCGCGGCGG TCGTCGCCTT CGTGGCGATC GTCGCGCAAT TGGTGCGCGG CGACGGCGAT TTCCTGCGGC TGCTGTTCCT GGTGCTGGCG CCGGTGGCGG CGCTGGCGGC GATCCGCGAC GTGGTATCGT TCACCGGTCT GTCGCTGGAG ACGCTGATGA CCACCCGGCT GCAGGGCGTC AGGGGGCTGA CGGTCTACTA CAACTCCAAC GTCATCGGCA TGATGTTCGC GATGCCCTGC GTCGGCGCGG CGGCGGTGAT GGCGTCGCGC AGGTTACGGC GATGGCAGTT CGCGATGCTG GCGGCGTCGG TTCTGATCCT GCTCGGCGCC GTCGTGCTCA CCGGCAGCCG CGGCTCGCTG ATCGCCGCGA CGGTCGGCAT CGGCGTGTCG ATCCTGTCGG CGAATTGGCG GCTGGCGGCG GCGATCGTGG CGCTGGTGGC GGTCGCGGCG GCGGTCACGC TGCTGACCCC GCTCGCCGGC GAGTTGCTGC AGCGGCGCGA CAGCCTGCGA TTGACGCTGT GGCCGATCTA TTTCGACATG GCGATGCTGA AGCCGTGGCT CGGCTATGGT CTCGCTTTCG ACACCCAGCG CATGCTGCCC GATGGCACCA TGGTGATGAA CGGACACAAC ATCTTTCTGT GCGCGGCGGT GAGGGGCGGC GTGTTCAGCG CGCTGGCGCT GTTCGGCATC GTGCTGGCCT CGTTGGTGAG CGCGCTGCGG GCGTGGTTGC GCCGCCGCGA GATCCTGGGG CTGGCGTTGC TGGCGGTCTG CCTCACCGCC ACCACCGTCG ACTACGAGAT CATCCCGACC GACCTCGGCT ATTTGTACGT TCTGTTCTGG CTTCCCGTCG GCATCTGCCT CGGCGCGGCG CTGGCCGAGG CGCCGCGCGG GCTTTCGGGC CGACACGCCG GTGCCGCGGG AACCGCTGCC GACGCGGCGC CGACCGTGAC GTCACACAAT CGCCAGCCGG CGGGTGCTTA G
|
Protein sequence | MVSVVARSSR ARQTLAQTRD WLIRMRAGEK LFILSMCLIY ALHSIWLART VLWFLVLPVM LITAAPYRNL LPIAKSGVFI ASAVFLLLII GTSALGGETP WPMLLRNLRY FAAVVAFVAI VAQLVRGDGD FLRLLFLVLA PVAALAAIRD VVSFTGLSLE TLMTTRLQGV RGLTVYYNSN VIGMMFAMPC VGAAAVMASR RLRRWQFAML AASVLILLGA VVLTGSRGSL IAATVGIGVS ILSANWRLAA AIVALVAVAA AVTLLTPLAG ELLQRRDSLR LTLWPIYFDM AMLKPWLGYG LAFDTQRMLP DGTMVMNGHN IFLCAAVRGG VFSALALFGI VLASLVSALR AWLRRREILG LALLAVCLTA TTVDYEIIPT DLGYLYVLFW LPVGICLGAA LAEAPRGLSG RHAGAAGTAA DAAPTVTSHN RQPAGA
|
| |