Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_3675 |
Symbol | |
ID | 3911477 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 4214988 |
End bp | 4216205 |
Gene Length | 1218 bp |
Protein Length | 405 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637885577 |
Product | VWA containing CoxE-like |
Protein accession | YP_487281 |
Protein GI | 86750785 |
COG category | [R] General function prediction only |
COG ID | [COG3552] Protein containing von Willebrand factor type A (vWA) domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGCGTA ACCCGATGAC CGCGATGGAT CACCTCAACC CGCCGACCGG CAAGATGGCC GACAACATCG TCGGCTTCGC CCGCGCGCTG CGCGCGGCCG GCCTGCCGGT CGGGCCGGGC GCGGTGATCG ATGCGCTGGA GGCGTTGCAG CTCATCGACA TCGGCAACCG CGCCGATCTC TACGCCACGC TGGAGGCGAT CTTCGTCAAG CGTCGCGAGC ACGCGCTGAT TTTCGCGCAG GCGTTCGCAC TGTTCTTCCG CGCCGCCGAA GAATGGCAGC ACATGCTGGA TTCGATCCCG CTGCCCGATC ACGCCAAGAA GAAGCCGCCG CCCGCCTCGC GCCGCGTGCA GGAAGCGATG GCGCCGTCCA CCACGCGCGA TTTCCCCGCC GCCGAAGAGC AGGAAGTGCG ACTCGCGGTG TCCGACAAGG AGATCCTGCA GAAGAAGGAC TTCGCCCAGA TGAGCGCGGC GGAGATCGCC GAGGTGACGC GGTCGATCGC GCGGATGCGC CTGCCGCAGG CGGAATTGCG CACCCGCCGC GTCCGCCCGG ACAAGCGCGG CCTCAAGCTC GATCTGCGCC GCACGCTGCG CGCGTCGCTC CGCACCGGTG GCGACATCGT CGATATTCGC AAGCTCGGGC TGATCGACAA GCCGGCGCCG ATCGTGGCGC TCCTGGATAT CTCCGGCTCG ATGAGCGAGT ACACGCGGCT GTTCCTGCAT TTCCTCCACG CCATCACCGA CGACCGCAAG CGCGTCTCGA CCTTCCTGTT CGGCACGCGG CTGACCAACG TCACCCGCGC GCTGCGGGCG CGCGATCCGG ATGAGGCGCT GGCGAGCTGC ACCTCGTCGG TCGAGGACTG GGCCGGCGGG ACGCGGATCG CGACCTCGCT GCACGGCTTC AACAAGCTGT GGGCGCGCCG CGTGCTCGGG CAGGGCGCGA TCGTGCTGCT GATTTCGGAC GGGCTCGAGC GCGAGGCGGA CTCCAAGCTG GCGTTCGAGA TGGACCGGTT GCATCGCTCC TGCCGGCGGC TGATCTGGCT CAATCCTCTG CTGCGGTTCG GCGGCTTCGA ACCGCGCGCG CAGGGCATTA AAATGATGCT GCCGCACGTT GACGAATTCC GCCCGGTGCA TAACTTGACC TCGATGCAGG GGCTGATCGA GGCGCTGTCG TCGGCGCCGC CGCCGCACCA TTTCAGCGCG ATCCGCTCCG CCGCCTGA
|
Protein sequence | MQRNPMTAMD HLNPPTGKMA DNIVGFARAL RAAGLPVGPG AVIDALEALQ LIDIGNRADL YATLEAIFVK RREHALIFAQ AFALFFRAAE EWQHMLDSIP LPDHAKKKPP PASRRVQEAM APSTTRDFPA AEEQEVRLAV SDKEILQKKD FAQMSAAEIA EVTRSIARMR LPQAELRTRR VRPDKRGLKL DLRRTLRASL RTGGDIVDIR KLGLIDKPAP IVALLDISGS MSEYTRLFLH FLHAITDDRK RVSTFLFGTR LTNVTRALRA RDPDEALASC TSSVEDWAGG TRIATSLHGF NKLWARRVLG QGAIVLLISD GLEREADSKL AFEMDRLHRS CRRLIWLNPL LRFGGFEPRA QGIKMMLPHV DEFRPVHNLT SMQGLIEALS SAPPPHHFSA IRSAA
|
| |