Gene Information |
Locus tag | RPB_3529 |
Symbol | |
ID | 3911331 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 4039254 |
End bp | 4040864 |
Gene Length | 1611 bp |
Protein Length | 536 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637885431 |
Product | hypothetical protein |
Protein accession | YP_487135 |
Protein GI | 86750639 |
COG category | [R] General function prediction only |
COG ID | [COG0433] Predicted ATPase |
TIGRFAM ID | |
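
The coordinate and length fields above are consistent with each other, and the arithmetic can be sketched as below. This is a minimal illustration, not the database's own method. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon. The function names `gene_length` and `protein_length` are hypothetical and chosen only for this example.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumption: the CDS is a whole number of codons and, when
    includes_stop is True, its last codon is a stop codon that
    contributes no residue.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if includes_stop else codons


# Values taken from the record for RPB_3529.
length_bp = gene_length(4039254, 4040864)
print(length_bp)                  # 1611
print(protein_length(length_bp))  # 536
```

Under those assumptions, 537 codons minus one stop codon gives the 536 aa listed under Protein Length.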
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.370836 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGGCGA GCGAAACCGG TCCGGACGAT AGCGACGGCA AGATCTTCAT CGGCAAAGGC GAGCAGCCGG CCTGGCTGAC GCTCGGCCTC GGCAATCGCC ACGGCCTCGT CACCGGCGCC ACCGGCACCG GCAAGACGGT GACGCTGCAG GTGATGGCCG AAGGTTTTGC GCGCGCCGGT GTGCCGGTGT TCGCCGCCGA CATCAAGGGC GATCTCTCCG GCATCGCCGA AGTCGGCGAG GCCAAGGACT TCATCCTGAA GCGCGCCGCC GCGATGGGGC TGGCGTTCCA GCCCGATCAA TTCAGCACGG TGTTCTGGGA CGTGTTCGGC GAGCAGGGCC ATCCGGTGCG CGCCACCGTC TCGGAGATGG GGCCGCTGCT GCTGTCGCGG ATGCTCGATC TCAACGACGT CCAGGAGGGC GTGCTCAACG TCGCGTTCCG CGTCGCCGAC GACATGGGCC TGCCGCTGGT CGACATGAAG GATCTGCGCG CGATGCTCGA CGCGATCGCG CCGATCGCCG CCAAGGTCGC CGAGAACGGC GACGTCAATG CCGACATCCG CCAGGCCGCG GCGTCGCTCG GCAACGTCAC CAAGCAGACC GTCGGCACCA TCCAGCGGCA ATTGCTGGTG CTGGAGAACC AGGGCGGCGC CAGCTTCTTC GGCGAACCGG CGCTGCAGCT CAAGGATTTC ATCCGCACCG ACGGCCAGGG CCGCGGCGTG GTCAACATCC TGACCGCCGA CAAGCTGATG TCCAATCCGC GGCTGTACGC GACCTTCCTG CTGTGGATGC TGTCTGAACT GTTCGAGGAA CTGCCCGAAC TCGGCGACCC CGACAAGCCG AAGCTGGTGT TCTTCTTCGA CGAGGCGCAT CTGCTGTTTA ACGACGCGCC GAAGCCGCTG ATGGATAAAA TTGAACAGGT CGTGCGATTG ATCCGCTCAA AAGGCGTCGG CGTGTACTTC GTGACGCAGA ATCCGATCGA CGTGCCGGAT CGCGTGCTGG CGCAGCTCGG CAATCGGGTG CAGCATGCGC TGCGCGCCTT CACGCCGCGC GACCAGAAGG CGGTGGCGGC GGCGGCGGAG ACGTTCCGGC CCAATCCGAA GCTCGACACC ACGAAGGCGA TCACCGAGCT CGGCAAGGGC GAGGCGCTGG TGTCGTTCCT CGAAGGCAAC GGCACGCCGG CGATGGTCGA GCGGGTGATG ATCCGCCCGC CGTCGGCGCG GATCGGCACG ATCACGCCGG AGGAACGCGC CGCGATCATC AAGGCGAGCC CGATGAAGGG CAAATACGAC ACCGCGATCG ATTCGGAATC GGCTTACGAG AAGTTGCGCG ACCGCATCGA AGGCAAGAAT GCCGGGGCCG AAGCCGCGCC GGGCGAGGGC GGGCTGCTCG GCCAGCTCGG CAGCATCGTC TCCACCGTGT TCGGCACCAA CACGCCGCGC GGCAAGCTCA CCACCGGGCA GGTGGTGGCG CGCGGCGTCG CCCGCACCGT CGCCACCACG GTGGTCGGCG GCATCGCCGC GCAGCTCGGC AAGAAGGTCG GCGGCGCGAT GGGCAGCTCG GTCGGCCGCT CGATCGTGCG CGGCACGCTG GGCGGCCTGC TGCGGCGGTG A
|
Protein sequence | MTASETGPDD SDGKIFIGKG EQPAWLTLGL GNRHGLVTGA TGTGKTVTLQ VMAEGFARAG VPVFAADIKG DLSGIAEVGE AKDFILKRAA AMGLAFQPDQ FSTVFWDVFG EQGHPVRATV SEMGPLLLSR MLDLNDVQEG VLNVAFRVAD DMGLPLVDMK DLRAMLDAIA PIAAKVAENG DVNADIRQAA ASLGNVTKQT VGTIQRQLLV LENQGGASFF GEPALQLKDF IRTDGQGRGV VNILTADKLM SNPRLYATFL LWMLSELFEE LPELGDPDKP KLVFFFDEAH LLFNDAPKPL MDKIEQVVRL IRSKGVGVYF VTQNPIDVPD RVLAQLGNRV QHALRAFTPR DQKAVAAAAE TFRPNPKLDT TKAITELGKG EALVSFLEGN GTPAMVERVM IRPPSARIGT ITPEERAAII KASPMKGKYD TAIDSESAYE KLRDRIEGKN AGAEAAPGEG GLLGQLGSIV STVFGTNTPR GKLTTGQVVA RGVARTVATT VVGGIAAQLG KKVGGAMGSS VGRSIVRGTL GGLLRR
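
The sequences above are printed in blocks of ten characters, so the spacing has to be removed before any analysis. The sketch below shows one way to do that in Python, along with a GC-fraction calculation and a simple translation. It is an illustration under stated assumptions, not the tool that produced this record. The helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical. The codon assignments used are those of the standard genetic code, which translation table 11 shares (table 11 differs in which codons may act as start codons). The gene sequence is assumed to be given in coding orientation, as its leading ATG suggests, so no reverse complement is taken even though the gene lies on the minus strand.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean_sequence(raw: str) -> str:
    """Strip the block spacing (and any other whitespace) and uppercase."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(raw: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Raises KeyError for codons containing characters other than A, C, G, T.
    """
    seq = clean_sequence(raw)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence in this record.
first_blocks = "ATGACGGCGA GCGAAACCGG TCCGGACGAT"
print(translate(first_blocks))  # MTASETGPDD
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the GC content and protein sequence fields to be cross-checked in the same way.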