Gene Information |
Locus tag | RPB_2734 |
Symbol | |
ID | 3910527 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 3117834 |
End bp | 3118841 |
Gene Length | 1008 bp |
Protein Length | 335 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637884634 |
Product | hypothetical protein |
Protein accession | YP_486347 |
Protein GI | 86749851 |
COG category | [R] General function prediction only |
COG ID | [COG2607] Predicted ATPase (AAA+ superfamily) |
TIGRFAM ID | |
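The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon; the function names are hypothetical and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for RPB_2734.
length = gene_length_bp(3117834, 3118841)
print(length)                     # 1008
print(protein_length_aa(length))  # 335
```

Under these assumptions the computed values agree with the Gene Length (1008 bp) and Protein Length (335 aa) fields.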
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.333936 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.444885 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCAAGA CAACCAAATC CAAGACAACC CAATCCAAGA CGTCCCGCCC CAAATCCGCC CGCCCGTCCC GCCCCGCCGC CAAGAAGCCC GCCGCCAAGC GCCGCGGCAC CGCGCCCAGG CCGGCCGCGG CAGCCGCGGA TTCATCGACG CTGGAGCGCA TCGCGCGCGC ACTGGAGGGC ATTTCGGCGC ATTTGGCGGG GACTTCGGCG CAGCCGGCCG CCGCGACGCT GGCCTCCGCC GAGGCTTTCA TCTGGCACCC CGAGGGGCGA CTGGCGCCGG TTCCCAAGGT CAGCCGGGTC GAACTCGACC TGCTGCAGGG CATCGACCGG ATGCGCGACA CCCTGATCGA CAATACCGAA CGGTTCGCGA CCGGCCTGCC CGCCAACAAC GCCTTGTTGT GGGGCGCGCG CGGCATGGGC AAATCGTCGC TGGTCAAGGC CGCTCACGCT CATGTCAACG CGCGGCCCGA CGTCGCCGGC ACGCTGAAGC TGATCGAGAT CCACCGCGAG GACATCGAAA GCCTGCCGGC GCTGATGACG CTGCTGCGGG AGTCGGATGA TCGCTTCATC GTGTTCTGCG ACGACCTGTC GTTCGACGGC AACGACGCTT CGTACAAATC GCTGAAGGCG GTGCTCGAGG GCGGCATCGA AGGCCGGCCG GACAATGTGA TTCTCTACGC CACCTCCAAC CGGCGGCATC TGCTGGCGCG CGAGATGGTC GAGAACGAGC GCTCGACCGC GATCAATCCC GGCGAAGCCG TCGAGGAGAA GGTATCTCTG TCGGATCGCT TCGGGCTGTG GCTCGGCTTC CATAAATGCA GCCAGGACGA ATTCCTGACG ATGGTGCGCG GCTATTGCGA CCATTACGGC ATTCGGGTCG ATGACGAGCA GCTCGAGCGC GAGGCGCTGG AATGGTCGAC GACGCGCGGC TCGCGCTCCG GCCGCGTCGC CTGGCAATTC GTGCAGGACC TCGCCGGTCG GATGAAGGTG CGGCTCGGCG GCAAGTAA
|
Protein sequence | MAKTTKSKTT QSKTSRPKSA RPSRPAAKKP AAKRRGTAPR PAAAAADSST LERIARALEG ISAHLAGTSA QPAAATLASA EAFIWHPEGR LAPVPKVSRV ELDLLQGIDR MRDTLIDNTE RFATGLPANN ALLWGARGMG KSSLVKAAHA HVNARPDVAG TLKLIEIHRE DIESLPALMT LLRESDDRFI VFCDDLSFDG NDASYKSLKA VLEGGIEGRP DNVILYATSN RRHLLAREMV ENERSTAINP GEAVEEKVSL SDRFGLWLGF HKCSQDEFLT MVRGYCDHYG IRVDDEQLER EALEWSTTRG SRSGRVAWQF VQDLAGRMKV RLGGK
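As a minimal sketch of how the sequence fields relate to the GC content and translation table entries, the code below strips the spacing between 10-character blocks, computes the G+C fraction, and translates a coding sequence with the amino acid assignments of NCBI translation table 11. The helper names are hypothetical. The sketch ignores the table 11 rule that alternative start codons are read as methionine, which does not matter here because the gene starts with ATG.

```python
BASES = "TCAG"
# Amino acids for NCBI translation table 11, codons ordered TTT, TTC, TTA, ...
# ('*' marks a stop codon).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove the whitespace between sequence blocks and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(raw: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    seq = clean_sequence(raw)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks of the gene sequence from the record.
print(translate("ATGGCCAAGA CAACCAAATC CAAGACAACC"))  # MAKTTKSKTT
```

Passing the full Gene sequence field to `gc_content` and `translate` gives values that can be compared with the GC content and Protein sequence fields of the record.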