Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_1331 |
Symbol | |
ID | 3907839 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 1518009 |
End bp | 1519631 |
Gene Length | 1623 bp |
Protein Length | 540 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 637883225 |
Product | hypothetical protein |
Protein accession | YP_484952 |
Protein GI | 86748456 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.239363 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTGATT ACTATCCGCT GATCGCACGC GCCATATCCG GCCTGGACCC CAGTGCTCCG GGCGAGCAGC GCCGTGCGAT CTACGAGCGG GCGCGCTCGG CCTTGATCAC GCAGCTGCGC GGCGTCCAGC CGCCGCTGAC CGAATCCGAA ATCACCCGCG AGCGTCTGGC GCTCGAGGAG GCCGTGCGCA AGGTCGAGTC CGAGGCCGCG CAGCGGTCCC GCGACGCCGC CCGCGCCGAG TTGAAGAACC GCCGCCCCGC CGGCGACCCC GCGCGTCCCG GCGATTCGCT GCGGGCGTCG AGCCGCCCGG CGCCGCGTCC GGGCGATCCG CCGCCGCAGG TCTCGCGCGC GCCGCTGCCG CCGACCGCGG CTCAGGCCGA CGCCGAGCCG CCGCCGATCC GTCCGCGATC GCAACCGCCG GCGCCGCCCG CGCCGCCGCG CGACGAGCGG GCGCAGCGCA ATCTTCGCGT CGAGCCGCCG CCGATTCCGC CGGAGCCGCC GCTGCCCGGC CGCGAGCGCC CGGCGCCGCG CCGGCCCGAT CAGGGTCCGG CCGCGAATCA GGGCGCCGAC AATGCGGGGC TGCGCGGCTT CCGCGACGTC ACCGCCGATC TGGCCGATCT CGGCCAGGCC GCCGCGCAGG CCAACCGTTC GGCGCGCAAG ACCTACGCCA ATGTGGCGCC CTCGACGGAG TTCGACCGGC TCGAACCGTC GATGGAGAAC CGCACCGATC CGGACGCGCC ATATTCCTAC GACGAATCGG TCGACGAGGC GGCGCGCTAC CAGCCGCCGC CGGCCGCGGC GCGGACGCGC GTCGAGCCGG ACCGCAAGGG GCCGCCGCGA AAGCCGACCC GCCCGCCGTC GCGTTTTCCG CTGAAGAGCG CGCTGGTGAT CGGCCTGGTG CTGGTGCTGG CCGGTGCCGG CATTTTGTGG GGACCGTCGC TGTATACGTC GCTGCGCGCG ATGATGAGCT CGGCGCCGTC GACCGAGACC GCAACGCCGT CCGCGCCGCC GAGCTCGACC GAGCGGCCGA AAATCACCGA TCGCGTCGGC CAGCCGTCGA GTTCGGAGGC GATCGCCCCG GTGGCGCAGC GCGTCGTGCT GTACGACGAG GATCCGTCGG ATCCGAAGGG CAAGCAATAT GTCGGCACGG TGGTGTGGCG CACCGAGCAG ATCAAGGGCG CCAGCGCCAA GGGCGGCGCC GATCTGGCGG TGCGCGCCGA CATCGAGGTG CCCGAGCGCA AGTTCAAGAT GACGATGTCG TTCCGCCGCA ACACCGACAC CTCGCTGCCG GCGAGCCATA CGGCGGAGCT GACCTTCATC CTGCCGCAGG ATTTCAGCGG TGGCGGCGTG AGCAACGTTC CCGGCATCCT GATGAAGTCG AACGAGCAGG CGCGCGGCAC GCCGCTGGCC GGGCTCGCCG TCAAGGTCAC CGACGGCTTC TTCCTGGTCG GGCTGAGCAA TGTCGAGGCC GACCGCGCCC GCAACCTGCA GCTCCTGAAA GAGCGCTCCT GGTTCGACGT GCCGATCGTC TACACCAACC AGCGCCGCGC CATCATCGCC ATCGAAAAGG GCCCGCCCGG CGAGCGCGCC TTCGGCGAGG CCTTCGCCGC TTGGGGCGAG TAG
|
Protein sequence | MADYYPLIAR AISGLDPSAP GEQRRAIYER ARSALITQLR GVQPPLTESE ITRERLALEE AVRKVESEAA QRSRDAARAE LKNRRPAGDP ARPGDSLRAS SRPAPRPGDP PPQVSRAPLP PTAAQADAEP PPIRPRSQPP APPAPPRDER AQRNLRVEPP PIPPEPPLPG RERPAPRRPD QGPAANQGAD NAGLRGFRDV TADLADLGQA AAQANRSARK TYANVAPSTE FDRLEPSMEN RTDPDAPYSY DESVDEAARY QPPPAAARTR VEPDRKGPPR KPTRPPSRFP LKSALVIGLV LVLAGAGILW GPSLYTSLRA MMSSAPSTET ATPSAPPSST ERPKITDRVG QPSSSEAIAP VAQRVVLYDE DPSDPKGKQY VGTVVWRTEQ IKGASAKGGA DLAVRADIEV PERKFKMTMS FRRNTDTSLP ASHTAELTFI LPQDFSGGGV SNVPGILMKS NEQARGTPLA GLAVKVTDGF FLVGLSNVEA DRARNLQLLK ERSWFDVPIV YTNQRRAIIA IEKGPPGERA FGEAFAAWGE
|
| |