Gene Information |
Locus tag | RPB_2533 |
Symbol | |
ID | 3910322 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 2896598 |
End bp | 2898997 |
Gene Length | 2400 bp |
Protein Length | 799 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637884432 |
Product | hypothetical protein |
Protein accession | YP_486149 |
Protein GI | 86749653 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
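The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the final codon of the CDS is a stop codon that is not counted in the protein length. Both assumptions are consistent with the values in this record but are not stated by it. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for locus tag RPB_2533.
length = gene_length_bp(2896598, 2898997)
print(length, protein_length_aa(length))  # prints "2400 799"
```

The result matches the Gene Length (2400 bp) and Protein Length (799 aa) fields listed above.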
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTTTCTT GGCCGGCCCG CCCCGGCGGC GCGTTCGTGG CGACCTGCGC GCTTCTCAGG GAAGCCCGGT CTTCCGGCCG CCTCGCCCAC GCGACCATCG CCTATTGGCC CTGGCGCGAG GGAGCCAAGC AGGCGGCCCG GTCGATCCTC GTTAATCCGC AGGACATCGC GGACGTCTCC CTGGCCGTCT GCAACGCTCC GGACACTTCC TGGCAGGAAC GTACCCTGGC GCACATGTCG CTCTGCATGA TCGAGATGCG CCTTCGCGAT CTGAAGCCCA GGGCTCGGGC GTCTGGCGAG ATCGTCGTAC GAAATCCGAC CTTGCTTGAG ACAACTTCGG TTTTCGCTCC TGCGAACTCC AAGGGTGAAG GGGCCTATCT TCCGGACTCC GAGCAAGTGC TTCGCCGAGT TCGGGACTAC ACATCAATGG GAGAGCCGCA CGCCGGGCTG ACCGAGCATG TGAGAGCTGT GGGTGACCCG GACTCAACCC CGTTCGCCCT GTTCGGTCTT CCACCGGCGA CGGCGCCCGA AGGCCTCGCT CGCTATCTGG GAGCTCCACG CATCAAGGAC TTGGGACTCG ATCTTGTTGT CGCCGATCTG ACCAGCACCG GTCGATCCGA GATCCGTAAC TCCTGGGAAA AGAAGCTGGG CGTCTTGTTG ACTGCTCTCG GTGCGATCCA GGGTCGGCGC CCGGCGCTAT TCGTCCTGAC GGACGATAGC TTCACCCATC GGCGCGCATT CAGTCTTCTG CGCATGCATG CCGAGAAGCG GCGTCCGAAG ATCAGGGCGC AACAACTGGG ATTGTTTCTG GAGAAACCGA CGCTGCGCGG GCCGGCGGCC GAGCCGCCGA GGGATACTGC GCCGCTCTCC GTCCAGGCGG ACATCAAGGA TGCCGCCCTG GCTCCTCTTC GACAGGAATT GCTGGCGATC GGCGGCAAGC TGCGCGATTA TGGAGCTGAC GACGACGCGG ACCAGATCAA GCGTGCCCTG GCTTTTGTCA GACGCTCAGC CTCGCTTCCC CTCGGCATGC GCGAAGCTCG CGACATCACC GATGTCCTCT ACGACGAAGT CGGAGAGTTC GACGATGCCC TCAAGAAGCT TTTCCGTCCC AAGATGGCCC TCAGCGACCT CCTCGCGGTC GGCCTGCGTC AGCCCACGTT CGCACCAGCC ATCGACGCGG TCGTCCAGCA GATCGAACGC AAAGTCGCGA ATTGGGAAGA GGATACCCCG GTCGCGGCAA AGCTCGCCGA ACTACTCACA TCCGCCGATA TCAATTCTGG TAAGACCTCG ATCGCACTGC CCGGCCGACG CATCAGCGAG GTCTACCTGG CTTCCGATCG GGCGGTCCAT TGCAACTGCG CCATTGTCGA CCATCACAGT CTGCTCGATC ATCTCGAAGG TCAGGATCCG GAGAGGCTGA TCGTCATCGG TCCGACGCCG GAATCCATCC GCGCCTTGCT GACCGCCCGC AAGGTCCCCA GTACGGTCTA TCTCCTCGGC GACGCTGCAG GAAGCTCCCT GCTCTCATCC GAATTGGCCG CGATCGAAAC TATTCCGGAA TTCTCCCAGT TCGCCGTCAG AGCCAAAGCC TTGACGACGG CGCTGCGACG CGGCGGGGCC GACGAGTCTC TAGATCAAGC GGAGGCAGAA TTCCATGCCG CACCGCTCGT CAAGGAAAGA GGAGTCGATT TTACGCAATC CGACGGCAGG TATCGCGGTG ACGTCGTTCA TCTCATGATG CAGAGCGGGA TCCGGCTGGA TTATCGCCCG GGCGGCGAAG TCCTCAAACA GTCCCCGGGC 
GAATTGAGAC CATTCGAACG GGCACCCGCG CGGGAGATCA GGAAGGATGA CCGCATCCTC GTCCTTGATG CTTCGATCCG CGAGCCGCTC CGGCTCGCAC TCGCGACGTC CCGCACCAGT CAAGCAGGGC TCTGCGTCTA TCATGGCGAG ATCGAAAGGA TCCGCACCAG ACTACCGGGG GGAACGATAG CCGAAAAGGC GCGGCACGTC CTTGCGATCA TGAAACGGAT CGACGCGACC GTCGGCGATG AGCAGTACAA CATTCAACGG TGGCTGAGAG CCGACATCGC GCCAGCGACG GCCATTGGGA CGCGAGCGCC GGGGGCCGCC CGTGACTGGA ATCGCTTCCG GATATTCATG GAGGCGGTTG GCGTCGATAG CCAGATGGCC GAAGTGTACT GGAAAGCCGC CGTCCTTCCG ACGCGTTCGT ACCGAGCCCA CGAGGGACAC CAATTCAATC AGCGCGTCGT GAGCTTCGTG CTGGACAAGG AGGCCGAAGA GGCCTGGAAG ACGAAGCAAG GACTGTGGCA GCAGGTGCTG GAGTCCGTCG ACGTCGTCAT CGACGTCGAG AAAAAATCCG TCGGAGCGAG CAATGGCTGA
|
Protein sequence | MLSWPARPGG AFVATCALLR EARSSGRLAH ATIAYWPWRE GAKQAARSIL VNPQDIADVS LAVCNAPDTS WQERTLAHMS LCMIEMRLRD LKPRARASGE IVVRNPTLLE TTSVFAPANS KGEGAYLPDS EQVLRRVRDY TSMGEPHAGL TEHVRAVGDP DSTPFALFGL PPATAPEGLA RYLGAPRIKD LGLDLVVADL TSTGRSEIRN SWEKKLGVLL TALGAIQGRR PALFVLTDDS FTHRRAFSLL RMHAEKRRPK IRAQQLGLFL EKPTLRGPAA EPPRDTAPLS VQADIKDAAL APLRQELLAI GGKLRDYGAD DDADQIKRAL AFVRRSASLP LGMREARDIT DVLYDEVGEF DDALKKLFRP KMALSDLLAV GLRQPTFAPA IDAVVQQIER KVANWEEDTP VAAKLAELLT SADINSGKTS IALPGRRISE VYLASDRAVH CNCAIVDHHS LLDHLEGQDP ERLIVIGPTP ESIRALLTAR KVPSTVYLLG DAAGSSLLSS ELAAIETIPE FSQFAVRAKA LTTALRRGGA DESLDQAEAE FHAAPLVKER GVDFTQSDGR YRGDVVHLMM QSGIRLDYRP GGEVLKQSPG ELRPFERAPA REIRKDDRIL VLDASIREPL RLALATSRTS QAGLCVYHGE IERIRTRLPG GTIAEKARHV LAIMKRIDAT VGDEQYNIQR WLRADIAPAT AIGTRAPGAA RDWNRFRIFM EAVGVDSQMA EVYWKAAVLP TRSYRAHEGH QFNQRVVSFV LDKEAEEAWK TKQGLWQQVL ESVDVVIDVE KKSVGASNG
|
| |
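The sequences above are displayed in space-separated groups of ten characters, so they need to be cleaned before any downstream use. The following is a minimal sketch, with hypothetical helper names, of how the grouped gene sequence could be normalised and sanity-checked. The start-codon check accepts only ATG, which is stricter than translation table 11 (that table also permits alternative start codons); this simplification is an assumption of the sketch.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(grouped: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """True if seq is a multiple of 3 long, starts with ATG and ends with a stop codon."""
    return len(seq) % 3 == 0 and seq[:3] == "ATG" and seq[-3:] in STOP_CODONS


# First three display groups of the gene sequence above, used only as sample input.
sample = clean_sequence("ATGCTTTCTT GGCCGGCCCG CCCCGGCGGC")
print(len(sample), round(gc_fraction(sample), 2))
```

Applying `clean_sequence` to the full Gene sequence field and passing the result to `gc_fraction` and `looks_like_complete_cds` would allow comparison with the GC content and length fields reported in the Gene Information table.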