Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_1369 |
Symbol | |
ID | 3908474 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 1560271 |
End bp | 1561914 |
Gene Length | 1644 bp |
Protein Length | 547 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637883263 |
Product | hypothetical protein |
Protein accession | YP_484990 |
Protein GI | 86748494 |
COG category | [S] Function unknown |
COG ID | [COG2187] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCATGA TGACGGCCAC TGGCGTTCGC GGTTCTGCTC CACTTCCAGA CAGTTCGGAA TTTCTCCAGG CAGTCATGAT GGCAGAACCG ATCCCCGCAT CCGGCGCGGT CCGACAGGAG GACATCCTCC AATTCGTCGC CGACCCGGCG ACGCATGGTG GCTTGCCGGT CAGACGTATC GACACCCATG GCGCCGCGGT GTTTCTGGTC GGCGACCGCG CGCTGAAGAT CAAGCGGGCG GTGCGGTTCC CGTTCCTCGA TTATTCGACG CCGGCGCGGC GCAAGATCGC CTGCGAGCAG GAACTCGTGG TCAATCGCCG GTTCGCGCCG CAGATCTATC GCCGGGTCCT GCCGATCACC CGCAAGGCCG ACGGAACGCT CGAGCTCGGC GGGGACGGCG AAGCGATCGA ATGGGCGGTC GACATGATGC GTTTCGACGA ACGCCGCACG GTCGACCATC TCGCCGCCGC AGGCCCGCTC GCGCCGGATC TCGTCACCGC CATCGCCGAC GTGATCGCTG CGTCGCATCG TTCCGCGCCG GAAGCCGCCA CCGCGCCCTG GATTGCCTCG ATCGAGCTGA TCATCGCCGA CAACACCGCC TCGTTCACAT CCGGCGGCTT TCCGGCCGAT CAGATCGCAG CGCTCGATCG CGCCAGCCGC GCTGCGTTCG AACGCCACCG CGGCCTGCTC GGCCAGCGCG GCGCGCAAGG GTTCGTGCGC CGGTGCCACG GCGACCTGCA TCTGGCCAAC ATCGTGGTGA TCGACAGCGC GCCGGTGCTG TTCGACGCGA TCGAGTTCGA TCCTTTGATC GCCTCGGTCG ACGTGCTGTA CGACCTCGCC TTCCCGCTGA TGGACTTCGT CCATTACGGC CGCTGCGGCG CTGCCGCCGA ACTGCTGAAC CGCTATCTGG CGATCACCCC CTCACAGAAC GACGACGCGT TCGGCCTGCT CCCGCTGATG CTGTCGATGC GCGCCGCGAT CCGCGCCAAG GTGATGCTGT CGCGGCCGGC CGACGACGCC GGAACGATGC GCGGCAACCG GGAGACCGCC GGCGCCTATT TCGCCCTCGC CGCACGCCTG ATCTCGCCGC CCCAGCCGCG GCTGCTCGCC GTCGGCGGAC TGTCCGGAAC CGGCAAGTCG GTGCTGGCCC GTGCGCTGGC TGGTCGTATC CCGCCGTTGC CCGGCGCTGT CGTGCTGCGG TCCGACGTCG CACGCAAGCG GCTGTTCGGC GTCGGCGACA CCGAACGGCT GCCGCCGACC GCATATTCGC CCGAGGTGAC GGCCGAGGTC TATCGCGGCC TCGGCGAGCG CGCCGCCCAT ATTCTGGCGC AAGGGCATTC GGTGATCGTC GATGCGGTGT TCGCCAAGGC CGAGGAGCGG CAGGCGATCG AAGCCATTGC GACGGACGCC GGGTGCGCGA TGCTCGGGTT GTATCTCATC GCCGATCTGG CGACCCGGAT CGATCGCGTC AGCCGCCGCG TCGGCGACGC CTCGGACGCA ACGCCCGACA TCGTCCGGCA GCAGCAAGCC TACGCGCAGG ACGCTGTCGG CTGGACCGAG ATCGACGCCG CCGGCACGCC AGACCAAACG CTGGCCAGCG CCAAGGCGGC GCTGGGGCGC GACGATCAGG CCTGCAGCAC GTAG
|
Protein sequence | MSMMTATGVR GSAPLPDSSE FLQAVMMAEP IPASGAVRQE DILQFVADPA THGGLPVRRI DTHGAAVFLV GDRALKIKRA VRFPFLDYST PARRKIACEQ ELVVNRRFAP QIYRRVLPIT RKADGTLELG GDGEAIEWAV DMMRFDERRT VDHLAAAGPL APDLVTAIAD VIAASHRSAP EAATAPWIAS IELIIADNTA SFTSGGFPAD QIAALDRASR AAFERHRGLL GQRGAQGFVR RCHGDLHLAN IVVIDSAPVL FDAIEFDPLI ASVDVLYDLA FPLMDFVHYG RCGAAAELLN RYLAITPSQN DDAFGLLPLM LSMRAAIRAK VMLSRPADDA GTMRGNRETA GAYFALAARL ISPPQPRLLA VGGLSGTGKS VLARALAGRI PPLPGAVVLR SDVARKRLFG VGDTERLPPT AYSPEVTAEV YRGLGERAAH ILAQGHSVIV DAVFAKAEER QAIEAIATDA GCAMLGLYLI ADLATRIDRV SRRVGDASDA TPDIVRQQQA YAQDAVGWTE IDAAGTPDQT LASAKAALGR DDQACST
|
| |