Gene RPB_1369 details

Gene Information

Locus tag: RPB_1369
Symbol:
ID: 3908474
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 1560271
End bp: 1561914
Gene Length: 1644 bp
Protein Length: 547 aa
Translation table: 11
GC content: 69%
IMG OID: 637883263
Product: hypothetical protein
Protein accession: YP_484990
Protein GI: 86748494
COG category: [S] Function unknown
COG ID: [COG2187] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
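
The coordinate and length fields above are related by simple arithmetic. The following minimal Python sketch shows that relationship, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(1560271, 1561914)
print(length, protein_length_aa(length))  # prints: 1644 547
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of this record.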


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCATGA TGACGGCCAC TGGCGTTCGC GGTTCTGCTC CACTTCCAGA CAGTTCGGAA 
TTTCTCCAGG CAGTCATGAT GGCAGAACCG ATCCCCGCAT CCGGCGCGGT CCGACAGGAG
GACATCCTCC AATTCGTCGC CGACCCGGCG ACGCATGGTG GCTTGCCGGT CAGACGTATC
GACACCCATG GCGCCGCGGT GTTTCTGGTC GGCGACCGCG CGCTGAAGAT CAAGCGGGCG
GTGCGGTTCC CGTTCCTCGA TTATTCGACG CCGGCGCGGC GCAAGATCGC CTGCGAGCAG
GAACTCGTGG TCAATCGCCG GTTCGCGCCG CAGATCTATC GCCGGGTCCT GCCGATCACC
CGCAAGGCCG ACGGAACGCT CGAGCTCGGC GGGGACGGCG AAGCGATCGA ATGGGCGGTC
GACATGATGC GTTTCGACGA ACGCCGCACG GTCGACCATC TCGCCGCCGC AGGCCCGCTC
GCGCCGGATC TCGTCACCGC CATCGCCGAC GTGATCGCTG CGTCGCATCG TTCCGCGCCG
GAAGCCGCCA CCGCGCCCTG GATTGCCTCG ATCGAGCTGA TCATCGCCGA CAACACCGCC
TCGTTCACAT CCGGCGGCTT TCCGGCCGAT CAGATCGCAG CGCTCGATCG CGCCAGCCGC
GCTGCGTTCG AACGCCACCG CGGCCTGCTC GGCCAGCGCG GCGCGCAAGG GTTCGTGCGC
CGGTGCCACG GCGACCTGCA TCTGGCCAAC ATCGTGGTGA TCGACAGCGC GCCGGTGCTG
TTCGACGCGA TCGAGTTCGA TCCTTTGATC GCCTCGGTCG ACGTGCTGTA CGACCTCGCC
TTCCCGCTGA TGGACTTCGT CCATTACGGC CGCTGCGGCG CTGCCGCCGA ACTGCTGAAC
CGCTATCTGG CGATCACCCC CTCACAGAAC GACGACGCGT TCGGCCTGCT CCCGCTGATG
CTGTCGATGC GCGCCGCGAT CCGCGCCAAG GTGATGCTGT CGCGGCCGGC CGACGACGCC
GGAACGATGC GCGGCAACCG GGAGACCGCC GGCGCCTATT TCGCCCTCGC CGCACGCCTG
ATCTCGCCGC CCCAGCCGCG GCTGCTCGCC GTCGGCGGAC TGTCCGGAAC CGGCAAGTCG
GTGCTGGCCC GTGCGCTGGC TGGTCGTATC CCGCCGTTGC CCGGCGCTGT CGTGCTGCGG
TCCGACGTCG CACGCAAGCG GCTGTTCGGC GTCGGCGACA CCGAACGGCT GCCGCCGACC
GCATATTCGC CCGAGGTGAC GGCCGAGGTC TATCGCGGCC TCGGCGAGCG CGCCGCCCAT
ATTCTGGCGC AAGGGCATTC GGTGATCGTC GATGCGGTGT TCGCCAAGGC CGAGGAGCGG
CAGGCGATCG AAGCCATTGC GACGGACGCC GGGTGCGCGA TGCTCGGGTT GTATCTCATC
GCCGATCTGG CGACCCGGAT CGATCGCGTC AGCCGCCGCG TCGGCGACGC CTCGGACGCA
ACGCCCGACA TCGTCCGGCA GCAGCAAGCC TACGCGCAGG ACGCTGTCGG CTGGACCGAG
ATCGACGCCG CCGGCACGCC AGACCAAACG CTGGCCAGCG CCAAGGCGGC GCTGGGGCGC
GACGATCAGG CCTGCAGCAC GTAG
 
Protein sequence
MSMMTATGVR GSAPLPDSSE FLQAVMMAEP IPASGAVRQE DILQFVADPA THGGLPVRRI 
DTHGAAVFLV GDRALKIKRA VRFPFLDYST PARRKIACEQ ELVVNRRFAP QIYRRVLPIT
RKADGTLELG GDGEAIEWAV DMMRFDERRT VDHLAAAGPL APDLVTAIAD VIAASHRSAP
EAATAPWIAS IELIIADNTA SFTSGGFPAD QIAALDRASR AAFERHRGLL GQRGAQGFVR
RCHGDLHLAN IVVIDSAPVL FDAIEFDPLI ASVDVLYDLA FPLMDFVHYG RCGAAAELLN
RYLAITPSQN DDAFGLLPLM LSMRAAIRAK VMLSRPADDA GTMRGNRETA GAYFALAARL
ISPPQPRLLA VGGLSGTGKS VLARALAGRI PPLPGAVVLR SDVARKRLFG VGDTERLPPT
AYSPEVTAEV YRGLGERAAH ILAQGHSVIV DAVFAKAEER QAIEAIATDA GCAMLGLYLI
ADLATRIDRV SRRVGDASDA TPDIVRQQQA YAQDAVGWTE IDAAGTPDQT LASAKAALGR
DDQACST
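
The gene and protein sequences above can be cross-checked against each other, and against the GC content field, with a short script. The sketch below is one possible way to do so in plain Python. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in the set of permitted start codons, which this sketch does not model), and it assumes the sequence contains only the letters A, C, G and T.

```python
# Codon table built from the conventional TCAG ordering of bases.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three 10-base blocks of the gene sequence listed above.
first_blocks = "ATGTCCATGA TGACGGCCAC TGGCGTTCGC"
print(translate(first_blocks))  # prints: MSMMTATGVR
```

The printed residues match the first block of the protein sequence in this record. Passing the complete gene sequence to `translate` and `gc_fraction` would allow the full protein sequence and the reported GC content to be checked in the same way.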