Gene RPB_4473 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_4473 
Symbol 
ID3912289 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp5062939 
End bp5064039 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content69% 
IMG OID637886376 
ProductSel1-like protein 
Protein accessionYP_488067 
Protein GI86751571 
COG category[R] General function prediction only 
COG ID[COG0790] FOG: TPR repeat, SEL1 subfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGGCGC TACGCCCGAT CTCGATCGCT GCGGCCATGC TGCTGCTCGC CACCGGCGCG 
TCGGCGCAAT TGTCGCTGAC GCCGTCGCCG CCCAACCCGT TTCCCAAGCC GCTGGAACCG
GAAAAGCCCA AGCCGCGGCC GAAACCTGCC GCCGCCCCGG CCGACAAGGA CAAGGCGAAA
AAGCCTTCCG CCGACAAGGC CGGCGCGGCC AAGTCCGGCG CTACGGCGGA GGGCGCAGAG
ACCACCACCG ATGTCGACGA TCCCAATGTC GATCTGGTCT ATGGCGCGTA TCAGCGCGGG
TTCTACAAGA CCGCGTTCGA AATCGCGCTG CAGCGCGCCC GGGATTTCAA CGATCCCAAG
GCGATGACGA TGCTGGGCGA GCTTTACGCC AACGCGCTCG GAATCAAGCG CGACTACGAC
AAGGCGGTGG AATGGTACAG GCGCGCCGCC GATCTCGGCG ACCGCGAGGC GATGTTCGCG
CTGGCCATGG CGCGGATGGC GGGGCGCGGC GGCGGGCCGG CGAACCGCGA GGAAGCCGCC
AAATGGCTGG CGTCCTCGGC CAAGCTCGGC GAGCCGCGCG CGGCGTATAA TCTGGCGCTG
CTGTATCTCG ACGGCCAGAC CTTCCCGCAG GACATCAAGC GCTCCGCCGA ACTGCTGCGG
GTGGCCGCCG ACGCCGGCAA TCCCGAGGCA CAATATGCGC TGGCGACCTT CTACAAGGAG
GGCACCGGGG TGGAGAAGAA CGTCGAGCAG TCGGTGCGGC TGCTGCAGGC CGCGGCGGTC
GCCGGCAATG TTCCCGCCGA GGTCGAATAC GCGATCGCGC TCTACAACGG CACCGGCACG
GTGAAGAACG AGCCGGCCGC GGTGGCGCTG CTGCGCAAGG CGGCGCGCGC CAACAACCCG
ATCGCGCAGA ACCGGCTGGC GCATGTGCTG CTCAGCGGCC AGGGCGCGCC GCGCGACCCG
GTCGAGGCGA TCAAATGGCA CCTGGTCGCC AAGACCGCCG GCAAGGGCGA CCTGATGCTC
GACGAGGCGC AGGCCCAGCT CAGCGCCGAG GACCGCGCCA AGGCCCAGGA GGCCGCGAAG
AAATGGGTCG GCGGGAAATG A
 
Protein sequence
MKALRPISIA AAMLLLATGA SAQLSLTPSP PNPFPKPLEP EKPKPRPKPA AAPADKDKAK 
KPSADKAGAA KSGATAEGAE TTTDVDDPNV DLVYGAYQRG FYKTAFEIAL QRARDFNDPK
AMTMLGELYA NALGIKRDYD KAVEWYRRAA DLGDREAMFA LAMARMAGRG GGPANREEAA
KWLASSAKLG EPRAAYNLAL LYLDGQTFPQ DIKRSAELLR VAADAGNPEA QYALATFYKE
GTGVEKNVEQ SVRLLQAAAV AGNVPAEVEY AIALYNGTGT VKNEPAAVAL LRKAARANNP
IAQNRLAHVL LSGQGAPRDP VEAIKWHLVA KTAGKGDLML DEAQAQLSAE DRAKAQEAAK
KWVGGK