Gene RPB_4417 details

Gene Information

Locus tag: RPB_4417
Symbol:
ID: 3912232
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 5004058
End bp: 5005569
Gene Length: 1512 bp
Protein Length: 503 aa
Translation table: 11
GC content: 67%
IMG OID: 637886322
Product: hypothetical protein
Protein accession: YP_488014
Protein GI: 86751518
COG category: [S] Function unknown
COG ID: [COG3333] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
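
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the stop codon is
    included in the gene length and is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(5004058, 5005569)
    print(length, "bp")                  # compare with the Gene Length field
    print(protein_length(length), "aa")  # compare with the Protein Length field
```

Under these assumptions the computed values agree with the Gene Length (1512 bp) and Protein Length (503 aa) fields of this record.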


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTCGACG CCCTGGAAAA ACTCGCCTAC GGTATCTCGC TGTCGCTCGA GCCGTCGAAC 
CTGCTGTACG CGGCGATCGG CTCGGTGCTC GGCACGCTGG TCGGGGTGCT GCCGGGGCTC
GGCCCGGTGA CGACGATCGC GGTGCTGCTG CCGCTCACCT ATCACGCCGG CTCGCCGCTC
GGCGCCATCA TCATGCTGGC CTCGATCTAT TACGGCGCGA TGTATGGCGG CTCGACCACC
TCGATCCTGC TCAAGGTCCC GGGCGAGGCC GCCTCGGTGA TCACCTGCAT CGACGGCTAT
CAGATGGCCA AGAAGGGCCG CGCCGGCCCG GCACTGGCGA TCGCCGCGAT CGGGTCGTTC
ATCGCCGGCA CCGTCGCAGT GTGCGCGCTG GCGCTGGTCG GCCCGCTGTT CGCCAAATTC
GCCGTCACCT TCGGGCCACC CGAGTACTTC GCGCTGGCCC TGTTCGGGCT GTCGTTGAGC
GCCACGCTGT CCGGCGGCTC GCCGGTCCGC GGCATCACCA TGGTGCTGGT CGGGCTGCTG
CTCGGTCTGG TCGGCATCGA CACCATCACC GGCGTCGAGC GCTACACTTT CAACATCATG
GCCATCACCG ACGGCATCGA TCTGGTGCCG ATGCTGATGG GCCTGTTCGG CGTCGCCGAG
ATCCTGCACA ATCTCGAGGA GAAATCGCGC GGCTCGCTGC TGTCGACCAA GATCGGCCGG
CTGTTTCCGA GCCGCCAGGA TTGGCGCGAA TCCAGTGGCC CGATCGCACG CGGCTCGGTG
ATCGGGTTCT TCGTCGGGCT GATTCCCGGC GGCGGCGCCA TTCTCGCCTC GCTGATGAGC
TACACGCTGG AGAAGAAGCT GTCGAAGACG CCGGAGCAGT TCGGCCACGG CGCCATCGCC
GGCGTCGCCG GGCCGGAATC GGCCAACAAT TCGGCGGCGA CGGCCTCGTT CATTCCGCTG
CTCACGCTCG GCCTGCCGGG CAACGCGGTG ACGGCGGTGC TGTTCGCCGG TCTGCTGATC
CAGAACGTGC AACCCGGCCC ATTGATGCTG GTGAAGAATC CCGACGTGTT CTGGGGTGTC
ATTGCGTCGA TGTATGTCGG CAACATCATG CTGCTGGTGC TGAACCTGCC GCTGGTCGGG
CTGTGGGTGC AGTTGCTCCG CGTGCCGTCC TGGCTGCTCA GCGCCACGAT CCTGCTGATC
GCGATCTTCG GCACCTACAG CCTGCGCAGC AATTTCGCCG ACGTGACCAC GCTGATGCTG
TTCGGCGGCA TCGGCTATCT GCTGCGCAAG GCCAGCCTCG ACGCCGGTCC GCTGATCATG
GCGTTCATCC TCGCCAACAT TCTCGACACC GCGCTGCGGC AATCGATGCT GATGGGCGAC
GGCAGCCTGC TGATCGTCCT GCAGCGGCCG ATGTCGCTGA CGATCCTGCT GGTCGCCGCG
GTCATCCTGA CGGCCCAGCT GTGGTCGCAT TTCGGCCGCA CGCGCCACCG CGCCGTGCCG
TCGGAGGGCT GA
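
The gene sequence is displayed in space-separated blocks of ten bases. A minimal sketch of how such a listing could be joined into one string and its GC fraction computed is given below; the helper names are hypothetical, and the exact method behind the record's GC content field is not stated, so this is only one plausible way to calculate it.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces and line breaks) and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of bases that are G or C in a block-formatted sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


if __name__ == "__main__":
    # Short illustrative input, not the full gene sequence.
    example = "ATGC GCGC"
    print(clean_sequence(example))
    print(gc_fraction(example))
```

Passing the full listing above to `gc_fraction` and multiplying by 100 gives a percentage that can be compared with the GC content field.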
 
Protein sequence
MFDALEKLAY GISLSLEPSN LLYAAIGSVL GTLVGVLPGL GPVTTIAVLL PLTYHAGSPL 
GAIIMLASIY YGAMYGGSTT SILLKVPGEA ASVITCIDGY QMAKKGRAGP ALAIAAIGSF
IAGTVAVCAL ALVGPLFAKF AVTFGPPEYF ALALFGLSLS ATLSGGSPVR GITMVLVGLL
LGLVGIDTIT GVERYTFNIM AITDGIDLVP MLMGLFGVAE ILHNLEEKSR GSLLSTKIGR
LFPSRQDWRE SSGPIARGSV IGFFVGLIPG GGAILASLMS YTLEKKLSKT PEQFGHGAIA
GVAGPESANN SAATASFIPL LTLGLPGNAV TAVLFAGLLI QNVQPGPLML VKNPDVFWGV
IASMYVGNIM LLVLNLPLVG LWVQLLRVPS WLLSATILLI AIFGTYSLRS NFADVTTLML
FGGIGYLLRK ASLDAGPLIM AFILANILDT ALRQSMLMGD GSLLIVLQRP MSLTILLVAA
VILTAQLWSH FGRTRHRAVP SEG