Gene RPB_0696 details

Gene Information

Locus tag: RPB_0696
Symbol:
ID: 3908202
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 782939
End bp: 784159
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 66%
IMG OID: 637882588
Product: hypothetical protein
Protein accession: YP_484318
Protein GI: 86747822
COG category: [I] Lipid transport and metabolism
COG ID: [COG1960] Acyl-CoA dehydrogenases
TIGRFAM ID:
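
The length fields in this record agree with its coordinates under two conventions that the record itself does not state and that are assumed here: coordinates are 1-based and inclusive, and the protein length excludes the terminal stop codon. A minimal check in Python, using only values copied from the record:

```python
# Values copied from the record.
start_bp = 782939
end_bp = 784159

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted in the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1221 0 406
```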


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.103838
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACCACA TGAATGTGTC CAAGGGCGAA TCCTGGAGCG GGTCGCGCGC GTCCGACTTC 
GACTACAGGG CCGCGGCTCG CGCGATCCTG CCACGCCTGG CCGCGACGTC GGACAGCAGC
GAGCGCCTGC GCAGGCTGGA CGACGATGCC GCCGCGGCAC TCCGCGGATC GGGCCTGGCC
CGCGTTCTGA CGCCGAAGAA ATTCGGCGGC TTCGAGCTCT CCCCAAGCGC CCACATCTGG
ACCTGTGCGG AACTCGCCCA GGGCTGTTCG GCCGCAAGCT GGGTGTTGAT GGTTTGTGTG
GCCCACGACT ACATCGTCGG ACGGTTCTCG GAAGAGTGCC AGAAGGAAGT CTATGACGGC
GACGCCGACA ACCTGCTCGC CGGCGCCCTG GCTCCGCAGG GCACGATCGA ACGCACCGCC
GGCGGTTGGC GTCTCAATGG GCGTTGGCAA TTCGGCAGCG GCTGCGACCA TTCTCCCTGG
TTCATTCTCG GCACCAAGGT GGTCAATCCG GACTCGGGCG GCTATCTCAA CTACCATGTG
ATGGTGCCGC GGGCGGACAT CGAGATCGAC GATACGTGGT ACACGCTCGG CATGCGCGGA
ACAGGATCGA AAGATCTCGT CGCACGCGAT GTGCTCGTGC CCGACTATCG GGCGATGCCG
ACCTATCCGA CCTTCATGGG GTCGACCCCG CATACGAACA GTCCCGTCTA TCGCTTGCCC
GTCTATGCCG GTCTTTCGTC GATGCTGTCG GGCACCGTGC TCGGGATGGC GGAGCGCGGC
TTGAAGCACT TCATCGAGCG GACCTCCGCC CGCAGGACCG CCCATGGCGT ATCGAAGGCG
GAGAACGCCA ACATGCAACA ACGAGTGGCG GAGTCGACGG CCGAAGTCGC CGCCGCCCGG
CGGCTGCTGG AAAACATCTG CGAGCGCTTC GATCAGGCGA TGGTTGCCGA CCAGGGGCCG
ATGTCCGCCA GCGACCGCGT CCAGTTCCGG TGGGACGCGG CCTATGTCGT CGAACTGAGC
CGACGGGCGA TCGATCGGGT GTTCGCCGCT TCCGGCGCAC ACGGAGTCTA CGAGGGCAGC
CCGGTGTATC GCGCCTACCG CGATATCAAC ACGGCCTGCC ATCACGCGGT GATCGACTTC
GACACGGTCT CTGGATTACG CGGGCAGATC GCCCTGCTCG GCGACATCGG CGAGAACCCC
CGTTCGGTGC CTCTCGCCTA G
 
Protein sequence
MNHMNVSKGE SWSGSRASDF DYRAAARAIL PRLAATSDSS ERLRRLDDDA AAALRGSGLA 
RVLTPKKFGG FELSPSAHIW TCAELAQGCS AASWVLMVCV AHDYIVGRFS EECQKEVYDG
DADNLLAGAL APQGTIERTA GGWRLNGRWQ FGSGCDHSPW FILGTKVVNP DSGGYLNYHV
MVPRADIEID DTWYTLGMRG TGSKDLVARD VLVPDYRAMP TYPTFMGSTP HTNSPVYRLP
VYAGLSSMLS GTVLGMAERG LKHFIERTSA RRTAHGVSKA ENANMQQRVA ESTAEVAAAR
RLLENICERF DQAMVADQGP MSASDRVQFR WDAAYVVELS RRAIDRVFAA SGAHGVYEGS
PVYRAYRDIN TACHHAVIDF DTVSGLRGQI ALLGDIGENP RSVPLA
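
The sequences above are printed in 10-character blocks separated by spaces, so they need cleaning before any length or composition check. The following is a minimal Python sketch of that step. The helper names `clean_sequence`, `gc_percent`, and `ends_with_stop` are hypothetical and are not part of the source database; the stop codons listed are those of translation table 11.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# Stop codons in translation table 11 (hypothetical constant name).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def ends_with_stop(dna: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop."""
    return len(dna) % 3 == 0 and dna[-3:] in STOP_CODONS


# Illustration on the final line of the gene sequence only.
tail = clean_sequence("CGTTCGGTGC CTCTCGCCTA G")
print(len(tail), ends_with_stop(tail))  # 21 True
```

Applying `clean_sequence` to the full gene sequence and passing the result to `gc_percent` gives a value that can be compared with the GC content field of the record.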