Gene RPB_3601 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_3601 
Symbol 
ID3911403 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp4133224 
End bp4134498 
Gene Length1275 bp 
Protein Length424 aa 
Translation table11 
GC content63% 
IMG OID637885503 
ProductAcyl-CoA dehydrogenase-like 
Protein accessionYP_487207 
Protein GI86750711 
COG category[I] Lipid transport and metabolism 
COG ID[COG1960] Acyl-CoA dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.86159 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.643653 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGATTTTT CCCTGCCGCC GGATCTGGTC GCCTATCTCG CCGAGCTCGA TCGCTTCATC 
GACGCCAAGA TCAAGCCGCT GGAACAGGCG GACGACAACA TCCGCTTCTT CGATCATCGC
CGCGAATGGG CGCGCACCGA TTTCGAGGGC GGCGGACTGC CGCGGCACGA CTGGGAAGAG
CTGCTGCGCA AGGCGAAGAA CATCGCCGAT GACGCCGGGC ATCTGCGCTT CGCGATTCCG
AAGCGCTATG GCGGGCAGGA CGGCACGAAT TTGTGGATGG CGGTGATCCG CGAGCACTTC
GCCGCCAAGG GCCTCGGCCT GCACAATGAT CTGCAGAACG AGCATTCGAT CGTCGGCAAT
TTCCCGATCG TCAAGATGCT CGACCTCTAC GGCCGCGACG ATCAGAAGGC GATGATCGAC
GGCTCGATCA CCGGAAAATA TCGCATCACG TTTGGTCTGA CGGAGCCCGA TCACGGCTCG
GATGCGACGC ATATGGAAAC GCGTGCGGTC GAGGCGGTGC GCGACGGCAA GAAGGGCTGG
GTGATCAACG GCGAGAAGAT GTGGACGACC GGCATGCACG TCGCCACGCA TTGCGCGCTG
TTCGCCCGCA CCACCGGCCA AGACGGCGAC GCCCGCGGCA TCACCTGCTT TCTGGTGCCG
GCCGATGCGC CCGGCGTCAA GGTCGAGGAA TATCTCTGGA CCTTCAACAT GCCGACCGAT
CATCCGCGGG TGAGCTTCAC CGACGTCTTC GTCACCGAAG ACGCGTTGTT CGGCGAGGTC
GGCCGCGGCC TGTCGCTGGC GCAATGCTTC GTGCACGAGA ACCGCATTCG CCAGGCGGCG
AGTTCGCTGG GTGCTGCGGT GTTCTGCATC AATGAAAGCG TCAAATACGC GCGGGAGCGA
AAACCGTTCG GCGAGGAACT GGCGCGCAAC CAGGCGATCC AGTTTCCGCT GGTCGAACTC
GCCACCCAGG CCGAAATGCT GCGCCTCTTG ATCCGCAAGA CCGCGTGGGA GATGGATCAG
ATGACCCAGG CGCAGGTCGA GCACACGCTG TCCGACAAGG TGTCGATGTG CAACTACTGG
GCGAACCGGC TGTGCGGCCA GGCCGCCGAT CGCGCGATCC AGGTCCACGG CGGCATCGGC
TATTCGCGGC ACAAGCCGTT CGAACACATC TATCGCCACC ACCGCCGCTA TCGCATCACC
GAAGGCAGCG AGGAAATCCA GATGCGCAAG GTCGCGGGAT TCCTGTTCGG CTATATGGGT
GTGAACAAGA GGTGA
 
Protein sequence
MDFSLPPDLV AYLAELDRFI DAKIKPLEQA DDNIRFFDHR REWARTDFEG GGLPRHDWEE 
LLRKAKNIAD DAGHLRFAIP KRYGGQDGTN LWMAVIREHF AAKGLGLHND LQNEHSIVGN
FPIVKMLDLY GRDDQKAMID GSITGKYRIT FGLTEPDHGS DATHMETRAV EAVRDGKKGW
VINGEKMWTT GMHVATHCAL FARTTGQDGD ARGITCFLVP ADAPGVKVEE YLWTFNMPTD
HPRVSFTDVF VTEDALFGEV GRGLSLAQCF VHENRIRQAA SSLGAAVFCI NESVKYARER
KPFGEELARN QAIQFPLVEL ATQAEMLRLL IRKTAWEMDQ MTQAQVEHTL SDKVSMCNYW
ANRLCGQAAD RAIQVHGGIG YSRHKPFEHI YRHHRRYRIT EGSEEIQMRK VAGFLFGYMG
VNKR