Gene RPB_4078 details

Gene Information

Locus tag: RPB_4078
Symbol:
ID: 3911885
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 4649272
End bp: 4650456
Gene Length: 1185 bp
Protein Length: 394 aa
Translation table: 11
GC content: 68%
IMG OID: 637885982
Product: lipid-transfer protein
Protein accession: YP_487682
Protein GI: 86751186
COG category: [I] Lipid transport and metabolism
COG ID: [COG0183] Acetyl-CoA acetyltransferase
TIGRFAM ID:
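
The coordinate and length fields above can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon. The record does not state either convention, but the reported values are consistent with both. The function names are illustrative, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming the final codon is a stop codon."""
    return gene_len_bp // 3 - 1


length_bp = gene_length(4649272, 4650456)
print(length_bp)                  # 1185
print(protein_length(length_bp))  # 394
```

Both results match the Gene Length and Protein Length fields of the record.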


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.254094
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGTCAT CCGTCTACGT CGCCGGTGTC GGCATGATTC CTTTCGTCAA GCCGGGCGCC 
AACGAGCCCT ATCACCTGAT GGGCGCCGAA GCGGCGCGGC GCGCGCTCAG CGACGCGGGT
GTCGGCTACG ACGCCATCCA GCAGGCCTTC GTCGGCTACG TTTACGGCGA CTCCACCTGC
GGGCAGCGCG CGCTGTATCA GGTCGGCATG ACCGGCGTGC CGATCGTCAA CGTCAACAAC
AACTGCTCGA CCGGCTCGAC CGCGCTGTTT CTGGCGCGGC AGGCGATCGC CTCGGGTGCG
GCCGATTGCG TGCTGGCGCT CGGCTTCGAG CAGATGAAGC CCGGTGCGCT CGGCTCGGTG
TTCGTCGATC GCCCCAGCGC GTTCGAGGAT TTCGATGCCG CCGCCGACAA GTTGATCGAT
GCGCCCGGCA TTCCGCTGGC GCTGCGCTAT TTCGGCGGCG CCGGCCTCAG CCACATGCAG
AAGCACGGCA CGCCGCTGTC GTCCTTCGCC AAGGTCCGCG CCAAAGCGAG CCGCCACGCC
GCGAAAAATC CGCTGGCGTT GTTCCGCAAG GAAGTCACCG CGGAGGACGT GCTGAACGAC
CAGGTGATCT GGCCCGGCGT GATGACGCGG CTGATGGCGT GCCCGCCGAC CTGCGGCGGT
GCCGCGGCTG TGCTGGTGTC GGAAGCCTTC GCCAAGAAGC ACGGCCTCAA CATCAACGTC
CGCATCGCTG CGCAAGCAAT GACAACCGAC ACGCCCTCGA CATTCGACGC GGGCGACATG
ATGCGGGTGG TCGGCTACGA CATGGCGCGT GCCGCGGCCG ACAAGGTCTA CGAGCAGGCC
GGCGTGGGCC CGAAGGACAT CGACGTCGTC GAGCTGCACG ACTGCTTCGC CCACAACGAG
TTGATCACCT ACGAGGCGCT CGGCCTGTGC CCCGAAGGCG GCGCCGAGAA GTTCATCGAC
GACGGCGACA ACACCTATGG CGGCCAATTC GTCACCAATC CGTCCGGCGG GTTGCTGTCG
AAAGGCCATC CGCTCGGCGC CACCGGGCTC GCGCAGTGCT ACGAACTGAC CCGGCAGTTG
CGCGGCTCCG CGGAGGCGAC GCAGGTGGAC GGCGCGAAGC GCGCGCTGCA GCACAATCTC
GGCCTCGGCG GGGCTTGCGT CGTCACCCTT TACGAACGCG CCTGA
 
Protein sequence
MASSVYVAGV GMIPFVKPGA NEPYHLMGAE AARRALSDAG VGYDAIQQAF VGYVYGDSTC 
GQRALYQVGM TGVPIVNVNN NCSTGSTALF LARQAIASGA ADCVLALGFE QMKPGALGSV
FVDRPSAFED FDAAADKLID APGIPLALRY FGGAGLSHMQ KHGTPLSSFA KVRAKASRHA
AKNPLALFRK EVTAEDVLND QVIWPGVMTR LMACPPTCGG AAAVLVSEAF AKKHGLNINV
RIAAQAMTTD TPSTFDAGDM MRVVGYDMAR AAADKVYEQA GVGPKDIDVV ELHDCFAHNE
LITYEALGLC PEGGAEKFID DGDNTYGGQF VTNPSGGLLS KGHPLGATGL AQCYELTRQL
RGSAEATQVD GAKRALQHNL GLGGACVVTL YERA
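
The gene and protein sequences above are related through translation table 11, and the GC content field is derived from the gene sequence. The following is a minimal, dependency-free sketch of both calculations, not the database's own pipeline. It relies on the fact that table 11 assigns the same amino acids to codons as the standard code (the two differ only in which start codons are permitted). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order (first, second, third base each cycling T, C, A, G).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the 10-letter grouping spaces and line breaks used in the record."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record above.
print(translate("ATGGCGTCAT CCGTCTACGT CGCCGGTGTC"))  # MASSVYVAGV
```

Passing the full gene sequence to `translate` should reproduce the listed protein sequence, and `gc_fraction` on the same input can be compared with the GC content field.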