Gene RPC_3934 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_3934 
Symbol 
ID3969318 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp4373350 
End bp4374573 
Gene Length1224 bp 
Protein Length407 aa 
Translation table11 
GC content59% 
IMG OID637927037 
Producthypothetical protein 
Protein accessionYP_533779 
Protein GI90425409 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGGCGT TCGTCTGGCT GGATTATTCA GAGCGCGATC GGCGAAAAAT GCTCGACGTG 
GTCGATCTTT TCAAAGAGCA CGATACGCGT GACGAACTTG GTCTCGGTGC TATACGGGAC
TCATTCGCAG ATCAGTTCTT TCCCGGCACA AGCACGATCA TGACGCGTGC AAGATACTTT
CTCCTGGTCG CCTGGACCTA TCAACGGCTG GAGAGGCAGC GCGTCACCTC GGCGCGAATA
CCGGAGCGTG GCCGGCGGGC CGAAACGGAC TTGATCGAAG CTATTGAACA ATCGGACGAC
AAGGAAGGTA ATATCGGTAA ATACGCAAAG ACGACGCTGA AGCGCCTGCC CAGCAGCGTC
TATTGGCACG GGTTGAACGT ATGGGGGATT CGGACGTTTG CCGGCGCGCA GTCCCAATAT
CACCGCAGCC TTGATCGGAG CTATGCGCAG CTCCATCGCC ATGGCGGCCG GGTTACGGAG
CGCGATGCCG AGCACGATGA TCTCATAGCA CCGAACTGGC ACGCCGGCCT CGTTCTGCCG
CCCGAGGCGT TTCCGGACGA ATGCTCGCTA CGTCTGACGC GCGCCGAAGC TCAGTATCTA
AGCGAGCGCA TCCGCCTCAG TCCGCAATGT GCTGGTACGC TTCTCGCCGA GCTTGTCGCA
CGTCACCACC ATCACGAGGA TGTCGCTTTC GTGTGGCAGC TTCCGTATCT TGCCGAAATG
CCGTTGAAAT TGCGCGAGAT GTTGGAGCAT GCGCGCAATT TTTCGGAAAT CACCCACGGC
GCACCGTTGC TCTACAATCT GATCCTTGCC GAGCAGGAGC GCTGGGACGA GGGCGTTGAG
GAATATAGGG AACGCTTCGC GGCGTGGGCA CAACTCGTCA CGGGCCGCTC TTCAATCCTG
AAGGCATGGA AGCGCAACCG TTTTTGGGAG CTCGCCCGTG CCGGCAATCC CCGCATCAGC
GCTCCGACTT ATGATTTCAG CAATGCGTGG TGGGATCGCG TCTTGGGCAC CGACCCCGCC
ACCTTGTGCG ACAGCCAAGC CGTTCGCGCC CTCATCCGTG ACCGCGAACG TAAACTCAAA
ACGGACCTCG CCCGGATCGG CAATCCCCGT GCACGGGAGC TTTGGAACGG CGACTCAGGC
TCCGCCCAAT TAGAATATCG GTGGCTCATC AGCCAGCGGC TGCTTGGAGA CATTTTCAAA
GGCCTAGAGG CGTCCGATGC TTAG
 
Protein sequence
MSAFVWLDYS ERDRRKMLDV VDLFKEHDTR DELGLGAIRD SFADQFFPGT STIMTRARYF 
LLVAWTYQRL ERQRVTSARI PERGRRAETD LIEAIEQSDD KEGNIGKYAK TTLKRLPSSV
YWHGLNVWGI RTFAGAQSQY HRSLDRSYAQ LHRHGGRVTE RDAEHDDLIA PNWHAGLVLP
PEAFPDECSL RLTRAEAQYL SERIRLSPQC AGTLLAELVA RHHHHEDVAF VWQLPYLAEM
PLKLREMLEH ARNFSEITHG APLLYNLILA EQERWDEGVE EYRERFAAWA QLVTGRSSIL
KAWKRNRFWE LARAGNPRIS APTYDFSNAW WDRVLGTDPA TLCDSQAVRA LIRDRERKLK
TDLARIGNPR ARELWNGDSG SAQLEYRWLI SQRLLGDIFK GLEASDA