Gene RPD_3244 details


Gene Information

Locus tag: RPD_3244
Symbol:
ID: 4023753
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 3600400
End bp: 3601494
Gene Length: 1095 bp
Protein Length: 364 aa
Translation table: 11
GC content: 69%
IMG OID: 637963448
Product: hypothetical protein
Protein accession: YP_570370
Protein GI: 91977711
COG category:
COG ID:
TIGRFAM ID:
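
The coordinate and length fields in this record are consistent with one another, which a short sketch can confirm. The function names are illustrative and not part of any database API. The sketch assumes that the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(3600400, 3601494)
print(length, protein_length(length))  # prints: 1095 364
```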


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.262677
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.336955
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGGCCG AAATCTCCAA TTCCGCGGTC TCGATCGTCG ATCCGGCGCG GACGCGGATT 
GCCGGCGCGA TCAAGCAGGC TTCCGGCGCC ACCGGCGCCT CCTTCGAATA TCTGCTCGCC
ACCGCCAAGA TGGAGTCGAA CTTCAATCCG CAGGCGGCGG CCTCGACCTC CTCGGCGAAG
GGCCTGTTCC AGTTCATCGA CCAGACCTGG CTCGGCACGG TGAAGGAAGC GGGCAGCCAG
TTCGGCTACG GCCAATATGC CGACGCGATC AGCAAATCGG CGTCCGGCAG CTACTCGGTC
AGCGATCCGG CGGCGCGCCA GGCGATCATG GATCTGCGCA ACGACCCGGT GATCAGCTCG
GCGATGGCCG GCGCGCTGAC GCAATCCAAC AGCTTCAAGC TCACCGGTGA TATCGGCCGG
CGTCCGACCG ACGCCGAACT TTACATGGCG CATTTCATGG GCGTCGGCGG CGCCGCCAAG
CTGATCAGCA CCGCGCAGGA CAATCCGGGC GCGATCGGCG CCGCGCTGTT TCCCAATGCC
GCCGCCGCCA ACCAGTCGAT CTTCTACGAC CGCTCGGGCC AGGCCCGCAC CGTGGCGCAG
GTCTATGACA ACCTGACCTC GCGCTACGAT GCGGCGGCGA ATTCACAGGC GACGCACAGC
GCGATGGCCT CGGTCGGCGG CGCGGTTCCG GCCGGCGCCG CGATGCTGCT CGCCTCGGCC
GCGCCGGTCG ACAATGCGGC CTATCTGTCG AGCTTCCCGG ACGTGCGTAA CGTCGCGCCG
GTCAGCGCGA CATCGCCGGC CGATGCGGCG GCGTCGTCGA CGCGGCAATC CTCCGAGCCG
ATGTTCCGCA CCCTGTTCCT CGGCGGCGAC CGCAGCGAGC CGGTGTCGCC GGCGGTCCAG
CAATTGTGGA ACGGAACGTC GGGTCCGCCG CCGACCACCG CGCCCCCGAC AACCAGCCTG
TCCTACGCGC CGACGACATC GGGCACATCG GCGCTGTCGA TGCCGCCGAC CAGCAGCCTG
TCCGCCACCC CGACCGTCCG TGCCCCGCAA CCGCTCGATC TGTTCAGCGA CCGCAGCGGC
ACCTTCGCCA ACTGA
 
Protein sequence
MSAEISNSAV SIVDPARTRI AGAIKQASGA TGASFEYLLA TAKMESNFNP QAAASTSSAK 
GLFQFIDQTW LGTVKEAGSQ FGYGQYADAI SKSASGSYSV SDPAARQAIM DLRNDPVISS
AMAGALTQSN SFKLTGDIGR RPTDAELYMA HFMGVGGAAK LISTAQDNPG AIGAALFPNA
AAANQSIFYD RSGQARTVAQ VYDNLTSRYD AAANSQATHS AMASVGGAVP AGAAMLLASA
APVDNAAYLS SFPDVRNVAP VSATSPADAA ASSTRQSSEP MFRTLFLGGD RSEPVSPAVQ
QLWNGTSGPP PTTAPPTTSL SYAPTTSGTS ALSMPPTSSL SATPTVRAPQ PLDLFSDRSG
TFAN
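
The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a minimal sketch. The helper names `clean`, `gc_content`, and `translate` are hypothetical. The codon table is written in the usual TCAG ordering; for translation table 11 the codon-to-residue assignments are the same as in the standard code, and alternative start codons are not handled here. This is a way to spot-check the record, not the method the database used to produce it.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to block the sequence in tens."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        residues.append(residue)
    return "".join(residues)


# First 30 bases of the gene sequence and the final 15 bases.
print(translate("ATGTCGGCCG AAATCTCCAA TTCCGCGGTC"))  # prints: MSAEISNSAV
print(translate("ACCTTCGCCA ACTGA"))                  # prints: TFAN
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed above, and `gc_content` on the same input can be compared against the 69% reported in the Gene Information fields.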