Gene RPD_3938 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_3938 
Symbol 
ID4024454 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp4377190 
End bp4378344 
Gene Length1155 bp 
Protein Length384 aa 
Translation table11 
GC content49% 
IMG OID637964142 
Producthypothetical protein 
Protein accessionYP_571060 
Protein GI91978401 
COG category[V] Defense mechanisms 
COG ID[COG1715] Restriction endonuclease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAGCAG CGCACGACCC ATCGCTTGCG TCCGGCACTA AACTGCTTAT GGCAAACAAT 
ATTTTCGATG AGTCATTTCA GCACGTTCCG TTTTCGGATT GGCTCGACGA GAACAAAAGC
ATTTTTCGAT ATGCTAAGCC GAAGAGCGTC TATCTCGCGC GCGTTCAAGA GAGGCGAGAG
GCTTACGTAG CGCGACTTTG TAAGCGCGCA GAAGTTCCGT TCTTCCTACC CCTGGTATGG
AACAGGCGGA AAAACGACTG GGCGTGCCTA TGGCCGGGGC AAATAGCAGT GTCCTTGGCA
ACCAAAAAGC GGCTATCTTG GATCGAAACA CAGAGCGTTG TTACGCAGCT AGAGCGTTGT
AACAGTTCCG TCTTAGAGCA GATGAGAGGC ATTGGTGTTT TTAACAGGCG GGCCTTCGAT
GGAGTTCTTA GTGAGCTGGG GCCGATCGCA GGAAAAACAG TTGGCGAAAT AGCCGGCCCT
TTCAGCGAGG CACGTTCGAA GCGTATCTCC AAAATTGATG ATACGCGGGT TAAATATACG
GTCGAGCACC TCGGGCGCAC TGTTCAGGCT ATCCTGCCTA TTGAAGCACT CCGAGATAGA
GATCAATTTG TTTCGTTAGA TGAAATCTAT CTAGGCCAAG GCCGACGCGC GGCAGAACTT
GATGCACGCG TAGAAACTCT AAATGCAGAA CTCATCAAGT ATTTGCAGAG GCATCCAGAT
TTTCTCTTTC AGACGTCACC GCATAAATTC GAAGAAATCA TCGCATCCCT CTTGGTAGAT
ATGGGGTTTG AAATTCAATT CACGCCACGT GGCGGGGGCG ATGGCGGCAA AGACATTCTT
GCAGCGATGA AACTACCGAT CGGAACACTC ATGACGATCG TAGAATGTAA AAGATACGCA
CCCAATAGCC GCGTCAGTTC TGACATCGTA GAGCGCTTCA TGTACACTAT TGATCGTAAA
GAAAATGCAT CGTGTGGCTT GATTGCGACT ACTTCTTTTT TTGCAGGTGA AGCGAAGGCA
ATGGAAGAGA AGTTCAAATG GCGTCTGAAA TTGAGAGACC TTGAGTCTAT CCGTAAATGG
CTGAGCAATT ATGGGAAATG GACTACTGAC GAAAAATCTG GCCTATTTGT CCCAACACAG
CCGAAACTTA TCTAG
 
Protein sequence
MTAAHDPSLA SGTKLLMANN IFDESFQHVP FSDWLDENKS IFRYAKPKSV YLARVQERRE 
AYVARLCKRA EVPFFLPLVW NRRKNDWACL WPGQIAVSLA TKKRLSWIET QSVVTQLERC
NSSVLEQMRG IGVFNRRAFD GVLSELGPIA GKTVGEIAGP FSEARSKRIS KIDDTRVKYT
VEHLGRTVQA ILPIEALRDR DQFVSLDEIY LGQGRRAAEL DARVETLNAE LIKYLQRHPD
FLFQTSPHKF EEIIASLLVD MGFEIQFTPR GGGDGGKDIL AAMKLPIGTL MTIVECKRYA
PNSRVSSDIV ERFMYTIDRK ENASCGLIAT TSFFAGEAKA MEEKFKWRLK LRDLESIRKW
LSNYGKWTTD EKSGLFVPTQ PKLI