Gene RPD_4135 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_4135 
Symbol 
ID4024657 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp4603347 
End bp4604660 
Gene Length1314 bp 
Protein Length437 aa 
Translation table11 
GC content63% 
IMG OID637964343 
Productprotein of unknown function DUF224, cysteine-rich region 
Protein accessionYP_571255 
Protein GI91978596 
COG category[C] Energy production and conversion 
COG ID[COG0247] Fe-S oxidoreductase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.335861 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAACTG AATTCAGTCT CGCCCAGCTC GCCGATCCGG ATATCGCCGA GGCCGACAAG 
ATCCTGCGCG CCTGCGTGCA TTGCGGGTTC TGCACGGCGA CGTGCCCGAC CTATGTGCTG
CTTGGCGATG AACTCGACAG CCCGCGCGGC CGCATTTACC TGATCAAGGA GATGCTGGAA
AACAACGCGC CGCCGACGGC GGACGTGGTC AAGCATATCG ATCGCTGCCT GTCGTGCCTC
GCCTGCATGA CCACTTGTCC GTCGGGCGTG AACTACATGC ATCTGGTCGA TCAGGCGCGC
GCCCGGATCG AGCGGGACTA TACGAGGCCG CTGCCCGATC GCCTCGTGCG CGAGCTGTTG
TCGTGGCTGA TGCCGCACCC CGGCATGTTC CGTTTCAGCA TGTGGATGGC GCGGCTGTTG
CGGCCGGTGG CGGCGCTGCT GCCCGGATCG CACGATCTCG CCCATCCGAC GTTCCTCAGC
CGGATCAAGG CGATGCTGGC GCTCGCCCCG AAGCATTTGC CGGAGCCTGG CCCGGCCTCT
GGAACCATGT TCCCGGCGGT CGGACCCAGG CGCGGACGCG TCGCACTGCT GCACGGCTGC
GCCCAACAGG TTCTGGCGCC GCGTATCAAC CGCGCCGCCA TCAATTTGCT GACACGCCAC
GGCATCGAGG TCGTGCTCGC GGCGGATGAA GCCTGCTGCG GCGCCCTGAT CCATCATCTG
GGGCGTGACA CGCGGACCCT CGAATACGCC CGTACCAACA TCAAGGCGTG GCTGCGCGAG
ATCGATCGCG GCGGCCTCGA CGCGGTTCTG GTGACGACCT CAGGCTGCGG CACCGTCATC
AAGGACTATG GTTACATGTT GCGCGAGGAT CCGGAATTCG CGGCATCGGC GGCGAAGGTC
TCGGCGCTCG CAAAGGATAT CAGCGAATAT ATCGGCACCC TTGAGCTGTC GCCGCCGCAG
CCGCATGGCG ATGTCGTCGT CGCTTATCAC TCCGCATGTT CGCTGCAGCA CGGTCAGAAA
GTCACGCAGC TCCCCAAAGA ATTGCTTTCC AAGTCCGGAT TCGTGGTGAA AGATATCCCG
GAGAGTCATT TGTGTTGTGG TTCGGCGGGC ACGTACAACA TTCTCCAGCC TGACATCGCG
ACCAGATTGC GCGACCGCAA AGTCGCCAAC ATCGCTTCCG TCAAGCCGGA CATGATTGCC
GCTGGCAATA TCGGCTGCAT GGTGCAGATC GCCAGCGGAA CGGACGTCCC TGTAGTGCAC
ACGATTGAGC TTCTCGATTG GGCGACAGGT GGTCCCCGGC CGGCGATCAG CTGA
 
Protein sequence
MKTEFSLAQL ADPDIAEADK ILRACVHCGF CTATCPTYVL LGDELDSPRG RIYLIKEMLE 
NNAPPTADVV KHIDRCLSCL ACMTTCPSGV NYMHLVDQAR ARIERDYTRP LPDRLVRELL
SWLMPHPGMF RFSMWMARLL RPVAALLPGS HDLAHPTFLS RIKAMLALAP KHLPEPGPAS
GTMFPAVGPR RGRVALLHGC AQQVLAPRIN RAAINLLTRH GIEVVLAADE ACCGALIHHL
GRDTRTLEYA RTNIKAWLRE IDRGGLDAVL VTTSGCGTVI KDYGYMLRED PEFAASAAKV
SALAKDISEY IGTLELSPPQ PHGDVVVAYH SACSLQHGQK VTQLPKELLS KSGFVVKDIP
ESHLCCGSAG TYNILQPDIA TRLRDRKVAN IASVKPDMIA AGNIGCMVQI ASGTDVPVVH
TIELLDWATG GPRPAIS