Gene RPD_3684 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_3684 
Symbol 
ID4024200 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp4111844 
End bp4113310 
Gene Length1467 bp 
Protein Length488 aa 
Translation table11 
GC content62% 
IMG OID637963889 
Producthypothetical protein 
Protein accessionYP_570807 
Protein GI91978148 
COG category[S] Function unknown 
COG ID[COG5361] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0446196 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGATGT TCGCGTCGAT GAAGGCGTTG GCACCAGCGG CAATAGCCGC GATGGCGGTC 
GGAACGCTCG GTCTGGAAAG CGCCACCGCC AAGGACAGAC TCGCAGCAGG CGAGGTCAAG
GTGATCGCAG AGGAAGCGTT CGTCTACGGC TTTCCAATGG TGATGAGCTA CGCCATTTAT
TACGAATCGT TCGTCGATAC CAAGTCCTCG CAATACAAGG CGCCGTTCAA TCAGCTGTAC
AACACCGCGC GCGTTTATAC GCCGGCCGAT ACCGCGGTGG TGACGCCGAA CAGCGACACG
CCGTATTCCT TCATCGGCAT GGACCTGCGC GCCGAACCGG TGGTCATCTG CAATCCGGAA
ATCGAGAAAT CGCGCTACTT CTCCCTTCAA CTGATTGACA TGTACACTTT CAACTACGGT
TACATGGGCA CCCGGACCAG CGGCAACGCC GCGCAATGCG CGCTCATCGC CGGGCCGCGC
TGGAAGGGCA AGGCGCCGGA CGGCATTGCC AAGGTGTTTC GCAGCGAGAC CGAGTTTTCG
CTCGGCTTGA TCCGGACCCA GTTGTTCAAC GGCGCGGATC TCGACAACGT TAAGAAAATT
CAGGCCGGAT ACCGAGCGGT ACCACTGTCG AAATTCCTCG GGCGCGCCGC GCCCGCGGCC
GCGCCGGCGG TCCAGTGGCC GATGATTGAC AAAGAGCTCG CGGCCAGGGA CCCGTTCACC
TATCTGAATT TCCTGCTCAC CTTCACGCCG GCGACCGGGC CTGCGGCCGT CGAGGCGCCA
ATGCGCGCGC GTTTTGCGAA GATCGGAATC CTGCCGGGCA AGCCGTTCAA CGTCAGGGCA
TTGAGCGCTG CGCAGAAGGA AGAACTCGAG GCGGGGGTCA AGAGCGGGCT GGAGAAGATC
AAGGCGACCA TCGACACGCT CGGCCGGGTC GAGAATGGTT GGCGCGTCGC CACCAGCGCA
TTCGGCGATC GGGCGATGTA TGGCGCAGAT TTTGCGCGCC GCGCGGCGGC GGCGATGGCC
GGCATTTACG GCAACGACGC CAGCGAGGCG CTTTATCCGA TGCTCGCCGC GGATAGCGAG
GGTAAGAAGC CGGACACGGG CGTTGCCAAC TACGCTCTGA CGTTCCCGGC GGGATCGCTG
CCGCCCACAA AGGCGTTCTG GTCGGTGACG ATGTATGACG GCAAGACGCA ACTCCTGATC
GACAATCCGA TCAATCGATA TCTGATCAAC TCGCCGATGC TGCCCGACCT CAAGAAGAAC
CCGGACGGCT CTCTGACGCT GTTACTGCAG AAAGAGTCCC CGGGGCCGGA CAAGACCTCG
AACTGGCTGC CGGCGCCGAA CGGACCGGCT TACATCGTGA TGCGGATCTA TTGGCCGGAG
CCGACAGCAT TGAATGGCGC GTGGAAACCG CCTGTGGTCC AGCCCGTCAA GCTGGAATCG
AGCGCGAATC CTGCGAAGCC GGAATAG
 
Protein sequence
MKMFASMKAL APAAIAAMAV GTLGLESATA KDRLAAGEVK VIAEEAFVYG FPMVMSYAIY 
YESFVDTKSS QYKAPFNQLY NTARVYTPAD TAVVTPNSDT PYSFIGMDLR AEPVVICNPE
IEKSRYFSLQ LIDMYTFNYG YMGTRTSGNA AQCALIAGPR WKGKAPDGIA KVFRSETEFS
LGLIRTQLFN GADLDNVKKI QAGYRAVPLS KFLGRAAPAA APAVQWPMID KELAARDPFT
YLNFLLTFTP ATGPAAVEAP MRARFAKIGI LPGKPFNVRA LSAAQKEELE AGVKSGLEKI
KATIDTLGRV ENGWRVATSA FGDRAMYGAD FARRAAAAMA GIYGNDASEA LYPMLAADSE
GKKPDTGVAN YALTFPAGSL PPTKAFWSVT MYDGKTQLLI DNPINRYLIN SPMLPDLKKN
PDGSLTLLLQ KESPGPDKTS NWLPAPNGPA YIVMRIYWPE PTALNGAWKP PVVQPVKLES
SANPAKPE