Gene RPD_3942 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_3942 
Symbol 
ID4024458 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp4382628 
End bp4387577 
Gene Length4950 bp 
Protein Length1649 aa 
Translation table11 
GC content62% 
IMG OID637964146 
Producthypothetical protein 
Protein accessionYP_571064 
Protein GI91978405 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGACTT TCACACCAGG TCCAGGTTCG CAGATGCCCA ACCCCCCGAA GAAGCAGGCA 
GGAGGCCGTG CGACCGAGGC AGGGATGGAT TTTCAGGCTG AGGTCGCGAC TTGGGCGGCT
GCTCATCTTC TGGCGCGGCT GCCGATCGGT GGCAGGCTCG GCCTCGCCAA CACTGCGCTC
GCGATCGAGA TTCGCCTGGA AACAGGGGAG GGCCTCGACG ACACGCTCAT CATCTTAGAC
GATGATTCCA GGATTGAAAT CCAGAGCAAG ACTACAGCGA GCTTGTCGCC GATAGCGTCG
AGCGCACTCG GCAAGACGAT TGCTCAACTC GCAAGATATG TAGCCGCCTC GCAGAAGTCC
GGCCTCAATG TAGTTCCATT TAAGACTCGC GCGGTACTAG CAGTCGGTGC ACCTGCGCCG
CGCACTCTGG ACGACCTAGA GAAGGGATGC CGCGCTTTCG ACCTAGGCGG CGAATGGGCC
ACGACGAGGG CCCAGCGTAG CCGGAATGAG CGCGACGCGC TCGACGTGTT CGAGACGCAT
GCGCGTAGAG CGTGGGCAGA GTCCTCGGCC ACCCCCCCCA CCGACGAGGA CATGGTGGAA
ATGGCGCGCA TGTTTCAGAT AGTGCGTTTT TCAATGGACG AGGGCGCGGA CAACTGGCGC
GAAGCAGCTC GGGCGCTCGG CGGCCGCCTT TACGGATCGG AGGCCGCTGG CGACGCGCCT
CTACGCGAGC TGAAAGCGAT TGTCCGAGGG CTGATTGGCC GCGGCGCACC GGCCGACCGC
GCCGGCCTCC TTCGAGCGCT GCGGAATCGC AGCCACCATG ATGTCGGTGC GCCGGACTTC
GAATCCGACC TGACCAAGCT CGAAGAAGCG AGCCGAGCGG AGCTTGCTCG GCTTGGCGTG
CACACACGCC TGCCTATTGC CGGCGGAGTT GCGATGGAAC GTGAGAGCGA TGGTCCTCTC
GCGAAGGCTA TCGCCGATGG ATCGCTAATC GTCATCGGAG AACCTGGCGC GGGAAAGACC
GGAGCGCTCG TCACGGTCGC CCTTGCCCGC CACGCCGCGG GTGACACCGT TGTGTTCCTC
TCGGTCGATC GATTTCCCGG GGTGGCAATC GCGGCCGACC TTCAGTCCGA GCTTCGACTC
GACCACCCTC TAGTCGAAGT TCTCGCTGCT GCGCCGGGCG CGTCTCGCAA GCTTCTCTTC
ATCGATGCAC TTGACGCCGC CCGCGGCGGT TCCGCCGAGG GCGTTTTTGC ACAGCTCATC
GAGACCGTCG GAGTGCTGCT CGACGGCGTG TGGACCATCA TAGCGTCGAT CCGGACATTT
GATCTGAAGA ATGGACGGCG CTATCGCGAG GCGATGCCCG GCTCACCTCC CGATCCAGCT
TTCGCAGATG CGGCGCTGAG TAGGGTCAGA CACTTCCAGG TGCCGCGATT GACCGACGGT
GATTTCGCGA CTGCGGGGAT TAAAGCGCTG GCACTTGGCG CCTTGTTGAC GGCTGCTCCC
GAAACCCTTC TGGACTTGCT GCGCAACGTC TTCAACCTTT CGCTTGCTGC ACAACTAATA
ACTGATGGTG CCACGCCAAG CAGCATTCGC GCTGTTTCAA CCCAGTCGGA CCTGATCGAC
GCGTATGAGG ATCGCCGCCT CATGGGAACG TCGCTTCAAC AGGCGGCGGC GACGGCCGTG
AGCGTGATGG TTCAGCGAAG GCGCCTAGCC GTCCGCAAGG TCGTCATTGC GCATGACCGC
CTCGACGAAG CCATCCAATC GGGTGTGCTC TCCGATGCTG GTGACCTTGT CCGCTTCGCG
CATCACGTCC TGTTCGACCA TGTCGCTGGC CGCTTCTTCC TCGATTGGGA TGACCCGTCG
AGCCTCATCG GGCAAATCAG CGGCGAAAGT TCGATTGCCC TCATGCTCGC CCCCGGGCTT
CGATTTGCGG TTGAGCGGCT CTGGCGTTCG GATAGCAACG GCAAGCCCGC AGTATGGTGC
TTTATTGCCG ACATCTACAA GGACACGAAC GTAGACCCGG TGCTGGCGAA CGTCGCACTT
CGCACCGCCA TCGAACGGGT CAGCGAGACC GCAGACGTCG CTGGGCTTGC GCTGCTCGTG
GTCGAGCGCG GGAGCGAAGA GCCGATCGCC ACCATGCTCT CTCGGCTCGC TCGCTTCGTC
GGTCTCGCGG TTGATGCAAG CGGCGGCGTG GCGAACGATG AAGCCAAAGC TTGGGCGACG
ATTGCCGAAG CTGCCGCCAA TGTTGGCTCG CGAGATCTCG CCGATCCTGC CCGGTTCCTA
CTCCAGACCT TGTTCGAAAA GGGAGACCTG TCTGATCCGG CGTTGCTCGG GATATACGGG
AGTGCGGCCC GCGCGCTTCT CACGCTGGCA TGGGCGGCAG ATCCTCTGAT GCAACTAACG
GCGACCAATG CGATCCGGTT TGTCGGCAAG AGCTTCGCAT CCGATCCGGT AGCGTCCCGC
ACGCTTCTTG ATCGTGCGCT TCGCGATCCA CACTTCTCCG CGCATGCCGA CAAGGAGGCG
ACATGGCTCG CCGAGCAGAT CATGCCGATA GCAGGCGCCG ATCCTGACTT CGCGGTCGTA
ATATTTCGTG TTTTATATTC GCGCGACATC ACCGACGAAA GCATGTCCTA CTTCGGCGGG
CAGGCCAGCC GTATTATGCC GCTGTCCTCG AACCGTCGGC AGGATTACCG CAGTTGCCGG
TACAATCTTG GCCGCAACGC TGGCCACCTG CTTGGACTCT CTGCGAAGTG GGGAACGAGG
GCTGTGATCG AAGCGGCGCT CGGAGACGCC GACCGTGAGG CGCCGGGTGG CGATGATCGC
GAGCGGGTGA CCGTAGCGGG ACGACCGGCG TTCGACCTTG TTGGTAAAGC ACTGGGATTC
AACGCCTGGG ACGATCCTGA CAGACACCGC GGCAACCAGG ATGACGATGT CCTCACACAC
TATGTGACGT TCTTGCGCGG CTGCTCGGTT GATGCGTTCG CTGAAAGCCT AGACGCTGCG
GCTTCGGGCT ATTCAACGCC TGCGGTTTGG GCGCGGCTGT TTGGCGTGGG CTCCGAGCGA
GTCATCGATG TCGCCGACGT CCTATGGCCC TTCGCGAGCA ATCCGACCAT ACTTGCTCAT
GCGGACACCG TCCGTGATGC CGTCCGCTTC CTAGCGGCGG CTTACCCACA GCGCCCAATC
GACCAACGCA ACGCCTTCGA GATCGAAGCG CTCAAAAGCG ATCTGTTCAC CGACGACCGC
GAACGAACTT GGTGGAGGCA CTCGCTCAGC CGGTTGCTTT CGACTGTCGA CGAGGCGGCG
CTCTCAACTG ATTCGATGCG CGCCTTTCGC GCCGACCTCG CCGCCTCCGG CGAACTCGGT
GGCAACCCGC CCCTCCGCTC GATGACGGTG AGTTGGCGCT CTTCGCAAGG AGTGACGCGG
AGCCTGCTTT CCAGGCAGGG CGTCAACGTC GATGAGGGGA TCGACGCACA GATGTTGGCG
CAGTCCGAGG CACTATACGA ACTTCTCCAG CAGACACCAG CGGATAGCGA TGCGGGTGGT
CTCGCCGCTC TGTGGGCAGC TACCGAGGCG ACGATTGCGT TTTTCGATGC TCACGCCGAC
CGGCTTCACG AACACGTCGA GCAGCCGGTT TGGGGCCATG TCAGCAATGC TGTCGAGCGT
ATTGCCGGAA GCTCGGCGTA TTTGCCTGGA ACGTCGGGGA TGCCGACAAT CGAATTGCTG
CTCTCTGTTC TCCGCCGGCT CTGGGTAAGC CGCTTCCCCG AACCGAAGGA CCGCGCCGAC
AGCAGTCTGA GTTGGGGCAA CTGGGACGTC CGCGTCTACG CTGCCGAAGC ATACGTGTCT
TTGACGGGTC GTTTCGGCGC CGAACACCAC GAAATCGCCG AGATGATTGA CGCAATCCTG
GCGGATCCGG TACCGCAGGT ACGGTCGCAG GCGGCGCAGA GCCTTCAAGT TCTTAGCAGG
ATTGCGCCCG AGCGCATGTG GCGACTCGCG GCACTGATCG CAGGAAAGGA GATGCATCCG
CAGGTGGTTG GCTCGTTCCT CAATTACGTC GTCTGGAAAT TCACTTGGCA GGAGGTGGAT
CGATGCGAGG CCATCATCGA GACAGTGATG GCCCGACGGC TCGCCGACGA GTGCAAGGAA
TCATCGGGGC ACGACCAGGT GGCGATACCG TTGGGCGGCC TCACGGCCCA GTTATGGGTC
TGGCAAGACC GATCAAAGGC GCTCGGATGG CTCGCCGGCT GGTCGGGAGA TCCTGTCGAA
CATCATGATC TTCTGACATC CTTCCTTTCA TTGCTGCGTT CCGCGTTGTT CGCCCGCTAC
GCGTCTGGAG AGGATCACGA TCCGGCGCTA ACTAACCGCG CACAGCATGC CGCAATGGTG
ATCCTGCAGG CCTGCTCGAC GATCGCGCTT GATGATCACG CTACGGCCAC GTCAGACGGT
ACAGATGGCG ACGCACGCGA GGCGGCCGTC GCGCGCTATC GCGCAGCGGA GCAGGTTATC
GGCCATCTCA TGAACCAGCT CTACTTTGGG TCGGGTGCAT ACGCAGATAG CAGGAATGCG
GTGATTGGGT TGAAGAGCCC TGACGCCATG CATCAGTTTT TGAAAGATTA TGACCAAATA
CTGCGGTTGC TGGCCGGTTC GCATGAGCCG GCGACGCTCC ACCATCTCGT CGAACTATAT
GAGTTCCTTA TCCCTGGGGA TCCCGCTGGC GTTTTCGATG CGCTGCATTC GTTGCTGCTC
GGTGCTGGAG CACGGGAAGG ATATCATCAC GAAGGTCTGG CAGCGCCCGT CATCGTTCGG
ATGATCATGC GCTATATAGC CGATCATCGT CCGATCTTTG AGGACGACGC CAGACGCGCG
CGCCTCGTCC AGATACTTCG GCTGTTCTCG GATGTCGGCT GGTCCGACGC ACTCAGGTTG
CTCTATGATC TCCCCGAGCT CTTGCGGTAG
 
Protein sequence
MMTFTPGPGS QMPNPPKKQA GGRATEAGMD FQAEVATWAA AHLLARLPIG GRLGLANTAL 
AIEIRLETGE GLDDTLIILD DDSRIEIQSK TTASLSPIAS SALGKTIAQL ARYVAASQKS
GLNVVPFKTR AVLAVGAPAP RTLDDLEKGC RAFDLGGEWA TTRAQRSRNE RDALDVFETH
ARRAWAESSA TPPTDEDMVE MARMFQIVRF SMDEGADNWR EAARALGGRL YGSEAAGDAP
LRELKAIVRG LIGRGAPADR AGLLRALRNR SHHDVGAPDF ESDLTKLEEA SRAELARLGV
HTRLPIAGGV AMERESDGPL AKAIADGSLI VIGEPGAGKT GALVTVALAR HAAGDTVVFL
SVDRFPGVAI AADLQSELRL DHPLVEVLAA APGASRKLLF IDALDAARGG SAEGVFAQLI
ETVGVLLDGV WTIIASIRTF DLKNGRRYRE AMPGSPPDPA FADAALSRVR HFQVPRLTDG
DFATAGIKAL ALGALLTAAP ETLLDLLRNV FNLSLAAQLI TDGATPSSIR AVSTQSDLID
AYEDRRLMGT SLQQAAATAV SVMVQRRRLA VRKVVIAHDR LDEAIQSGVL SDAGDLVRFA
HHVLFDHVAG RFFLDWDDPS SLIGQISGES SIALMLAPGL RFAVERLWRS DSNGKPAVWC
FIADIYKDTN VDPVLANVAL RTAIERVSET ADVAGLALLV VERGSEEPIA TMLSRLARFV
GLAVDASGGV ANDEAKAWAT IAEAAANVGS RDLADPARFL LQTLFEKGDL SDPALLGIYG
SAARALLTLA WAADPLMQLT ATNAIRFVGK SFASDPVASR TLLDRALRDP HFSAHADKEA
TWLAEQIMPI AGADPDFAVV IFRVLYSRDI TDESMSYFGG QASRIMPLSS NRRQDYRSCR
YNLGRNAGHL LGLSAKWGTR AVIEAALGDA DREAPGGDDR ERVTVAGRPA FDLVGKALGF
NAWDDPDRHR GNQDDDVLTH YVTFLRGCSV DAFAESLDAA ASGYSTPAVW ARLFGVGSER
VIDVADVLWP FASNPTILAH ADTVRDAVRF LAAAYPQRPI DQRNAFEIEA LKSDLFTDDR
ERTWWRHSLS RLLSTVDEAA LSTDSMRAFR ADLAASGELG GNPPLRSMTV SWRSSQGVTR
SLLSRQGVNV DEGIDAQMLA QSEALYELLQ QTPADSDAGG LAALWAATEA TIAFFDAHAD
RLHEHVEQPV WGHVSNAVER IAGSSAYLPG TSGMPTIELL LSVLRRLWVS RFPEPKDRAD
SSLSWGNWDV RVYAAEAYVS LTGRFGAEHH EIAEMIDAIL ADPVPQVRSQ AAQSLQVLSR
IAPERMWRLA ALIAGKEMHP QVVGSFLNYV VWKFTWQEVD RCEAIIETVM ARRLADECKE
SSGHDQVAIP LGGLTAQLWV WQDRSKALGW LAGWSGDPVE HHDLLTSFLS LLRSALFARY
ASGEDHDPAL TNRAQHAAMV ILQACSTIAL DDHATATSDG TDGDAREAAV ARYRAAEQVI
GHLMNQLYFG SGAYADSRNA VIGLKSPDAM HQFLKDYDQI LRLLAGSHEP ATLHHLVELY
EFLIPGDPAG VFDALHSLLL GAGAREGYHH EGLAAPVIVR MIMRYIADHR PIFEDDARRA
RLVQILRLFS DVGWSDALRL LYDLPELLR