Gene RPD_3941 details

Gene Information

Locus tag: RPD_3941
Symbol:
ID: 4024457
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 4380650
End bp: 4382389
Gene Length: 1740 bp
Protein Length: 579 aa
Translation table: 11
GC content: 60%
IMG OID: 637964145
Product: hypothetical protein
Protein accession: YP_571063
Protein GI: 91978404
COG category:
COG ID:
TIGRFAM ID:
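
The coordinates and lengths in this record are mutually consistent, which can be checked with a few lines of Python. This is a minimal sketch that assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminating stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes the stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length(4380650, 4382389)
print(length)                  # 1740
print(protein_length(length))  # 579
```

Both values match the Gene Length and Protein Length fields listed in the record.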


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGCAGC TTCCCATCCA CAACTACAAG CTCTACCCGC GCATGGCTCA CTGGTTCAAT 
CCAGTGCTGC TTTTCAAGCT GCTGGTGAAC GTCGTCATCT CCTCGGTCTT CGGATCGTAC
GCCGACAGGC GGCTGCTCAT CGCCGCGCTC GATACGACGG ACACGCAGAA GCTCCTGCAG
CGCGCGCGGG AAACGAGAGA GATGCTTCAA CCGGGCCCGG ACGGAGCCCT CTGGCTCGAT
TTCGTCGCTG ATCTCGGCGA CGGCTTCGAC AGCACGTACT CGGTCGCCAC CCTGCTCGCG
CAGAAGCAAC TACGGGTCGG CGGCAGGGAT CTTCCGCGCG GCCAAGCCCT GATCATGGGC
GGTGATGAAG TCTATCCGAA AGCCACCGCT GACGCCTACC GGTACCAACT ATATTGGCCT
TACGCCTGGG CATCACCCGA TCCACATCCA GGAGAAGCGA CAGGCACGCC CCTCTTCGCC
ATTCCGGGAA ACCACGACTG GTACGACGGG TTGTCGCTAT TCCTTGCCTG GTTCTGCCGC
GCGAAGCCGG TGCGCTTCGG AAGCTGGCGC ACCGTCCAGC GACGCAGCTA TTTCGCCAAC
CAGATCACGG ACACCTGGTG GATCTGGGCG ATCGACATCC AGCTTGCCGA CAACATGGAC
CAGCCTCAGG CGGACTATTT CAAGACCATC GCCGAGAACA TGCCCGAGAA TTCGAAGATC
ATCCTTTGCA GCGCCGAACC CGGTTGGCTG TACGTGGAGA CGTCGTCCGA ATCCACGTCT
TGGGAGATCG TCGAATATGC GATCGAACTC GCCGAGAACG CGGGGAAAGG ACTGACGGTG
CCGGTCGTGC TCTCCGGCGA CACCCACCAC TACAACCGGT ACACCGGCCT GAAAAATCAG
CAGTACATCA CCTCCGGCGG GGGCGGCGCC TTCCTTCATC CAACGCACCA GTTGGAGGAC
GTCATCCCGT TGAGGCGCTG CGGCGTAAAT CAATCGCTGA CCCTGGCCAG CGCCTCTGAT
AAGGGAGCGG GACCCGCCGT CTATCCGGGC TTCGAACTCA GCAAGTCCCT GGTATGGCGC
AATCTGTATT TCGCACTCAC GAATTGGGAT TTTTCGCTCC TTATGGGCAT GGTTTATTTC
CTTTTCGGTG TAGCTATCTC GCTACGACCC CATTGGGACA TGTACCTCGC TACGGTCGCA
ATCCTCGCAT GGTCCCTGAT GGGCTACACG ATCAAGCAGG AAAAATCGAA GAGGCGCGCG
GTCGTCCTGA CAAGCGCCCT GCACTCGCTC GCACATGCGG CGGTCGTGAT CGGCGCCGGC
ACGTACTTCG TGGCGCTGAA CGCTGCAATC TTCCTGTTTG AGGGTCCCTA CGCTGTGCAC
CTGTGGCTGC TCGCGCTCCT CGTCGAGATG TTTCCGATCG GCTTCGCTTT GGGATCGAGC
CTTTTCGGAT GGAACATGAT GCTGACCTGT CGATACCTGC AGATGAACCG GAACGATGCA
TTCAGCGCGC TGCGGATCGG CGCGTACAAC AACTTCGTCC GGATGCGGAT CACCGAGGAC
GACATAGAGT TCTTCGTCGT CGGCCTCGAC GCTGTCCCTT CGCGAGGTGA TTGGAAAGAA
AATCCGAAGC ACGGCGCGCA TACGGCAGAT GAACCCCGCT TCATTCCGGC AACGCCCTTG
ACGCCTCACC TCGTCGAAAG CTTCTCATTG AACAAGCCCG TCACGAAGCC CGAGGTCTGA
 
Protein sequence
MPQLPIHNYK LYPRMAHWFN PVLLFKLLVN VVISSVFGSY ADRRLLIAAL DTTDTQKLLQ 
RARETREMLQ PGPDGALWLD FVADLGDGFD STYSVATLLA QKQLRVGGRD LPRGQALIMG
GDEVYPKATA DAYRYQLYWP YAWASPDPHP GEATGTPLFA IPGNHDWYDG LSLFLAWFCR
AKPVRFGSWR TVQRRSYFAN QITDTWWIWA IDIQLADNMD QPQADYFKTI AENMPENSKI
ILCSAEPGWL YVETSSESTS WEIVEYAIEL AENAGKGLTV PVVLSGDTHH YNRYTGLKNQ
QYITSGGGGA FLHPTHQLED VIPLRRCGVN QSLTLASASD KGAGPAVYPG FELSKSLVWR
NLYFALTNWD FSLLMGMVYF LFGVAISLRP HWDMYLATVA ILAWSLMGYT IKQEKSKRRA
VVLTSALHSL AHAAVVIGAG TYFVALNAAI FLFEGPYAVH LWLLALLVEM FPIGFALGSS
LFGWNMMLTC RYLQMNRNDA FSALRIGAYN NFVRMRITED DIEFFVVGLD AVPSRGDWKE
NPKHGAHTAD EPRFIPATPL TPHLVESFSL NKPVTKPEV
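
The sequences above are printed in blocks of 10 characters, 6 blocks per line, so they need to be joined before any analysis. The following is a minimal sketch of how that could be done, together with a simple GC-fraction calculation; the helper names are hypothetical, and only the first line of the gene sequence is used here as sample input.

```python
def clean_sequence(block_text: str) -> str:
    """Join a block-formatted sequence by removing all whitespace."""
    return "".join(block_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGCCGCAGC TTCCCATCCA CAACTACAAG CTCTACCCGC GCATGGCTCA CTGGTTCAAT "
seq = clean_sequence(first_line)
print(len(seq))                # 60
print(seq.startswith("ATG"))   # True
print(round(gc_fraction(seq), 2))
```

Applying clean_sequence to the full gene sequence and passing the result to gc_fraction would give a value to compare with the GC content field of the record.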