Gene RPD_2339 details


Gene Information

Locus tag: RPD_2339
Symbol:
ID: 4022828
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 2610904
End bp: 2612463
Gene Length: 1560 bp
Protein Length: 519 aa
Translation table: 11
GC content: 48%
IMG OID: 637962532
Product: hypothetical protein
Protein accession: YP_569472
Protein GI: 91976813
COG category:
COG ID:
TIGRFAM ID:
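
The start, end, gene length, and protein length fields above are related by simple arithmetic. The sketch below checks that relationship. It assumes the coordinates are 1-based and inclusive and that the CDS ends with a stop codon that is not counted in the protein length. The function names are illustrative and are not part of the database.

```python
# Coordinates copied from the record above (assumed 1-based, inclusive).
START_BP = 2610904
END_BP = 2612463


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the terminal stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 1560 519
```

Both values agree with the Gene Length and Protein Length fields of the record.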


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.431758
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.266165
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAAGAAA TTGATGCCCT GTCGGCCATC CACAAAACGC TCATTTCTTT TAACAAACGT 
GCCGAGCGAT CTTCAAATGA GCTTCTTGTG GCGACGTTCG TTGACTCGGC GCCCCTATAC
GATCTTCTTT CAACTACTAA TAATCAAGTC ATCTATGGGC GGAGGGGTAC TGGAAAGACT
CACGCTTTAA AGTTTCTCTC TGAGAATGTG ACGAAGCAGG GCGATCTTGC GATTTATCTT
GATCTTCGTG GCATCGGATC CAACGGATCA ATCTACAGCG ACTCATCGAA GACCTTGTCT
GAAAGGGCGT CTGTACTAGT TGTAGATGTG CTGAGCGCTG TGCATGATGA ATTATTTCAG
GTCGCTATCG CGCAAATAGA TTCCGCTATC AACCCGGAGC AGATCACTCT ACGCGTTGAT
GATCTTGCTA AATCAATCAC TACCGTAGCG ATTGGTGGGC CCGCAGAAAC TGTAGAGAAG
GTGAAAAACG AGATCTCGTC GAGCGCTGGT GCGATTGCGG GTGGCGAACT GTCTCAGACC
TCAAAGGTCA GTGGAAAATT AAGCAGTGAG GTAGTGCAGA GGGAATCCCG GGATCATGCG
ATAAAGCGTT CAGGTCCGGA GCGTATCCGC CTCAATTTCG GAAGTGTTAT GAGCGCGCTT
TCTGGATTGC TTGAAGTTCT GGGGCAGCGT CGAGTATGGC TCTTGATCGA TGAGTGGAGC
GAAATTCCGG TCGACCTGCA GCCATATCTT GCCGATCTTT TAAGAAGAAC TGTTCTTCCC
GTCGGAAGAA TCACCGTCAA AATTGCGGCG ATCGAGCATC GGTCGAACTT TGCTATTTTG
GAAGATCGGG GCGAGTATGT GGGAATTGAG CTGGGGGCAG ACCTTTCTGC CGATCTGAAT
TTGGATGATT TTCTTGTGTT CGAAAACAAT CAACAAAAAT CTACAAATTT CTTCAAGAGC
TTATTCTTTA GGCACTACAC CACAAGCGAT GACGCGCTAC CGGAGATTGA TACAGCTGAG
AAGCTAATTC AAACCGTATT CACTCAATTT CCCGTCTTCG AGGAGTTTGT CAGAGCGGTC
GAGGGTGTTC CGAGGGACGC TCTGAATCTT GCCGCGAAGA TTGCTACGAA AGCATTCGGA
CAAAAAGTGG CAGTCCAGCA TGTCCGCGGA GCTGCACGAG ATTGGTATCA GCAGGATAAG
GCATCCGTTA TCCGTAGCAA CGAAAAACTC GCGACCCTTC TGAATAACGT CATTGATGAA
GTGATTGGAA ATCGAAAGGC GCGAGCATTT CTTATCAGCA GCAGCGCGCG CGATGCTAGA
ATTGATCAGC TGTTTGATTC TCGCTTGCTT CACATTCTTA AGAAGAACAT TTCATCAAAC
GACGAACCTG GGGCTCGCTA TGATGTTTAT AAAATCGATT ACGGTTGTTA CGTTGATCTG
ATCAATACTG CCAAAATGCC GGCGGGGCTC TTTCTGATTG AAGAAGATGG TAAAGAGCAT
TTCGTCGAAG TCCCTAGGGA CGATTATAGA TCAATTCGCC GCGCGATTCT GAGAATATGA
 
Protein sequence
MQEIDALSAI HKTLISFNKR AERSSNELLV ATFVDSAPLY DLLSTTNNQV IYGRRGTGKT 
HALKFLSENV TKQGDLAIYL DLRGIGSNGS IYSDSSKTLS ERASVLVVDV LSAVHDELFQ
VAIAQIDSAI NPEQITLRVD DLAKSITTVA IGGPAETVEK VKNEISSSAG AIAGGELSQT
SKVSGKLSSE VVQRESRDHA IKRSGPERIR LNFGSVMSAL SGLLEVLGQR RVWLLIDEWS
EIPVDLQPYL ADLLRRTVLP VGRITVKIAA IEHRSNFAIL EDRGEYVGIE LGADLSADLN
LDDFLVFENN QQKSTNFFKS LFFRHYTTSD DALPEIDTAE KLIQTVFTQF PVFEEFVRAV
EGVPRDALNL AAKIATKAFG QKVAVQHVRG AARDWYQQDK ASVIRSNEKL ATLLNNVIDE
VIGNRKARAF LISSSARDAR IDQLFDSRLL HILKKNISSN DEPGARYDVY KIDYGCYVDL
INTAKMPAGL FLIEEDGKEH FVEVPRDDYR SIRRAILRI
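
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The following is a minimal sketch, not the tool used to produce this record. It relies on the fact that translation table 11 assigns amino acids to codons the same way the standard genetic code does; the table's alternative start codons are not handled, which does not matter here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Standard codon-to-amino-acid assignment (shared by translation table 11),
# listed in the conventional T, C, A, G base order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used above) is ignored.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate("ATGCAAGAAA TTGATGCCCT GTCGGCCATC"))  # prints: MQEIDALSAI
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to the same function should give the complete 519-residue protein.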