Gene RPD_2333 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_2333 
Symbol 
ID4022822 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp2603101 
End bp2605623 
Gene Length2523 bp 
Protein Length840 aa 
Translation table11 
GC content66% 
IMG OID637962526 
Producthypothetical protein 
Protein accessionYP_569466 
Protein GI91976807 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.761339 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTCAAA GCGCACTTGG TGTCATCGCA GGCGTCGATC GCAAGACCCT CTCGACCTGC 
CCGGCCTCCG ACAAGCTGTG GGCCGCGCAT CTCGGCGCGT CGCTCTGCCT CTCCTTCATC
GTCGTGCTCG GCGTTTCGTA TCACGCCACC GGCTACATGA TCGAGAGCGT CTCGACGCGG
TTGCTGGTCT CCGGCGTGAT CGCCCTCACC GTGCTGATGT TCGATCGCGC GCTGTGCCAG
TCGGACTGGT TCTATCAGGG GACGCTCTGG GATACCGCGC CGATCCAGTC GAGCGCCGAA
GCAAAGCAGA CCGCCTGGCG GTTCGTGCGC GTCGGGATCC GGCTGCTGAT GTCGTTCGGG
CTCGCCTGGG TGATCGCGAT GTTCCTGGAA CTCGCGATCT TCTCCGGCAC CATCAACGAG
AAGATCGAGG CGGACCGCGT CGCCGCCAAT CAGCCGATCT ACAACAAGAT CGCGCAATAT
GAGTCGGGGC TGAACGCCGA GGTCGCCCGC CGGCAGGCCA GCATTGATGC GCTGGAGGCC
CTGCACCGCG ACGCCCTTAC CGAGGCGCCG GCGCCGGAGC CGAACCTCCA GACCCGCAAC
GACGACATCG AACAACAGAT CAAGGCGCTG GTTGTCCGCG AGAGCGAAAT CCGCGCCGAT
ATTGGTCAGA TCGACCGCAC CATCCAGCGC TACATCGCCG ACATGAACGC CGAAGAGTTC
GGGATGAAGG CCTCGCCGAA CAATTCGGGC CGTCCCGGCG CGGGGCCGCG CTATGAATTC
GCCAAGAAGC AGAAAGAGGC GTTCCTGGTC CAGCGCGCCG CGCGCGAAGC CGAGATCGCC
CAGCTCCACG TCAAGCGCGA CGAACTGCGC GCGGCGCAAT CCAAGATCGG CGCCGAGGCG
CTGGTCGCGC GCGATCAGGA GCGCGCCGCG ATCAAGGCCA AGGCCGATGC GCTGCAGACC
CGGATCGATG CCGCCCGCGC CGAGCTCAAG TCGTTCGAAA CGGCCAAGGT CGCCAGCGTC
GCCGAGTTCC GCAGCAAGGC GCTGGCCGAG TCGTACTACC AGGACAAGAG GGAGCTGGTC
GACCCGCTGA CACGGATCGC AGCCTATCAG GAGTTGAAGA ACGACCCCAA GGACGGCGGC
ACCATGACGC TGTTCTCGTG GATGACGCGC TTCTTCATCA TCTTCCTCGA AATCGTTCCA
GTGGTTGCGA AAATCTTCTT CTCGCCGCCG AGCGTCTATG CCGCCAAGAT CCAGGCCCAG
GTCGAGCGCG CACGGCAGCG GATCGAAAAC AATGAGGATC TAGACGACGA CAAGCCGGCG
GAGCCCGCCC CCGTGCTGGC GATGGCCGCG ATGCCGCTGC CGGCGATGAG CCTCGACCCG
GTTGTTGTGA AACCTGCGCC GCAGCCGAAG GCCGAGCCGC CGAAACAGCC GGCCCGACAG
CCCGAGCGAC AATATTATGA GGAAGACCAG CACGAACTGC CGCGGCGACC GGCCCGCGAC
CGCGACGACC GCCTCCCCGA CTACAGGCCG CGCGAATATG CCGAGCGCGA CGCCGATCCG
CGCGGCTATG GCCGGCGTCA CGACGATGCG CGACGCTACG GCGCGCGGGA CGACTATGCC
GGGTATGACG ACTTCGATCG CAGGAGTTTC GAGCGTCGTG ACCTGCATCC CCGCTACGAC
GACGAACGCG ACTACGATCG GCCGCGGACG GCGCCGGTGC GCCGACGCAC GCCGCAATTC
GCCGCCCAGG ACTATGCGGC GCACGACTAT GCAGCTCGAA ATTTCGGCGC GAGCAGCTAC
GGCGAGCCGG AGTTCGATAC GAGCGAGATC GACGTCCGCA ATCTGATCGT GCCGCGGAGG
GGTGCGCCGC GGTATGACAC CGCGGACCAA CAGCCGGGCA GCAGAATCCG ATTTACTGTA
AAGGACGACG CGCCGGCCAC ACCGGACCAC GCCGCACGGC CCGCCGCGAA CGCCGAGCGC
AATCTGCGCT GGCCGGACGG CCTACCGGTG GACGATCTGA ACACCCGGGA ACTGCTGCGG
CAGGTGCTCG CAGATGCCGA AACCGGCCCG CGCGCCGAGC CGGAGCGCAA GCAGATGGCC
GCAATTGCGG ACACGCCGCC GCCGGTTTCC GAGGATGCGC CGATCCCCGC TGACGCGACC
GCGCAGCCTG TCGTCGCTGA CGCGCCAGCT CAGCCAGTCG TCGAGGCCGA AGCGGCCGCA
GAGCCGGATG TGATTCCGAC ATTTCTCGGG AAAAAGAATC ACGGATCCAG CGGCGAAAAC
TCACCGGCCG AGGCGGCGCC TCCCGCGCCG CACGCAACCG AAGAGGTTGC GGCTCCGCCC
AGTGAAATCG TCGGCCCGGT TCAGAAATAC AGCGCCGACG TCGAGAAGCT GATGACCGAG
GCGATGGAAC TATCGCGCGC CAGAAAGAAG GCGGCACATC CGCGGCGCGA CCATCACCAC
GAGGGCGACG CCGAATTGCC TCTGGACAAC AAGGTCCAGT ACGAGATCTT TCCGAAGCAA
TAG
 
Protein sequence
MIQSALGVIA GVDRKTLSTC PASDKLWAAH LGASLCLSFI VVLGVSYHAT GYMIESVSTR 
LLVSGVIALT VLMFDRALCQ SDWFYQGTLW DTAPIQSSAE AKQTAWRFVR VGIRLLMSFG
LAWVIAMFLE LAIFSGTINE KIEADRVAAN QPIYNKIAQY ESGLNAEVAR RQASIDALEA
LHRDALTEAP APEPNLQTRN DDIEQQIKAL VVRESEIRAD IGQIDRTIQR YIADMNAEEF
GMKASPNNSG RPGAGPRYEF AKKQKEAFLV QRAAREAEIA QLHVKRDELR AAQSKIGAEA
LVARDQERAA IKAKADALQT RIDAARAELK SFETAKVASV AEFRSKALAE SYYQDKRELV
DPLTRIAAYQ ELKNDPKDGG TMTLFSWMTR FFIIFLEIVP VVAKIFFSPP SVYAAKIQAQ
VERARQRIEN NEDLDDDKPA EPAPVLAMAA MPLPAMSLDP VVVKPAPQPK AEPPKQPARQ
PERQYYEEDQ HELPRRPARD RDDRLPDYRP REYAERDADP RGYGRRHDDA RRYGARDDYA
GYDDFDRRSF ERRDLHPRYD DERDYDRPRT APVRRRTPQF AAQDYAAHDY AARNFGASSY
GEPEFDTSEI DVRNLIVPRR GAPRYDTADQ QPGSRIRFTV KDDAPATPDH AARPAANAER
NLRWPDGLPV DDLNTRELLR QVLADAETGP RAEPERKQMA AIADTPPPVS EDAPIPADAT
AQPVVADAPA QPVVEAEAAA EPDVIPTFLG KKNHGSSGEN SPAEAAPPAP HATEEVAAPP
SEIVGPVQKY SADVEKLMTE AMELSRARKK AAHPRRDHHH EGDAELPLDN KVQYEIFPKQ