Gene RPD_1406 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_1406 
Symbol 
ID4021883 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp1570732 
End bp1572501 
Gene Length1770 bp 
Protein Length589 aa 
Translation table11 
GC content67% 
IMG OID637961598 
Producthypothetical protein 
Protein accessionYP_568544 
Protein GI91975885 
COG category[S] Function unknown 
COG ID[COG3108] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTTGACG ATCGACTTGG GGAAACGGGC GTCGTTTTCG GTTCGGTAAC GCTCCGTCGG 
CGATGTAACG ACCCTGTCCG GCATTTGGTC TGGCAGACTT GGTCTGACAG ATCTGGGGTT
TGCCGACAGG CTCCCGTTCG GTCGTCCTTC TGGTGGGGCT TCTTCGTGCT AGCTGGACTC
ACGCGCCGTC TAAAATCATT GTCATTTCCC AGAGCCGGCT ATGGCGCCGT GCTGAGTTCG
GCCGTGTTGC TGGCCGGCGC GGGATCCGTG CACGACGCCT CGGCGGTCGG GGACAGCCGC
ACCCTCTCCT TCCACCACAC CCATTCCGGC GAAGACCTCA CCGTCACCTT CAAGCGCAAC
GGCCGCTACG ACGAAGAGGC GCTCGGCAAG CTCAATCACT TCCTGCGGGA CTGGCGCAGC
CAGGACAAAA CCGTGATGGA CCGGACGCTG TTCGACATCC TCTGGGAAGT CTATCGCGAC
GTCGACGGCA AGCAGCCGAT CCAGATCATC TCCGCCTACC GCTCCCCCGC CACCAACGCG
ATGCTCCGCC GCCGTTCCTC CGGTGTGGCC CGCCATAGCC AGCATACGCT GGGCCACGCG
ATGGATTTCC ATATTCCGGG CGTGGCGCTC GAGCAGATCC GCTTCGCCGG GCTTCGCCTG
CAGCGTGGCG GCGTCGGCTT CTACCCGACC TCCGGCTCGC CCTTCGTCCA TCTCGACACC
GGCCGGATCC GCCACTGGCC GCGCATGAGC TCCGACCAGC TCGCCAGGGT GTTCCCGGAT
GGCCGCACCG TTCACCTGCC GACCGACGGC AGGCCGCTGC GCGGCTACGA ACTGGCGCTA
GCCGACATTC AGAAGCGCGG CGACGGCAGC AACGCCTCGT CGTCCAAATC CAATTTCCTG
ACCGCGCTGT TCCGCGGCAA ATCGGCCGAC GACGAAGACG AGGGCGCGAG CGCCGCCAGC
GCAGCCGCCA GCGCCAAACC GGCCGGCAAG CTTCCGGACA TCAAGGTCGC GGACGTCAAG
GCCGCCGAGG TCAAGCTCGT CAACACCGAG CCGGTGCCGA TGCCGCGCGC TAAGCCGGCC
GTGCCGATTC AGGTCGCGTC CGCCGACAAC ATCGTCCCGC TGCCGTCCGC GAAGCCCGCC
AGACAGGCAG CGAAGTCCGA ATCGAAGCCG CAGACTCCGA CGGATATCAT CAACGCGCGC
GGTTTCTGGG ATGACATTCC CGCAGCGCCG AAGCAGGCCA GCCCGGCGCA AGTCGCCGCG
ATCAGCGCCC GTCAGGCGCT GGAGGCGGCG GACCCGCAAT CGCCGATGAA CGCGATGGCA
TTTGCCGCCA ACTCGAGCGA GAAGGCCGCG AAGCCGCAGA CCCATCCGCA AGTGGTGACA
GCGAGCGCGC CGATCCCGTC CGGTAGCCGA TCCGCATCGC TGGCGCGCCA TCCGGTGACG
ACCAGCAGGA TCGACACCGT GGTCGGCAAG ACCGCACAGG GCAAAGTCGG CGTGGTCGCA
AATTCAGCGC GGATCACCGC CGCCGGCAGC CGCGACAGCG ACGTCTGGAT GCGCGCGATG
ATCCTGATGC CGAGCGCGGT CACGACCGGC GCCACGGCGA TCGGCGATGC CGACATGACG
CAGCTGACCA AGCACTTCGC CAAGCCGCAG ACCACGGTGA CGATGGCGTT CAGCGACGAT
CCGCAGCCCG GCCTCTACGC CGACACGTTC ACCGGCCCGG CCGTGGCGAC GGTGTCGACC
ACGACCTTCA CCACCGCCGC GCTGCGCTGA
 
Protein sequence
MVDDRLGETG VVFGSVTLRR RCNDPVRHLV WQTWSDRSGV CRQAPVRSSF WWGFFVLAGL 
TRRLKSLSFP RAGYGAVLSS AVLLAGAGSV HDASAVGDSR TLSFHHTHSG EDLTVTFKRN
GRYDEEALGK LNHFLRDWRS QDKTVMDRTL FDILWEVYRD VDGKQPIQII SAYRSPATNA
MLRRRSSGVA RHSQHTLGHA MDFHIPGVAL EQIRFAGLRL QRGGVGFYPT SGSPFVHLDT
GRIRHWPRMS SDQLARVFPD GRTVHLPTDG RPLRGYELAL ADIQKRGDGS NASSSKSNFL
TALFRGKSAD DEDEGASAAS AAASAKPAGK LPDIKVADVK AAEVKLVNTE PVPMPRAKPA
VPIQVASADN IVPLPSAKPA RQAAKSESKP QTPTDIINAR GFWDDIPAAP KQASPAQVAA
ISARQALEAA DPQSPMNAMA FAANSSEKAA KPQTHPQVVT ASAPIPSGSR SASLARHPVT
TSRIDTVVGK TAQGKVGVVA NSARITAAGS RDSDVWMRAM ILMPSAVTTG ATAIGDADMT
QLTKHFAKPQ TTVTMAFSDD PQPGLYADTF TGPAVATVST TTFTTAALR