Gene RPD_0015 details


Gene Information

Locus tag: RPD_0015
Symbol:
ID: 4020469
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 19336
End bp: 22488
Gene Length: 3153 bp
Protein Length: 1050 aa
Translation table: 11
GC content: 62%
IMG OID: 637960191
Product: hypothetical protein
Protein accession: YP_567156
Protein GI: 91974497
COG category: [R] General function prediction only
COG ID: [COG1483] Predicted ATPase (AAA+ superfamily)
TIGRFAM ID:
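
The coordinate and length fields above are mutually consistent, which a short check makes explicit. The following is a minimal sketch, assuming that start and end coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Protein length for a complete CDS, assuming the last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(19336, 22488)
print(length)                     # 3153
print(protein_length_aa(length))  # 1050
```

Both results match the Gene Length and Protein Length fields reported for RPD_0015.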


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.195292
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGCTGCAGA CAATCAAGGA TGCCTGCCAA TTCGATCCAA AGGCGATCGA CTATGCGCTG 
AGCGACCAGA TCGAGAATCT CGACGACCTG GTCGGTCACG ACCCCGTCGC CGCCGAGGCA
TTTTTCAGGA AGACCTACGT CACCGGCGGT ATGAAAACGT TGCTCCGCCA AGGTCTGCAG
CGCCTCGCGG GCAGTTCCGG ACAGGCGGTC TTCGAACTTA AGCAAGCGAT GGGCGGCGGC
AAAACCCACT CGATGTTGGC GTTGGGCTAT CTGGCGGCCA ATCCCAAGTT GTTCGAACTC
GTCGCCAAGG ACATCACGCA AGGCATCAAG GCCGAGCCGG CGCAGGTCGT CGCGATTTCC
GGCCGCTCGA TCTCTCGCGA CCAGCATCTA TGGGGTGACA TCGCCGACCA ACTCGGCAAG
GCGGACAAGT TTCTCGAGTT CTACAAGGGC ATGCCGCAGG CCCCTAACGA AAAGGACTGG
ATCGGTTTGA TCGGTGACGC GCCGACGCTG ATCTTGCTCG ACGAGCTTCC GCCCTATTTC
AAGAATGCCA TCACTCAGAA CGTGGGCGGC GGCACCCTGG CCGACGTCAC CACCTATGCC
GTCTCGAACT TGCTGTCGGC GGCACTGAAG TTGCCGCGGC TGTGCATCGT GATCTCGAAC
TTGTCCGGCG CCTATGAGGG CGCCACTAAG AGCATCACCA CGATGGTCGC GAAGGCCACC
CGGGATCTCC AGAACGAAAC CGGTCGGCAG GCCAAGGGCA TCACGCCGGT CGAGCTCGGC
TCCGACGAGA TCTACAACAT CCTGCGCACC CGTCTCCTCA CCAAGGACCC CGACCAGAAG
GTCATCAACG CCGTCTCCAC GGCGTTTTCC GACTCGATCT CCGACGCCGT CAAATCCAAG
ACGATCGCGA AGTCGGCCAC CCAGATCGCC GACGAGATCG CCGCCAGCTA TCCTTTCCAT
CCCTCCTTCA AGCACATCCT GGCACTGTTC AAGGAAAATG AGCGTTTCCG TCAGACGCGC
GGCCTGATGA CGATGGCGGC CCTGATGGTG AAGTCGGCGC TCGGTCGACC GACCAACGAC
GTCTATCTGG TCGGCTGCCA ACACATCGAT TTGGCCCAGC CGGACGTGCG TGACGTCATC
ACCAACGTCT ACGATCTCTC CGGAGCGATC ACCCACGACA TCGCCGGCAC CGCCACCGAG
CGTGCGCACG CCCAGACCAT CGACGACGTG GCCGATACCG ATGCCGCCAG CCAGGTGGCC
AGGCTTGTCC TGATGTCCTC GTTGTCCGAG GCCAACGACG CCGTGAAGGG CCTCACCAAG
TCGCAGGTCG TCGAGAACCT CGTCGCCCCG CAGCGCTCGC CACATGAATT CGATGAGGCA
TTCGAAAAGC TCCGGATCGA GTGCTGGTAT CTGCACCGCA AGGAGAACGA CGCCTGGTAC
TTCTCGAAGA ACGAGAACCT CAAGAAGAAG ATCGAAAAAT ACGCCTCGAC CGCCGCGCAA
CCGAAGATCG ACGCTGAGAT GGAGCGTCGA CTGCAGATGG TCTTCGAGCC GAAGCGCCGC
AATGCCTATT CGTCGGTGCA GGCCCTGCCG AAGATCGAGG ACATCAAGCC GTCCGGGGAC
CGCGTGCTGC TGGTGTTGAG CCCCGACAAA CGCGTTCCTC CCGAGGATGC GCAGCGGCTA
TTCGATGCGA TCGTCGAGAA GAACAACTTC TGCGTGGTGA CCGGCGACGG CACCGACCTG
GCGAAGCTCG AGGACAAGGT CCGTCGGATC TGGGCGACGG CCAAGGTTCT CCAAGAGGAC
GGCGGCGATC GCTCGCCGAA CCTGGCCGAG CTTCAGGAGG AGACCGAGAC CGCCGAGTTC
GAATTCAACT CGACCCTGAT CGCCCTGTTC AATCGCGTGT ACTATCCGTC CAGACTTCCG
AAGCCGCTGC CCAACGGTGC GAATGAGGGC CTGACCTATG CGGCCCTGAA GCTCGTGGAA
CGTCGCACCA GGGAGAACGG CCCCACCACG ATCGACGGAG AAGCGGCCGT CGAGGAAGCG
TTGAGCGCCA CCGGCGCCTC GAAGCTCATT CTGACGCCCG CGGACGAAGG TAACGCGGCC
AGCCTGCGAA ATCGCGCCCA GGATGTGTTG TGGGGCACGA CGGAGCGAAA GACTCGCTGG
AAGGACGTCG AGGAGCGCGC GATCAGCAAC GTCCGCTGGC CGTGGCTTCC GCCCAAAAGC
CTCGATGAGA TCAAGCGCAC CGCCGTCAGC ATCGGCGAAT GGCGCGACTC CGGCGACGGC
TACATCGAGA AGGGACCTTT CCCTCAGGCG AAGACCTCCG TGAAGGTCGT CACCCGCGCC
TACAACGATG ATACCGGCAT GGCGACGATC GAGCCGACGC CGGTCGACGC CGGACCGAAC
GCCCGCGTCC ACTTTGCGCC AACCAAGGAC GTGTCGGGGT CGTCGCCGGT CGTTCCGGAC
ACTATCTTCG ACCGGGATGA CACCGTGCTC TGGTTCTTGG CGGTCGATCC GGACGGCAAG
CACGAAACCG GCGACCCGGT GAAGTGGAGC AACACTCTTA CGCTCACCCA TCAACCGAAG
GAGGTCATGG GCAAGCGTAC CGTCGAGTTG ACGGTGAAGC CGCGTGGAAC GATCCGATGG
AACATCGACG GTACGAACGC GCGAGAGGGC AAGGCTTATT CGGGTCCGAT CGCCCTGAAC
GGTGACGACG AAGTGAAGAT CTACGCCTAC GCCGAGGACG CCGGCGTAGA GGTGACCAAG
GTTTTCCCAA TCCGAGCCGC AGCGGAAGAC GAGTTCAAGA TCGACCCGGA ATTGCCCGTC
GCTATCAAGA AGCGTCAGAA GATCGTCTCC ACCAAGGACG TGTTTGCGAC GCTCAAAGCT
CTCAAGGAAG CGCGCGGCCT CCTCAAGGGA AGCCTTTCCG CCACGGTTGG TCAGGGCGAC
GTGAACGCGA CCACTCGGTT CGGACCGCAG ACCAATCTGA ATTCGGCCGC GATCGAGACC
TTCCTTGGCG CTGCGCGCCT CTCGATCGCC CAGGAGGCGG CTGAGGTCGA AATCGGTTTC
TCGGAAGTTC ACTTCGAGAC CGGCCGCGAG ATGGAAGAGT TCGTCAAAGC AGTTGGCTGG
ATCGTTTCAC CAAACGAGGT TGAACAGCAG TGA
 
Protein sequence
MLQTIKDACQ FDPKAIDYAL SDQIENLDDL VGHDPVAAEA FFRKTYVTGG MKTLLRQGLQ 
RLAGSSGQAV FELKQAMGGG KTHSMLALGY LAANPKLFEL VAKDITQGIK AEPAQVVAIS
GRSISRDQHL WGDIADQLGK ADKFLEFYKG MPQAPNEKDW IGLIGDAPTL ILLDELPPYF
KNAITQNVGG GTLADVTTYA VSNLLSAALK LPRLCIVISN LSGAYEGATK SITTMVAKAT
RDLQNETGRQ AKGITPVELG SDEIYNILRT RLLTKDPDQK VINAVSTAFS DSISDAVKSK
TIAKSATQIA DEIAASYPFH PSFKHILALF KENERFRQTR GLMTMAALMV KSALGRPTND
VYLVGCQHID LAQPDVRDVI TNVYDLSGAI THDIAGTATE RAHAQTIDDV ADTDAASQVA
RLVLMSSLSE ANDAVKGLTK SQVVENLVAP QRSPHEFDEA FEKLRIECWY LHRKENDAWY
FSKNENLKKK IEKYASTAAQ PKIDAEMERR LQMVFEPKRR NAYSSVQALP KIEDIKPSGD
RVLLVLSPDK RVPPEDAQRL FDAIVEKNNF CVVTGDGTDL AKLEDKVRRI WATAKVLQED
GGDRSPNLAE LQEETETAEF EFNSTLIALF NRVYYPSRLP KPLPNGANEG LTYAALKLVE
RRTRENGPTT IDGEAAVEEA LSATGASKLI LTPADEGNAA SLRNRAQDVL WGTTERKTRW
KDVEERAISN VRWPWLPPKS LDEIKRTAVS IGEWRDSGDG YIEKGPFPQA KTSVKVVTRA
YNDDTGMATI EPTPVDAGPN ARVHFAPTKD VSGSSPVVPD TIFDRDDTVL WFLAVDPDGK
HETGDPVKWS NTLTLTHQPK EVMGKRTVEL TVKPRGTIRW NIDGTNAREG KAYSGPIALN
GDDEVKIYAY AEDAGVEVTK VFPIRAAAED EFKIDPELPV AIKKRQKIVS TKDVFATLKA
LKEARGLLKG SLSATVGQGD VNATTRFGPQ TNLNSAAIET FLGAARLSIA QEAAEVEIGF
SEVHFETGRE MEEFVKAVGW IVSPNEVEQQ
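
The sequences above are laid out in space-separated blocks of ten characters, so they need to be cleaned before any downstream use. The following is a minimal sketch of that cleanup plus a GC-fraction calculation. The helper names are hypothetical, and the calculation is an illustration only; it is not claimed to be the method used to derive the 62% GC content reported in the record. Only the first row of the gene sequence is used here to keep the example short.

```python
def clean_sequence(block_text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(block_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First row of the gene sequence as displayed in the record.
first_row = "ATGCTGCAGA CAATCAAGGA TGCCTGCCAA TTCGATCCAA AGGCGATCGA CTATGCGCTG"
seq = clean_sequence(first_row)

print(len(seq))               # 60
print(seq.startswith("ATG"))  # True
print(f"{gc_fraction(seq):.2%}")
```

Running the same two helpers on the full gene sequence should give a length of 3153 bases if the record is internally consistent.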