Gene RPD_3656 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_3656 
Symbol 
ID4024170 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp4079834 
End bp4081918 
Gene Length2085 bp 
Protein Length694 aa 
Translation table11 
GC content63% 
IMG OID637963860 
ProductAAA ATPase, central region 
Protein accessionYP_570780 
Protein GI91978121 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0465] ATP-dependent Zn proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAAGA AGAACCACAA GCGGAAGTCG GCCGATCGTC CGAAAAATCG GATGCCGGCG 
CAATGGGCCG CCAAGGGCGT CCCCGTGACC ATGAAGGTCG ATGAAGCGAC CCGCACGATC
ACGCAAGCCA AGAGCGAGGA CAGGTTCGAC CTTCCCGACG AAATGGATGA ATCCGCACCG
AATGCGGAAA AGCCGAAAGC TCCCCAGGCC CGGCCACGCA CCAGTGCGAT GACCGCCATC
GTCGGCTCGC TGTTCGATGC CTCGATCCCG CGGGATGTTC GCCGCCGGCT GAACCATGCC
AAGGCCCTCG CGGTCGTAGT CCTGGTCCCG GGACCCGCGT GGGTCGTGCC GGCCGAGATC
TACTTTCGGG ATGTCTTTGG GACCCGGTGG ATCCGGCATG CACGCGATGG CGCCGCAAAG
AGCTCGCTGA AGTCGTCGGG TGACTCCGAC GAGGCAGCAA GGGATCTTGC CCGGGGACGC
TGCGTGGTGG GCTTCGCTGC CGACGCCAGC CATCTTCCTT CGACGCTCGT CACTGCCGCC
GACATCACGA TCCGCCTTTC CGCTCCGACA GGACCGGTTC TCCGTACGGC GATCACGCGC
TTTGCGAGAC GTTCGCCCGG TGATCTCGCC GAAGGCCATG CGGTCGGCCT CGATCTCAGC
GACATCGTGG CCGCATTCCG CCCGGGCTCC GGAGCGGAGC GCATCGCGCA GCGGCTGGCG
AACTCTGTTT CGATCGTCAA CGGAACCGGC GGTCCCGGGG AATCGATCCC CGATCTGTCC
TCAGCCGTGG AATATGGAGC GGCACAGAAA TTTGGTCTGG GTCTCGCTCG AGACGTCAGA
GATTATCGTT TAGGAATTAT TTCCTGGTCC GACCTGCCTC GCGGAATAGT GCTGCACGGC
GAGGTCGGCA CCGGCAAGTC GCTTTACGCA AGAATGCTGG CGAGCGCGTG CGCACTACCT
CTGGTTTCAA CATCTGTTGC CGACTGGTTT ACCGGACCAC GTGGAGGGTA CCTCCACGAC
GTCATAGCAC AGATGCGCGG TGCCTTCGAT CGGGCAACTT CGCTCGCCGG CGCTAAGGGC
TGCTGCCTGC TCTTTATGGA TGAATTAGAT AGCATTCCCA ACAGGTTGAC CCTTGATAGC
AGAAATTCCG ACTACTGGAA ACCGCTTTGC AACGATCTGT TGACCCGCCT GGATAATTCC
ATGTCCGATA CCAGACGGGG AATCATAGTC GTATCGGCTA CAAACAACCT AAGCGCGATC
GATCCTGCTT TGATGCGCCC CGGCCGACTC GAATTGGCGA TCGAGATTGA AAGGCCCGAT
TTCGCCGGCA CTCTGAACAT CCTCCGCCAC CACGCGCCCG ACCTTCCGCA CGCGGACCTC
TCCGTAATCG CGTCCCTCGC CGAACGTTCG ACGGGCGCCG AGATCATGTA CCTGGTGCGC
GAGGCGAGGC GCCGGGCGCG TCATGCCGGC CGTGCGCTGA CGCTCGAAGA TCTGAAATCC
ACGCTGATGC CTAAGATAGA GGTAGGCGCG GACGGATTCT GGCGGATTTG TCTGCATGAG
GCGGGGCATG CCACCGCGGC GCTCTGCCTC CAGTCCGGAA CGGTCCGTCG CATCGTGGTT
GGATATCGCG TCGGCTCCGG TGGCCACACG CTGATCGAGC CGGGCAAAGA CGATCTGCTC
ACTCGCGAAC GCCTCGAGGA TCGGGCGATC ACTCTTCTTG CAGGTCGTTC CGCGGAGCGT
GTCGTCCTCG GGAGCGAATC CGCCGGCGCG ACCGGGGATC TGGAAAGCGT GACCCAGATC
GTAGCATCCA TGCATGCGAG CGCCGGCATG GGCGACACGA TCGCCTTTCT CGCGCCGGCG
GCTTCGGCGC TCGACGCGCT CCGCATGGAT CCCGGGCTGC AGGCCCGCGT CGAATACGAT
CTCCAGCTTC TGCAGCGACG CGCGGACGAG ATCATCCGTC GTCAGCGCGC AGCCCTGATC
GCAATCGCGG TCGCGCTGCG CGACCACCGT CACCTCTCGG GGGAGGCCGC CCGCGAGATA
TTCCTCAGGC ATGCCCTCCC GGCCGTCGCA ACCCCGTCCA AGTAG
 
Protein sequence
MKKKNHKRKS ADRPKNRMPA QWAAKGVPVT MKVDEATRTI TQAKSEDRFD LPDEMDESAP 
NAEKPKAPQA RPRTSAMTAI VGSLFDASIP RDVRRRLNHA KALAVVVLVP GPAWVVPAEI
YFRDVFGTRW IRHARDGAAK SSLKSSGDSD EAARDLARGR CVVGFAADAS HLPSTLVTAA
DITIRLSAPT GPVLRTAITR FARRSPGDLA EGHAVGLDLS DIVAAFRPGS GAERIAQRLA
NSVSIVNGTG GPGESIPDLS SAVEYGAAQK FGLGLARDVR DYRLGIISWS DLPRGIVLHG
EVGTGKSLYA RMLASACALP LVSTSVADWF TGPRGGYLHD VIAQMRGAFD RATSLAGAKG
CCLLFMDELD SIPNRLTLDS RNSDYWKPLC NDLLTRLDNS MSDTRRGIIV VSATNNLSAI
DPALMRPGRL ELAIEIERPD FAGTLNILRH HAPDLPHADL SVIASLAERS TGAEIMYLVR
EARRRARHAG RALTLEDLKS TLMPKIEVGA DGFWRICLHE AGHATAALCL QSGTVRRIVV
GYRVGSGGHT LIEPGKDDLL TRERLEDRAI TLLAGRSAER VVLGSESAGA TGDLESVTQI
VASMHASAGM GDTIAFLAPA ASALDALRMD PGLQARVEYD LQLLQRRADE IIRRQRAALI
AIAVALRDHR HLSGEAAREI FLRHALPAVA TPSK