Gene RPD_1973 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_1973 
Symbol 
ID4022455 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp2211750 
End bp2213102 
Gene Length1353 bp 
Protein Length450 aa 
Translation table11 
GC content56% 
IMG OID637962166 
ProductATPase 
Protein accessionYP_569109 
Protein GI91976450 
COG category[V] Defense mechanisms 
COG ID[COG1401] GTPase subunit of restriction endonuclease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0978731 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.372719 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCAGTG ATAATAATAA GGGTGTCGAG ACCGATTGGA CTCAGGGAAT CCGAGCTCTT 
GCGCGGTGTA GCAAAAAGAC GGAGCTCGCG AAAAAGAAGT TCGAAATCGA GCTTGATGAT
GTCTTCATCC TTCCTAGGTC AATCGAAAAG AGCGAGTTGC TGACCTTTTC GCCGGCGACT
TACGCACGGG ACCTTCAGGA AGCGGCAATT GTTGGCCTTA ACAATTATGC ATCGCAAGTC
GTTCAGCTTC TGTCCGACAA GGAGTTCGCA ACTCTTGGCG CCGTCATCGC CGAACTCTTG
CCCGACGTGC AAGAGGATAT ATTAAAGCGT ATTCCGGGTG CTGATTCGGT CGAACTTCTT
GTTGCGGTGG CCAAGCCCGC AATTTCACAC AACCCAGCAC CGCTTCTTGT TTCCCAGATC
GATGACGAAG ATCCTATCCT CGCGCAAGTG CGCCAGTTAG TCGAAATTGA CGGCTGGGGT
GGTGTGCTTC TCACGGGTGC ACCCGGCACA GGTAAGTCGT TTTATGCACG CGAGATCGCG
ATCAAACTGA CGGGTGGTGA CCGTCGGCGG ATCCGAGAAG TGCAATTCCA CCCCTCTTAC
CAGTACGAAG ACTTCGTCGA AGGTTACGTC CCTGACGGCA AGCAGGGCTT CCGCCTGGCC
GATAAGCACA TGCTGGAGAT GGCAGAGATT GCCAAAAGCG AGACCGCCCC CGTGGTGCTG
GTGATCGACG AATTCAGCCG CACCGATCCG GCGCGCGTGC TTGGTGAAGT AATGACGTAT
ATGGAGGGTT CGCTCCGTGA CAAGGATTTC TACCTGCCAT CTGGACGCCG CGTGCGTATT
CCACGGAATC TGATCTTCAT TGCGACGATG AATCCGGAGG ATCGCTCGGT TGACGAGATT
GACGCGGCCA TGGAGCGGCG CTGGGCTAAG GTTGGTCTGA AACCTGACGT CAGGAAGCTG
CGCGACTTTC TTGTTAATAA TAGAGCGGAT GAGCTCATGA TGGGACCAAC CGTCGACCTT
TTTCTAGGCC TTCAGAAGCA CATGGAAATC GGACATGCCT TCTTCCGTAC GGTCAAGGAT
CCGGCGGGAC TCTCACGGCT TTGGAACAAC CAGCTTTCCT ACGTCTTTAG AAAGCGATTC
CGTTTCGACA CTGAAACACT TGGTGCCGTC GATGCAATAT GGGGTACGTG CGAGGCCGCT
CTCATCGCAG CGACTCCTGC GGCGGATTTG GCGGCTCCGC AGGTTGGAAC GGTCCCTGGA
GCCGGCGCGG CAGCACCCGT CGAGTCATCC GCACCTTCGC CGGCGCAAGA ACCAATAGCT
TCAACTGAAC AGACGCCTGG AGCGGGCGCT TGA
 
Protein sequence
MGSDNNKGVE TDWTQGIRAL ARCSKKTELA KKKFEIELDD VFILPRSIEK SELLTFSPAT 
YARDLQEAAI VGLNNYASQV VQLLSDKEFA TLGAVIAELL PDVQEDILKR IPGADSVELL
VAVAKPAISH NPAPLLVSQI DDEDPILAQV RQLVEIDGWG GVLLTGAPGT GKSFYAREIA
IKLTGGDRRR IREVQFHPSY QYEDFVEGYV PDGKQGFRLA DKHMLEMAEI AKSETAPVVL
VIDEFSRTDP ARVLGEVMTY MEGSLRDKDF YLPSGRRVRI PRNLIFIATM NPEDRSVDEI
DAAMERRWAK VGLKPDVRKL RDFLVNNRAD ELMMGPTVDL FLGLQKHMEI GHAFFRTVKD
PAGLSRLWNN QLSYVFRKRF RFDTETLGAV DAIWGTCEAA LIAATPAADL AAPQVGTVPG
AGAAAPVESS APSPAQEPIA STEQTPGAGA