Gene RPD_1224 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_1224 
Symbol 
ID4021700 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp1384273 
End bp1387356 
Gene Length3084 bp 
Protein Length1027 aa 
Translation table11 
GC content66% 
IMG OID637961416 
Productpyruvate phosphate dikinase 
Protein accessionYP_568363 
Protein GI91975704 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0574] Phosphoenolpyruvate synthase/pyruvate phosphate dikinase 
TIGRFAM ID[TIGR01828] pyruvate, phosphate dikinase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.516679 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCAAAG CCGCTTCCAA GACCAAGAAG ACCCTCAAAT CTTCAGCCTT AAAGTCTTCG 
GCATCCAAGT CTTCGGCATC TAAGTCTTCG GCACCAAAAT CTTCGGCGTC CAAGTCTCAG
GCGTCCAAGT CTTCGGCGTC CAAGTCTTCG GCGCCGAAGT CCTCGGCGCC TGCGCGTGGC
AAGACCGCCG CGCCGAGCGC CCGCAAGGCG CTGACGAAGA CTCCGCCGAA GCAGGCTCCG
AAGGCCGCCA AGAAGGCTGT TAGCAAGCCG GTCGCCAAGT CGGTGGCAAA GCCGTCCGCG
AAGAGCGTCG CCAAGAGCAA GCCGGTGGCG AAGGCCGCCA GCAGCAAGAC CGCGGCGAAG
GCCGCCAGCA GCAAGACCGC GGCGAAGGCA GTCAGCGCGC CGAAGGCCGG CAAGTGGGTC
TACACCTTCG GCGACGGCAA GGCCGATGGC AAGGCGAGTA TGCGCGACCT GCTCGGCGGC
AAGGGCGCCA ACCTCGCCGA GATGGCCAAT CTCGGCCTGC CGGTGCCCCC GGGCTTCACC
ATTCCGACCT CGGTCTGCAC CTACTTCTAC GCCAACGACA AAACCTATCC GAAGGAGCTG
AAGGCGCAGG TCGAGCGTTC GCTGGAATAT GTCGGCAAGC TGACCGGCAA GCGCTTCGGC
GATCCGAAGA ATCCGTTGCT GGTTTCGGTC CGATCCGGCG GCCGTGCGTC GATGCCGGGC
ATGATGGACA CCGTGCTCAA CCTCGGGCTC AACGACGAGA CCGTCGAGGC CGTGGCGCTG
ATGTCCGGCG ATCGCCGCTT CGCCTATGAC AGCTATCGCC GCTTCATCAC GATGTATTCC
GACGTGGTGC TCGGGCTCGA GCATCATCAT TTCGAGGAAA TCCTCGATAG CTACAAGGAC
AGCAAGGGCT ACACGCTCGA CACCGACCTC ACTGGCGACG ACTGGGTCCT GCTGGTCGGC
CAGTACAAGG ACGCCGTCGC GCGCGAGATC GGTCAGGACT TCCCGCAGGA TCCGAACGAT
CAGCTTTGGG GTGCGATCGG CGCGGTGTTC TCGTCCTGGA TGAATGCGCG CGCGGTGACC
TATCGCCGTC TGCACGACAT TCCGGAATCC TGGGGCACCG CCGTCAACGT GCAGGCGATG
GTGTTCGGCA ACATGGGCGA GACCTCGGCC ACCGGCGTCG CCTTCACCCG CAATCCGTCG
ACCGGCGAGA GCCGGCTGTA CGGCGAGTTC CTGATCAATG CGCAAGGCGA GGACGTCGTC
GCCGGCATCC GCACCCCGCA GGACATCACC GAGTACGCGC GCCTCGAATC CGGCTCCGAC
AAGCCGTCGA TGGAAGTCGC GATGCCCGAC GCCTTCCAGG AACTGACGCG GATCTACACC
CTGCTGGAAA AGCACTACCG CGACATGCAG GACATGGAAT TCACCGTCGA GCAGGGCAAG
CTGTGGATGC TGCAGACCCG CGGCGGCAAG CGCACCGCCA AGGCGGCGTT GCGGATCGCG
GTCGAACTCG CGCATGAAGG CCTGATCAGC AAGATCGACG CTGTGGCCCG TATCGAGCCG
AGTTCGCTCG ACCAGCTGCT GCATCCGACG ATCGATCCGC ACGCCAAGCG CGACGTCATC
GCGACCGGCC TGCCGGCGTC GCCGGGCGCC GCTTCGGGCG AGATCGTGTT TTCCTCGGAC
GAAGCCGCCA AGCTCCAGGC CGACGGACGT AAGGTCATTC TGGTCCGGAT CGAGACCAGC
CCCGAAGACA TTCACGGGAT GCACGCCTCT GAAGGCATTC TCACCACCCG CGGCGGCATG
ACCTCGCACG CCGCGGTGGT CGCGCGCGGC ATGGGCAAGC CCTGCGTGTC CGGCAGCGGC
GCGATCCGCG TCGATTATGG CCGCGGCACG ATGACGATCG GCGCGCGCAC CTTCAAGACC
GGCGACATCA TCACCATCGA CGGCTCGATC GGCCAGGTGC TGGCCGGCCG GATGCCGATG
ATCGAGCCCG AATTGTCGGG CGATTTCACC ACGCTGATGG GCTGGGCCGA TCAGGTCCGC
AAGCTCAAGG TCCGCGTCAA CGCCGACACC CCGGTCGATG CGCGCACCGC GATCAAGTTC
GGCGCCGAAG GCATCGGTCT GTGCCGCACC GAGCACATGT TCTTCGAGGA GACCCGCATC
CGCACCGTAC GCGAGATGAT CCTCGCCGAG GACGAGCAGT CGCGCCGCGC GGCGCTGGCC
AAGCTGCTGC CGATGCAGCG CGCGGACTTC GTCGAGCTGT TCGAGATCAT GAAGGGCCTG
CCGGTCACGA TCCGGCTGCT CGATCCGCCG CTGCACGAGT TCCTGCCGCA CAGCCACGCC
GAAGTCGAGG AAGTCGCGCG TGCGATGAAC GCCGATCCGC GACGGCTCGC CGATCGCGCC
CGCGATCTCG CCGAGTTCAA TCCGATGCTC GGCTTCCGCG GTTGCCGTCT GGCGATCGCC
TATCCGGAGA TCGCCGAGAT GCAGGCCCGC GCCATCTTCG AGGCCGCGGT CGAGGCCGAG
AAGCGCACCG GCGAAGCGGT CGGGCTCGAA GTGATGGTGC CGCTGATCGC GACCCGTGCG
GAGTTCGACC TGGTCAAGGC CCGGATCGAC GCCACCGCGC ATGCGGTGTC GCGTGAGACC
GGAGCCCAAC TCAAGTACCA GGTCGGCACC ATGATCGAGC TGCCGCGCGC CTGCCTGATG
GCTGGCGACG TCGCCGAGAC CGCCGAGTTC TTCTCGTTCG GCACCAACGA CCTGACCCAG
ACCACCTACG GCATCAGCCG TGACGACGCC GCGAGCTTCC TCGGCACCTA TATCGAGAAG
GGCATCTTCG CGGTCGATCC GTTCGTGTCG GTGGATCGCG ACGGCGTCGG CGAACTGGTG
AAGATCGGCG TCGAGCGCGG CCGCAAGACC CGGCCGGACC TCAAGGTCGG CATCTGCGGC
GAGCACGGCG GCGATCCCGC CTCGGTCGCC TTCTGCCACG AGGTCGGTCT CAACTACGTG
TCGTGCTCGC CCTATCGCGT GCCGATCGCG CGGCTGGCGG CGGCGCAGGC CGCGCTCGGC
AAGGCGGCTG CCGGCCAGGC CTGA
 
Protein sequence
MAKAASKTKK TLKSSALKSS ASKSSASKSS APKSSASKSQ ASKSSASKSS APKSSAPARG 
KTAAPSARKA LTKTPPKQAP KAAKKAVSKP VAKSVAKPSA KSVAKSKPVA KAASSKTAAK
AASSKTAAKA VSAPKAGKWV YTFGDGKADG KASMRDLLGG KGANLAEMAN LGLPVPPGFT
IPTSVCTYFY ANDKTYPKEL KAQVERSLEY VGKLTGKRFG DPKNPLLVSV RSGGRASMPG
MMDTVLNLGL NDETVEAVAL MSGDRRFAYD SYRRFITMYS DVVLGLEHHH FEEILDSYKD
SKGYTLDTDL TGDDWVLLVG QYKDAVAREI GQDFPQDPND QLWGAIGAVF SSWMNARAVT
YRRLHDIPES WGTAVNVQAM VFGNMGETSA TGVAFTRNPS TGESRLYGEF LINAQGEDVV
AGIRTPQDIT EYARLESGSD KPSMEVAMPD AFQELTRIYT LLEKHYRDMQ DMEFTVEQGK
LWMLQTRGGK RTAKAALRIA VELAHEGLIS KIDAVARIEP SSLDQLLHPT IDPHAKRDVI
ATGLPASPGA ASGEIVFSSD EAAKLQADGR KVILVRIETS PEDIHGMHAS EGILTTRGGM
TSHAAVVARG MGKPCVSGSG AIRVDYGRGT MTIGARTFKT GDIITIDGSI GQVLAGRMPM
IEPELSGDFT TLMGWADQVR KLKVRVNADT PVDARTAIKF GAEGIGLCRT EHMFFEETRI
RTVREMILAE DEQSRRAALA KLLPMQRADF VELFEIMKGL PVTIRLLDPP LHEFLPHSHA
EVEEVARAMN ADPRRLADRA RDLAEFNPML GFRGCRLAIA YPEIAEMQAR AIFEAAVEAE
KRTGEAVGLE VMVPLIATRA EFDLVKARID ATAHAVSRET GAQLKYQVGT MIELPRACLM
AGDVAETAEF FSFGTNDLTQ TTYGISRDDA ASFLGTYIEK GIFAVDPFVS VDRDGVGELV
KIGVERGRKT RPDLKVGICG EHGGDPASVA FCHEVGLNYV SCSPYRVPIA RLAAAQAALG
KAAAGQA