Gene RPD_3594 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_3594 
Symbol 
ID4024108 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp4006270 
End bp4007895 
Gene Length1626 bp 
Protein Length541 aa 
Translation table11 
GC content69% 
IMG OID637963798 
Producthypothetical protein 
Protein accessionYP_570718 
Protein GI91978059 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.234761 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.928094 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCATCA TGACCGGCGG CGAAGCGATC GTACAAGGCC TCGTCGCGCA CGGCGTCGAC 
ACCGTGTTCG GCCTGCCCGG CGCGCAGATC TACGGCCTGT TCGACGGCTT CGCCAAGGCG
CAGTTGCGGG TGATCGGGGC GCGGCACGAG CAGGCCTGCG GCTATATGGC GTTCGGCTAT
GCGCGCGCCT CGGGCCGCCC CGGCGTGTTC AGCGTCGTGC CCGGCCCCGG CGTGCTCAAT
GCGGGCGCTG CGATGCTCAC CGCGTTCGGC TGCAACGAGC CGGTGCTGTG TCTCACCGGG
CAGGTGCCGA GCGCTTATCT CGGCCGCGGC CGCGGCCATC TGCACGAGAT GCCGGACCAG
CTCGCGACGC TGCGCAGCTT CATCAAATGG GCGGAGCGGA TCGAATATCC CGGCAGCGCG
CCGGCGCTGG TGGCGCGCGC GTTTCAGGAA ATGATGTCCG GCCGGCGCGG CCCGGTGGCG
CTGGAAATGC CCTGGGACGT GTTCACGCAA CGCGCCGAGA CCGCGGCCGC GATCAAGCTC
GATCCGGTCG CGCCGCCGCT GCCCGATCCC GACCGGATCG ACGCGGCGGC CAGGCTGATC
GCCGCGAGCA GGACGCCGAT GATCTTCGTC GGCTCCGGCG CGCTCGACGC CGGCGACGAG
ATTCTCGAAC TCGCCGAGGC GATCGACGCG CCGGTCGTGG CGTTCCGCTC CGGCCGCGGC
ATCGTCAGCA ACGCGCATGA GCTGGGCCTC ACCTTCGCCG CCGCCTATCA GCTCTGGCCG
CAGACCGATC TGATCATCGG CATCGGCACG CGGATGGAGT TGCCGACCAC GTTCCGCTGG
CCGTTCCGCC CGGCGGGGCA GACCTCGGTG CGGATCGACA TCGATCCCGC CGAGATGCGC
CGTTTCTCGC CGGACGCAGC CGTCGTCGCC GATGCGAAAG CCGGCGCGCG CGCGCTGGTC
GACGCGGTGA GCAAGCGCGG CTACAGCAAG ACCCAGGGCC GCCGCGACAC CATCCGCGAC
GCGACTGCGC GCACGCTCGA ACAGATCCAG TCGGTGCAGC CACAGATGGC GTATCTGAAG
ATCCTGCGTG AGGTGCTGCC GGACGACGCC ATCGTCACCG ACGAGCTCTC GCAGGTCGGC
TTCGCCTCCT GGTACGGCTT CCCGGTGTAT CAGCCGCGCA CCTTCCTCAC CTCGGGCTAT
CAGGGCACGC TCGGCTCCGG CTTCCCGACC GCGCTGGGCG CCAAGGTCGC CTTCCCTGAC
AGGCCCGTGG TGGCGATCAC CGGCGACGGC GGTTTCATGT TCGCGGTACA GGAGCTCGCC
ACCGCGGTGC AGTTCAACAT CGGCGTGGTC ACGCTGGTGT TCGACAATTC GGCCTACGGC
AACGTCCGGC GCGATCAGGT CACCCAGTTC GAGGGCCGCG TCGTGGCGTC CGATCTGGTC
AACCCGGATT TCGTCAAGCT CGCGGAATCG TTCGGCGTCG GCGCCGCGCG CGTCACCGCG
CCGGATCACT TTCGGCCCGC GCTGGAAAAA GCGCTGGCGC ATGGCGGTCC GTATTTGATC
GCGATCGACG TAGCGCGCGA CAGCGAGGCC AGCCCGTGGC CGTTCATCCA TCCGGCGAAA
CCGTAG
 
Protein sequence
MSIMTGGEAI VQGLVAHGVD TVFGLPGAQI YGLFDGFAKA QLRVIGARHE QACGYMAFGY 
ARASGRPGVF SVVPGPGVLN AGAAMLTAFG CNEPVLCLTG QVPSAYLGRG RGHLHEMPDQ
LATLRSFIKW AERIEYPGSA PALVARAFQE MMSGRRGPVA LEMPWDVFTQ RAETAAAIKL
DPVAPPLPDP DRIDAAARLI AASRTPMIFV GSGALDAGDE ILELAEAIDA PVVAFRSGRG
IVSNAHELGL TFAAAYQLWP QTDLIIGIGT RMELPTTFRW PFRPAGQTSV RIDIDPAEMR
RFSPDAAVVA DAKAGARALV DAVSKRGYSK TQGRRDTIRD ATARTLEQIQ SVQPQMAYLK
ILREVLPDDA IVTDELSQVG FASWYGFPVY QPRTFLTSGY QGTLGSGFPT ALGAKVAFPD
RPVVAITGDG GFMFAVQELA TAVQFNIGVV TLVFDNSAYG NVRRDQVTQF EGRVVASDLV
NPDFVKLAES FGVGAARVTA PDHFRPALEK ALAHGGPYLI AIDVARDSEA SPWPFIHPAK
P