Gene RPB_1958 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_1958 
Symbol 
ID3908037 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp2224451 
End bp2225467 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content70% 
IMG OID637883852 
Productglycine oxidase ThiO 
Protein accessionYP_485577 
Protein GI86749081 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0665] Glycine/D-amino acid oxidases (deaminating) 
TIGRFAM ID[TIGR02352] glycine oxidase ThiO 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0655991 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCGCGGCA CGCCGCCGGA CCTGCGCAAG GACGCGCCGG TCTCGGTGAT CGGCGCCGGC 
ATCGCCGGGG CCTGGCAGGC GCTGCTGCTG GCGCGCGCCG GCCGCAACGT CACGCTGTAC
GAGCGCGGCG ACCGCGAAAT GACCCAGGCC ACCAGCCATT GGGCCGGCGG CATGCTGGCG
CCGTATTGCG AGGCCGAGAC CGCCGAACCG ATGGTCGGGC TGATGGGCCT GCGCTCGCTG
GAGATGTGGC GCAAGGAATT CCCCGAGACC GCGTTCAACG GCTCGCTGGT GGTGGCGCAT
GCGCGCGACC GCGCCGATTT CGAGCGCTTC GCCAAGATGA CCGCGGGCCA CAAGCGGCTC
GACGCCGACG GCGTCGCCGA GCTGGAACCG GCGCTGGCCG GCCGCTTCCG CGAGGGCCTG
TACTTCCCCG ACGAGGGCCA TGTCGAGCCG CGGCTGGTGC TGGCGCGGCT GCACGAGCGG
CTGGTCGAGG CCGGCGGCGC GATCCATTTC GAATCGGAGA TGACGCCGGA GGAACTCGAC
GGCCTGGTGA TCGATTGCCG CGGCCTCGCC GCGCGCGACA AGGCGCCGGA ACTGCGCGGC
GTCAAAGGCG AGATGGTGGT GATCAAGACC ACCGAGGTGA CGCTGTCGCG GCCGGTCCGC
TTGATGCATC CGCGCTGGCC GCTCTACGTC ATCCCGCGCG AAGACAATCA CTTCATGCTG
GGCGCCACCT CGATCGAGAG CGAGGACGAA CTCGTCACCG TGCGCTCGGC GCTGGAACTG
CTCAGCGCCG CCTATGCGGT GCATCCGGCG TTCGGCGAGG CGCATATCGT CGAGATCGGC
GCCGGCCTCC GCCCGGCCTT CCCCGACAAT CTGCCGCGCA TTTCCATCGG CAACCGCCGC
ATCGCCACCA ACGGCCTGTA CCGCCACGGC TTCCTGCTGG CGCCGGCGCT GGCCGAGAAG
ATGCTGGCCT ATGTCGAGCG CGGCGTCGTC GACAATCAGG TGATGCGATG CTTGTGA
 
Protein sequence
MRGTPPDLRK DAPVSVIGAG IAGAWQALLL ARAGRNVTLY ERGDREMTQA TSHWAGGMLA 
PYCEAETAEP MVGLMGLRSL EMWRKEFPET AFNGSLVVAH ARDRADFERF AKMTAGHKRL
DADGVAELEP ALAGRFREGL YFPDEGHVEP RLVLARLHER LVEAGGAIHF ESEMTPEELD
GLVIDCRGLA ARDKAPELRG VKGEMVVIKT TEVTLSRPVR LMHPRWPLYV IPREDNHFML
GATSIESEDE LVTVRSALEL LSAAYAVHPA FGEAHIVEIG AGLRPAFPDN LPRISIGNRR
IATNGLYRHG FLLAPALAEK MLAYVERGVV DNQVMRCL