Gene RPC_2166 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_2166 
Symbol 
ID3971987 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp2358072 
End bp2359112 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content66% 
IMG OID637925274 
Productglycine oxidase ThiO 
Protein accessionYP_532039 
Protein GI90423669 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0665] Glycine/D-amino acid oxidases (deaminating) 
TIGRFAM ID[TIGR02352] glycine oxidase ThiO 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.402403 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTACCAGA CCACGCAGTC TAAACGCGGG CCGTTGCCCG TTGATCTCCG CACAAATGCG 
CCGATTTCGG TGATCGGCGC CGGCATTGCC GGGGCCTGGC AGGCCTTGAT GCTGGCCCGC
GCCGGACGCG ACGTGACGCT CTACGAGAGC GGCGATTCCG AAATGACCCA GGCCACCAGC
CATTGGGCCG GCGGCATGCT GGCGCCGTGG TGCGAGGCCG AATCGGCCGA GCCGGTGATC
AGCCGGATCG GCATGCGCTC GCTCGATATC TGGCGCGAGG AATTCCCCGA GACGCCGTTC
AACGGCTCCT TGGTGGTGTC GCATCCGCGC GACCGCGCCG ACTACGAGCG CTTCGCCAAA
TTGACCACCG GGCATCAGCG GCTCGACGCC AAGGGCGTCG CCGAACTGGA GCCGGCGCTG
GAAGGCCGCT TCCGCGAAGG CCTGTTCTTC GCCGACGAAG GCCATGTCGA GCCGCGCGTG
GTGCTCGCCA AATTGCACGA ACGGCTGATC GAGGCCGGCG GAACTATTCA CTTCATGTCG
GCGCAGAATC CCGACGAGCT CGACGGCGTG GTGATCGATT GCCGCGGACT GTCCGCGCGC
GATGCCGCCC CCGAACTGCG CGGCGTCAAG GGCGAGATGA TCGTTATCGA GTCCAAGGAC
GTGCAATTGT CGCGCCCGGT GCGGCTGATG CATCCGCGCT GGCCGGTCTA TGTGATTCCG
CGCCCCGACA ACGTGTTCAT GGTCGGCGCC ACCACCATCG AGAGCGAGGA CGAGGGCGTC
AGCGTCCGCT CGGCGCTGGA ACTGTTGACC GCGGCCTACG CGCTGCATCC GGCGTTCGGT
GAGGCGCGGA TTCTGGAATT CGGTTCCGGT CTGCGCCCGG CGTTCCCGGA CAATCTGCCG
CGGATCTCGC TCGGCAACGG CCGCATCGCG GTCAATGGCC TGTATCGCCA CGGCTTCCTG
CTGTCGCCGG CGCTCGCCGA GATGACGCTG GCCTATGTGC AGCGCGGGGT CATCAACAAC
GAGGTGATGC AATGCGTGTG A
 
Protein sequence
MYQTTQSKRG PLPVDLRTNA PISVIGAGIA GAWQALMLAR AGRDVTLYES GDSEMTQATS 
HWAGGMLAPW CEAESAEPVI SRIGMRSLDI WREEFPETPF NGSLVVSHPR DRADYERFAK
LTTGHQRLDA KGVAELEPAL EGRFREGLFF ADEGHVEPRV VLAKLHERLI EAGGTIHFMS
AQNPDELDGV VIDCRGLSAR DAAPELRGVK GEMIVIESKD VQLSRPVRLM HPRWPVYVIP
RPDNVFMVGA TTIESEDEGV SVRSALELLT AAYALHPAFG EARILEFGSG LRPAFPDNLP
RISLGNGRIA VNGLYRHGFL LSPALAEMTL AYVQRGVINN EVMQCV