Gene RPC_2022 details


Gene Information

Locus tag: RPC_2022
Symbol:
ID: 3973922
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB18
Kingdom: Bacteria
Replicon accession: NC_007925
Strand:
Start bp: 2205177
End bp: 2206337
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 61%
IMG OID: 637925131
Product: UBA/THIF-type NAD/FAD binding fold
Protein accession: YP_531896
Protein GI: 90423526
COG category: [H] Coenzyme transport and metabolism
              [P] Inorganic ion transport and metabolism
COG ID: [COG0476] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2
        [COG0607] Rhodanese-related sulfurtransferase
TIGRFAM ID:
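
The start, end, gene length and protein length fields above agree with each other under two assumptions that the record does not state explicitly: coordinates are 1-based and inclusive, and the gene length counts the stop codon. A minimal sketch of that check follows; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the stop codon


length = gene_length_bp(2205177, 2206337)
print(length)                              # 1161
print(expected_protein_length_aa(length))  # 386
```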


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.236297
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTTGCAA CCTCGGGTGG GAGCGATGGC GGGCTCAGCA ACGAGGAGGT GAGGCGCTAT 
GCTCGCCACA TCACGCTGCC AGGGGTCGGC CGCGAAGGCC AGGCGAAATT GAAGAACGCT
AAGGTGCTGA TTATTGGCAC CGGCGGGTTA GGTTCGCCGA TCAGCCTCTA CCTCGCCGCG
GCAGGCGTCG GCGTGATCGG ACTGGTTGAT TTCGATGTCG TGGAGATGAG CAACTTGCAG
CGGCAGGTCG TGCACGGCAC AAACACCATC GGGATGCCGA AAGTCAATTC GGCCAAGGCG
CGGCTAAACG AGCTCAACCC TGCGATCACG GTCGAGACTT ACGACACAGC CTTCAGCGTT
GAAAATGCCC TCGACCTGGT CGGCCGATAT GACGTCGTGG TCGACGGAAG CGACAATTTC
AACTGTCGCT ATATCGTCAA CGATGCCTGC ACGATCCTGA AGAGGCCGCT GGTCTATGGC
GCGATCTATC GGTTCGAGGG CCAGGTTAGC GTATTCAACC ACGACGGCGG GCCGTGTTAC
CGTTGTCTTT TCCCGCAACG CCCGCCCGCC GAATTGTCGC CGAGCTGCAA TGCCGGTGGC
GTCTTCGGCG TGCTGCCGGG GGTGATCGGG GCGATCCAAG CGACGGAGGC GGTCAAGCTG
ATCTTGGGGC TTGGCCATTC GCTCTCGGGT CGGCTGGTGC GCTACGACGC CCTGGAAATG
AAGTTTGACG AGATTCGGTT TTCCAACAGG GCGAACTGCC CGGATTGCGG CAGCCGGCGC
AGTCAGATGC ATCCGCCGGA TCGATCCGTG GATAGCATGC TCGCGGCGCC TCGTGCGGCC
GAACTGCCGC AGGCAATGTT CATCTCGCCG ACAGAGCTGG CCGAGAACCT CGATCGATAT
GTGCTGCTCG ATGTGCGCGA TCCGAACGAA CTCGAGATCT GTGCTATCCC GGGGTCCCTG
AACGTCCCGC TGGCCGATTT GGTGAGCCGC TTCGACGAAC TGCCGCGCGA TCGCGCGCAT
TGCATCATCT GTCATTCCGG AGCGCGGGCA AAGTCGGCCG CGGCGAGGTT TCTCGATGCC
GGAGTTTACG ATTTCCGCAT CCTGGAAGGC GGCATCAAGC GTTGGGTGAG GGACGTCGAA
CCGACGATGC CGATCTACTG A
 
Protein sequence
MLATSGGSDG GLSNEEVRRY ARHITLPGVG REGQAKLKNA KVLIIGTGGL GSPISLYLAA 
AGVGVIGLVD FDVVEMSNLQ RQVVHGTNTI GMPKVNSAKA RLNELNPAIT VETYDTAFSV
ENALDLVGRY DVVVDGSDNF NCRYIVNDAC TILKRPLVYG AIYRFEGQVS VFNHDGGPCY
RCLFPQRPPA ELSPSCNAGG VFGVLPGVIG AIQATEAVKL ILGLGHSLSG RLVRYDALEM
KFDEIRFSNR ANCPDCGSRR SQMHPPDRSV DSMLAAPRAA ELPQAMFISP TELAENLDRY
VLLDVRDPNE LEICAIPGSL NVPLADLVSR FDELPRDRAH CIICHSGARA KSAAARFLDA
GVYDFRILEG GIKRWVRDVE PTMPIY
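
The protein sequence can be reproduced from the gene sequence with translation table 11. The sketch below is a plain-Python illustration, not the tool used to generate this record. It assumes the usual compact T, C, A, G layout of the NCBI codon table, whose amino acid assignments for table 11 match the standard code (the two differ in permitted start codons, which does not matter here because the gene starts with ATG). The helper names `clean`, `gc_fraction` and `translate` are hypothetical. Whitespace is stripped first, because the sequences above are printed in blocks of 10.

```python
# Codon assignments for NCBI translation table 11, in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        amino_acid = CODON_TABLE[s[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last 21 bases of the gene sequence above (both in frame).
print(translate("ATGCTTGCAA CCTCGGGTGG G"))  # MLATSGG
print(translate("CCGACGATGC CGATCTACTG A"))  # PTMPIY
```

Passing the full gene sequence to `translate` and `gc_fraction` gives values to compare against the protein sequence and GC content fields listed in this record.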