Gene Information |
Locus tag | RPC_2022 |
Symbol | |
ID | 3973922 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | + |
Start bp | 2205177 |
End bp | 2206337 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637925131 |
Product | UBA/THIF-type NAD/FAD binding fold |
Protein accession | YP_531896 |
Protein GI | 90423526 |
COG category | [H] Coenzyme transport and metabolism; [P] Inorganic ion transport and metabolism |
COG ID | [COG0476] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2; [COG0607] Rhodanese-related sulfurtransferase |
TIGRFAM ID | |
|
|
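The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 2205177
end_bp = 2206337
gene_length_bp = 1161
protein_length_aa = 386

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A CDS is read in whole codons; the last codon is assumed to be the stop codon.
codons, remainder = divmod(span, 3)

print(span == gene_length_bp)           # span agrees with the Gene Length field
print(remainder == 0)                   # length is a whole number of codons
print(codons - 1 == protein_length_aa)  # codons minus stop agrees with Protein Length
```

Under these assumptions the three fields are mutually consistent.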
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.236297 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTTGCAA CCTCGGGTGG GAGCGATGGC GGGCTCAGCA ACGAGGAGGT GAGGCGCTAT GCTCGCCACA TCACGCTGCC AGGGGTCGGC CGCGAAGGCC AGGCGAAATT GAAGAACGCT AAGGTGCTGA TTATTGGCAC CGGCGGGTTA GGTTCGCCGA TCAGCCTCTA CCTCGCCGCG GCAGGCGTCG GCGTGATCGG ACTGGTTGAT TTCGATGTCG TGGAGATGAG CAACTTGCAG CGGCAGGTCG TGCACGGCAC AAACACCATC GGGATGCCGA AAGTCAATTC GGCCAAGGCG CGGCTAAACG AGCTCAACCC TGCGATCACG GTCGAGACTT ACGACACAGC CTTCAGCGTT GAAAATGCCC TCGACCTGGT CGGCCGATAT GACGTCGTGG TCGACGGAAG CGACAATTTC AACTGTCGCT ATATCGTCAA CGATGCCTGC ACGATCCTGA AGAGGCCGCT GGTCTATGGC GCGATCTATC GGTTCGAGGG CCAGGTTAGC GTATTCAACC ACGACGGCGG GCCGTGTTAC CGTTGTCTTT TCCCGCAACG CCCGCCCGCC GAATTGTCGC CGAGCTGCAA TGCCGGTGGC GTCTTCGGCG TGCTGCCGGG GGTGATCGGG GCGATCCAAG CGACGGAGGC GGTCAAGCTG ATCTTGGGGC TTGGCCATTC GCTCTCGGGT CGGCTGGTGC GCTACGACGC CCTGGAAATG AAGTTTGACG AGATTCGGTT TTCCAACAGG GCGAACTGCC CGGATTGCGG CAGCCGGCGC AGTCAGATGC ATCCGCCGGA TCGATCCGTG GATAGCATGC TCGCGGCGCC TCGTGCGGCC GAACTGCCGC AGGCAATGTT CATCTCGCCG ACAGAGCTGG CCGAGAACCT CGATCGATAT GTGCTGCTCG ATGTGCGCGA TCCGAACGAA CTCGAGATCT GTGCTATCCC GGGGTCCCTG AACGTCCCGC TGGCCGATTT GGTGAGCCGC TTCGACGAAC TGCCGCGCGA TCGCGCGCAT TGCATCATCT GTCATTCCGG AGCGCGGGCA AAGTCGGCCG CGGCGAGGTT TCTCGATGCC GGAGTTTACG ATTTCCGCAT CCTGGAAGGC GGCATCAAGC GTTGGGTGAG GGACGTCGAA CCGACGATGC CGATCTACTG A
|
Protein sequence | MLATSGGSDG GLSNEEVRRY ARHITLPGVG REGQAKLKNA KVLIIGTGGL GSPISLYLAA AGVGVIGLVD FDVVEMSNLQ RQVVHGTNTI GMPKVNSAKA RLNELNPAIT VETYDTAFSV ENALDLVGRY DVVVDGSDNF NCRYIVNDAC TILKRPLVYG AIYRFEGQVS VFNHDGGPCY RCLFPQRPPA ELSPSCNAGG VFGVLPGVIG AIQATEAVKL ILGLGHSLSG RLVRYDALEM KFDEIRFSNR ANCPDCGSRR SQMHPPDRSV DSMLAAPRAA ELPQAMFISP TELAENLDRY VLLDVRDPNE LEICAIPGSL NVPLADLVSR FDELPRDRAH CIICHSGARA KSAAARFLDA GVYDFRILEG GIKRWVRDVE PTMPIY
|
| |
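The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. It assumes the space-separated 10-character groups are display formatting only, and it uses the codon-to-amino-acid assignments of NCBI translation table 11, which match the standard code for elongation codons. The helper names `clean` and `translate` are hypothetical, and only the first and last bases of the gene are used here for brevity.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; first base varies slowest).
# Translation table 11 shares these assignments with the standard code; it
# differs in which codons may act as start codons. This gene starts with ATG.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the display spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate whole codons from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: not part of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 21 bases of the gene sequence in the record above.
gene_head = "ATGCTTGCAA CCTCGGGTGG GAGCGATGGC"
gene_tail = "CCGACGATGC CGATCTACTG A"

print(translate(gene_head))  # compare with the start of the protein sequence
print(translate(gene_tail))  # compare with the end of the protein sequence
```

Passing the full gene sequence to `translate` and comparing the result with `clean` applied to the protein sequence would extend the same check to the whole record.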