Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPD_3594 |
Symbol | |
ID | 4024108 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | + |
Start bp | 4006270 |
End bp | 4007895 |
Gene Length | 1626 bp |
Protein Length | 541 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637963798 |
Product | hypothetical protein |
Protein accession | YP_570718 |
Protein GI | 91978059 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.234761 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.928094 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCATCA TGACCGGCGG CGAAGCGATC GTACAAGGCC TCGTCGCGCA CGGCGTCGAC ACCGTGTTCG GCCTGCCCGG CGCGCAGATC TACGGCCTGT TCGACGGCTT CGCCAAGGCG CAGTTGCGGG TGATCGGGGC GCGGCACGAG CAGGCCTGCG GCTATATGGC GTTCGGCTAT GCGCGCGCCT CGGGCCGCCC CGGCGTGTTC AGCGTCGTGC CCGGCCCCGG CGTGCTCAAT GCGGGCGCTG CGATGCTCAC CGCGTTCGGC TGCAACGAGC CGGTGCTGTG TCTCACCGGG CAGGTGCCGA GCGCTTATCT CGGCCGCGGC CGCGGCCATC TGCACGAGAT GCCGGACCAG CTCGCGACGC TGCGCAGCTT CATCAAATGG GCGGAGCGGA TCGAATATCC CGGCAGCGCG CCGGCGCTGG TGGCGCGCGC GTTTCAGGAA ATGATGTCCG GCCGGCGCGG CCCGGTGGCG CTGGAAATGC CCTGGGACGT GTTCACGCAA CGCGCCGAGA CCGCGGCCGC GATCAAGCTC GATCCGGTCG CGCCGCCGCT GCCCGATCCC GACCGGATCG ACGCGGCGGC CAGGCTGATC GCCGCGAGCA GGACGCCGAT GATCTTCGTC GGCTCCGGCG CGCTCGACGC CGGCGACGAG ATTCTCGAAC TCGCCGAGGC GATCGACGCG CCGGTCGTGG CGTTCCGCTC CGGCCGCGGC ATCGTCAGCA ACGCGCATGA GCTGGGCCTC ACCTTCGCCG CCGCCTATCA GCTCTGGCCG CAGACCGATC TGATCATCGG CATCGGCACG CGGATGGAGT TGCCGACCAC GTTCCGCTGG CCGTTCCGCC CGGCGGGGCA GACCTCGGTG CGGATCGACA TCGATCCCGC CGAGATGCGC CGTTTCTCGC CGGACGCAGC CGTCGTCGCC GATGCGAAAG CCGGCGCGCG CGCGCTGGTC GACGCGGTGA GCAAGCGCGG CTACAGCAAG ACCCAGGGCC GCCGCGACAC CATCCGCGAC GCGACTGCGC GCACGCTCGA ACAGATCCAG TCGGTGCAGC CACAGATGGC GTATCTGAAG ATCCTGCGTG AGGTGCTGCC GGACGACGCC ATCGTCACCG ACGAGCTCTC GCAGGTCGGC TTCGCCTCCT GGTACGGCTT CCCGGTGTAT CAGCCGCGCA CCTTCCTCAC CTCGGGCTAT CAGGGCACGC TCGGCTCCGG CTTCCCGACC GCGCTGGGCG CCAAGGTCGC CTTCCCTGAC AGGCCCGTGG TGGCGATCAC CGGCGACGGC GGTTTCATGT TCGCGGTACA GGAGCTCGCC ACCGCGGTGC AGTTCAACAT CGGCGTGGTC ACGCTGGTGT TCGACAATTC GGCCTACGGC AACGTCCGGC GCGATCAGGT CACCCAGTTC GAGGGCCGCG TCGTGGCGTC CGATCTGGTC AACCCGGATT TCGTCAAGCT CGCGGAATCG TTCGGCGTCG GCGCCGCGCG CGTCACCGCG CCGGATCACT TTCGGCCCGC GCTGGAAAAA GCGCTGGCGC ATGGCGGTCC GTATTTGATC GCGATCGACG TAGCGCGCGA CAGCGAGGCC AGCCCGTGGC CGTTCATCCA TCCGGCGAAA CCGTAG
|
Protein sequence | MSIMTGGEAI VQGLVAHGVD TVFGLPGAQI YGLFDGFAKA QLRVIGARHE QACGYMAFGY ARASGRPGVF SVVPGPGVLN AGAAMLTAFG CNEPVLCLTG QVPSAYLGRG RGHLHEMPDQ LATLRSFIKW AERIEYPGSA PALVARAFQE MMSGRRGPVA LEMPWDVFTQ RAETAAAIKL DPVAPPLPDP DRIDAAARLI AASRTPMIFV GSGALDAGDE ILELAEAIDA PVVAFRSGRG IVSNAHELGL TFAAAYQLWP QTDLIIGIGT RMELPTTFRW PFRPAGQTSV RIDIDPAEMR RFSPDAAVVA DAKAGARALV DAVSKRGYSK TQGRRDTIRD ATARTLEQIQ SVQPQMAYLK ILREVLPDDA IVTDELSQVG FASWYGFPVY QPRTFLTSGY QGTLGSGFPT ALGAKVAFPD RPVVAITGDG GFMFAVQELA TAVQFNIGVV TLVFDNSAYG NVRRDQVTQF EGRVVASDLV NPDFVKLAES FGVGAARVTA PDHFRPALEK ALAHGGPYLI AIDVARDSEA SPWPFIHPAK P
|
| |