Gene Information |
Locus tag | RPB_1143 |
Symbol | |
ID | 3909231 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 1313352 |
End bp | 1316315 |
Gene Length | 2964 bp |
Protein Length | 987 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637883037 |
Product | sarcosine oxidase alpha subunit family protein |
Protein accession | YP_484764 |
Protein GI | 86748268 |
COG category | [E] Amino acid transport and metabolism [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0404] Glycine cleavage system T protein (aminomethyltransferase) [COG0492] Thioredoxin reductase |
TIGRFAM ID | [TIGR01372] sarcosine oxidase, alpha subunit family, heterotetrameric form |
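The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only and are not part of the record.

```python
# Values copied from the record above; coordinates assumed 1-based, inclusive.
start_bp = 1313352
end_bp = 1316315

# Inclusive span: both endpoints count as bases of the gene.
gene_length = end_bp - start_bp + 1

# Each codon is three bases; the last codon is assumed to be the stop codon.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 2964
print(protein_length)  # 987
```

Both results agree with the Gene Length (2964 bp) and Protein Length (987 aa) fields.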
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.983501 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGCGG CTTCACGCAT CAGCGGCGGC CTGATTGATC GCGACAAGCC GCTGCGATTT TCCTTCGACG GCACGGCGAT GACCGGCTTC GCCGGCGACA CGCTGGCCTC GGCGCTGGTG GCCAACGGCA CCCGCCTGGT CGGCCGCTCG TTCAAATATC ATCGGCCGCG CGGCATTTTC TCCGCCGGCT CCGAAGAGCC GAACGCGCTG GTCGAATTGC GCAGCGGTGC GCGGCGTGAG CCCAACACCA AGGCGACCAC GGTCGAGCTC TATGACGGCC TCGAGGCGCA CAGCCAGAAC CGCTGGCCGT CGCTGGCGTT CGATTGGCGC GCGGTGCATC AGCTCGCGTC GCCGCTGATC GTCGCCGGCT TCTACTACAA GACCTTCATG TGGCCGGCCG CGTTTTGGGA AAAGCTCTAC GAGCCGCTGA TCCGCCGCTC CGCCGGGCTC GGCCGCCTCA GTGGCGAGCC CGATCCGGAC ACATATGAGA AAGCCACCGC GTTCTGCGAC CTGCTGATCA TCGGCGGCGG CCCCGCGGGC CTTGCCGCGG CGCTGGCGGC GGGGCGTGCC GGCGCGCGCG TGATCCTGGT CGAAGAAGAC TTTGCGCTCG GCGGCCGGCT GCTGTCCGAG CTTTGCGAGA TCGACGGACT GTCGGGGGCC GGGTGGGCAC AACTGGCGGA AGCCGAACTC GCAACCCTGA GTAATGTCCG GATCCTGCGC CGGTCCAGCG TGTTCGGCGT CTATGATGAT GAGTTCGGCG TGATCGAGCG CGTCGCCGAT CATCTGCCGG TGCCGCCGGC GTTTACGCCG CGGCAGCGGC TGTGGAAGAT CGTGGCGCGG GAGTCGTTGC TGGCGACCGG CGCGACCGAG CGGCCGATCG TGTTCGGCGG CAACGACCGG CCCGGCGTGA TGCTGGCCTC CGCTGTGCGG AGCTACGTCA ACCGCTTCGC CGCAGCGCCG GGACAGCGCG CGGTGGTGTT CACGACCAGC GACGACGGCT GGCGCAGCGC GGCTGATCTG TCGCGCGCCG GAATTGTAGT CGCTGCCGTA GTCGATCCGC GCCGCGAGGT CGCCGCTTCC ATCCGCGCGT TGGCCGGCAA CGCGCCGGTG CATCTTGGCG CGTCGGTCAC CGACGCCATC GGCGGGCAAT CGCTGCGTGC GGTCGAGATC GTCGATGCCG CCGGCAAACG GCAAAAACTC GCCGCCGATC TGCTCGCTGT GTCCGGCGGC TGGAATCCGA ACATCGCGCT CGCCACCCAT CTCGGCGGCA AGGGCGAATG GAATCCCGAG ACATCGGCGT TTCTCGCGGC CGGCGCGCCG AAGGCGATGA CCATCGCGGG TGCCGCGGCC GGCCGTTTCA CGCTGGCGCA GGCGTTGGAG GACGGCGCGC GCTGGGGCGC GGAGGCCGCA TCGCGTTGCG GCCATGCCGG CGCCGCGCAG CCGGCCTATC GCGCCAGCGA CGAAGCTTTC GCCGTGACGC CGCTGTGGCA GGTCGCAGGC GCCCGCAGCA AGGCCTTCGT CGATCTGCAG AACGACGTCA CCGCCGCCGA TATCGCGCTC TCGGCGCGCG AGGGCTTTCG CTCGGTCGAG CACCTGAAGC GCTACACCAC GCTCGGCATG GCGACCGACC AAGGCAAGAC CTCCAACGTC AACGGCCTGG CGATGATGGC GGCGCTCACC GAGCGCAGCA TCGCCGCCGC AGGCACCACG CGGGCCCGGC CGCCGCAGGT GCCGGTCGCG ATCGGCGCAT TCGGCGGCCT CAGCACCGGC AAGCATTTCA AGCCGACGCG TCTCACCGCG 
ACTCACGACT GGTCCGCCCA GCAGGGCGCG AGTTTCGTCG AGACCGGGCA GTGGTTGCGC GCGCAATGGT TCGCACGGCC GGGCGAGACC GACTGGCTGC AAAGCGTGTC GCGCGAAGTC GATGCCGTGC GGAGCGCGGT CGGGATTTGC GACGTCTCCA CCCTCGGCAA GATCGCGCTG TGCGGCGCCG ACGTCGGCGT GTTTCTCGAC CGAGTCTACA TCAACACCTT CTCGACGCTG GCGGTCGGCA AGGTGCGCTA TGGCGTGATG CTGCGCGAGG ACGGCTTCGT CATGGACGAC GGCACAACCG CGCGGCTCGC CGAGGATCAC TACGTGATGT CGACCACCAC CGCGAACGCG GTGAAGGTGA TGCAGCATCT CGAATTCTGC CATCAGGTGC TGTGGCCCGA GCTCGACGTG CAGATGGTCT CGGTCACCGA GCAATGGGCG CAGGTCGCGG TCGCCGGGCC TCGGTCCCGC ACGCTTCTGC AGAATCTGTT CGGGCCGGGT GTCGATCTGT CGGATGCGGC GTTTCCCTAT ATGGCGTGCG GCGAATTCCG CCTCGGCGAG GTGCCGGCGC GGCTGTTCCG GATCTCGTTC TCCGGCGAGC GCGCCTACGA GATCGCAGTG CCGGCCGGCT ATGGCGATGC GCTGATGCGC GCGCTGATGG CGGCGGGTGA AGGCCTCGGC GTCGTGCCCT ACGGCACCGA GGCGCTCGGC GTGATGCGGA TCGAGAAGGG CCACGCCGCC GGCAATGAAC TCAACGGCCA GACGGTGGCG CGCGATCTCG GCCTCGGCCG GATGATGTCG ACGAAGAAAG ACTTCATCGG CCGGGTGATG GCGGGCCGGC CCGCGCTGAT CGATCCGGCG CGGCCGACGC TGGTCGGCCT GCGTCCGGTC GATCGCAACG ACCGCCTGCG CAACGGCGCG CATCTGTTCG CGCCCGGCGC AGCGCCGTCG CCGGAGACCG ATCAGGGCTT CGTCACGTCG TCGGCGTTCA GCCCGTCGCT CGGCCACTGG ATCGCGCTGG CGCTGCTGTC GCGCGGTCCG GATCGGATCG GCGAACGTAT TCGCGTCTAC GATCCGATCC GCGCGCATGA TTTCGAGGCC GAGATCGTGT CGCCGGTGTT TGTCGATCCG GAAGGAGAGC GGCTGCGTGG CTGA
|
Protein sequence | MSAASRISGG LIDRDKPLRF SFDGTAMTGF AGDTLASALV ANGTRLVGRS FKYHRPRGIF SAGSEEPNAL VELRSGARRE PNTKATTVEL YDGLEAHSQN RWPSLAFDWR AVHQLASPLI VAGFYYKTFM WPAAFWEKLY EPLIRRSAGL GRLSGEPDPD TYEKATAFCD LLIIGGGPAG LAAALAAGRA GARVILVEED FALGGRLLSE LCEIDGLSGA GWAQLAEAEL ATLSNVRILR RSSVFGVYDD EFGVIERVAD HLPVPPAFTP RQRLWKIVAR ESLLATGATE RPIVFGGNDR PGVMLASAVR SYVNRFAAAP GQRAVVFTTS DDGWRSAADL SRAGIVVAAV VDPRREVAAS IRALAGNAPV HLGASVTDAI GGQSLRAVEI VDAAGKRQKL AADLLAVSGG WNPNIALATH LGGKGEWNPE TSAFLAAGAP KAMTIAGAAA GRFTLAQALE DGARWGAEAA SRCGHAGAAQ PAYRASDEAF AVTPLWQVAG ARSKAFVDLQ NDVTAADIAL SAREGFRSVE HLKRYTTLGM ATDQGKTSNV NGLAMMAALT ERSIAAAGTT RARPPQVPVA IGAFGGLSTG KHFKPTRLTA THDWSAQQGA SFVETGQWLR AQWFARPGET DWLQSVSREV DAVRSAVGIC DVSTLGKIAL CGADVGVFLD RVYINTFSTL AVGKVRYGVM LREDGFVMDD GTTARLAEDH YVMSTTTANA VKVMQHLEFC HQVLWPELDV QMVSVTEQWA QVAVAGPRSR TLLQNLFGPG VDLSDAAFPY MACGEFRLGE VPARLFRISF SGERAYEIAV PAGYGDALMR ALMAAGEGLG VVPYGTEALG VMRIEKGHAA GNELNGQTVA RDLGLGRMMS TKKDFIGRVM AGRPALIDPA RPTLVGLRPV DRNDRLRNGA HLFAPGAAPS PETDQGFVTS SAFSPSLGHW IALALLSRGP DRIGERIRVY DPIRAHDFEA EIVSPVFVDP EGERLRG
|
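As a spot check on how the gene and protein sequences relate, the sketch below translates the first 30 and last 24 bases of the gene sequence. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the two differ in which start codons are permitted, which does not matter here because the gene begins with ATG). The `translate` helper and the table construction are illustrative and do not come from the source database.

```python
from itertools import product

# Standard codon assignments, codons enumerated in TCAG order
# (TTT, TTC, TTA, TTG, TCT, ...). Assumed to apply to table 11 as well.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three and last three 10-base blocks of the gene sequence above.
first_30 = "ATGAGCGCGG CTTCACGCAT CAGCGGCGGC"
last_24 = "GAAGGAGAGC GGCTGCGTGG CTGA"

print(translate(first_30))  # MSAASRISGG
print(translate(last_24))   # EGERLRG
```

The two outputs match the first ten and last seven residues of the protein sequence, and the final TGA codon is the stop.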