Gene Information |
Locus tag | RPB_1141 |
Symbol | |
ID | 3909229 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 1311822 |
End bp | 1313072 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637883035 |
Product | sarcosine oxidase beta subunit family protein |
Protein accession | YP_484762 |
Protein GI | 86748266 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0665] Glycine/D-amino acid oxidases (deaminating) |
TIGRFAM ID | [TIGR01373] sarcosine oxidase, beta subunit family, heterotetrameric form |
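The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of the source database.

```python
# Values copied from the record above.
START_BP = 1311822
END_BP = 1313072


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1251
print(protein_length(length_bp))  # 416
```

Both results match the Gene Length and Protein Length rows, which supports the inclusive-coordinate assumption.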
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.191022 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCTATT CGCTGTTCTC GCTTCTCGGC AACACACTCG GCGGCCACAA GGGCTGGAAG CCGACCTGGC GCGACGCCGC GCCGAAGCCT GCCTACGATG TGGTGATCAT CGGCGGCGGC GGCCACGGCC TCGCGACAGC CTATTATCTC GCCAGCCGCT ACGGCGTGCG CAACGTCGCG GTGCTGGAAA AGGGCTGGAT CGGCGGCGGC AATATGGGCC GCAACACCAC CATCATCCGC TCCAACTATC TGTTGCCCGG CAACATCCCG CTTTACGAGC TGTCGATGAA GCTGTGGGAG GGGCTGGAGC AGGACTTCAA CTACAACGCC ATGGTCAGCC AGCGCGGCGT GCTGAACCTG TATCATTCCG ACGCCCAGCG CGACGCCTAT GCGCGGCGCG GCAACGCGAT GCGGCTGCAC GGCGTCGATG CCGAACTGCT CGACCGCGAC GGCGTGCGCA AACTCTATCC GTTCCTGAAT TTCGACGACG CGCGGTTTCC GATCCAGGGC GGCCTGCTGC AGGCCCGCGG CGGCACCGCG CGGCACGACG CGGTGGCCTG GGGCTTCGCC CGCGGCGCCA GCGATCGCGG CGTCGACATC GTGCAGAATT GCGAAGTCAC CGGCATCACC ATTGTGAACG GCCGGGTCAC CGGGGTCGAG ACCACGCGCG GCCCGATCGC TGCGGGCAAG GTCGGCATCG CCGTCGCCGG CTCGAGCTCG CGCGTCGCCG CGATGGCCGG GATGCGGCTG CCGATCGAGT CCTTCGTGCT GCAGGCGATG GTGTCGGAAG GGCTGAAGCC GATCATCCCC GGCGTCATCA CCTTCGGGGC CGGACATTTC TATATCAGCC AGTCCGACAA AGGTGGCCTC GTGTTCGGCG GCGATCTCGA CGGCTACAAT TCCTACGCGC AGCGCGGCAA TCTGCCGACC GTGGAAGACA TCTGCGAGGG CGGCATGGCG CTGATGCCGG CGATCGGCCG CGCCCGCATC CTGCGCACCT GGGCCGGCCT TTGCGACATG TCGATGGACG GCTCGCCGAT CATCGATCGC ACGCCGACGC AGAATCTCTA TCTCAATGCC GGCTGGAACT ACGGCGGCTT CAAAGCCACG CCGGGCTCCG GCCTCGTGTT CGCGCATCTG CTCGCCCGCG ACGAGCCGCA TCCCGCCGCC GTCGAGTTGC GGCTCGATCG GTTCGCGCGC GGTGCGGTGA TCGACGAAAA GGGCCAGGGC GCCCAACCGA ATCTGCATTG A
Protein sequence | MRYSLFSLLG NTLGGHKGWK PTWRDAAPKP AYDVVIIGGG GHGLATAYYL ASRYGVRNVA VLEKGWIGGG NMGRNTTIIR SNYLLPGNIP LYELSMKLWE GLEQDFNYNA MVSQRGVLNL YHSDAQRDAY ARRGNAMRLH GVDAELLDRD GVRKLYPFLN FDDARFPIQG GLLQARGGTA RHDAVAWGFA RGASDRGVDI VQNCEVTGIT IVNGRVTGVE TTRGPIAAGK VGIAVAGSSS RVAAMAGMRL PIESFVLQAM VSEGLKPIIP GVITFGAGHF YISQSDKGGL VFGGDLDGYN SYAQRGNLPT VEDICEGGMA LMPAIGRARI LRTWAGLCDM SMDGSPIIDR TPTQNLYLNA GWNYGGFKAT PGSGLVFAHL LARDEPHPAA VELRLDRFAR GAVIDEKGQG AQPNLH
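The relationship between the two sequence rows can be sketched in a few lines of Python. This is a minimal illustration, not the tool that produced the record: it strips the 10-base grouping, translates codon by codon, and stops at the first stop codon. Translation table 11 assigns the same amino acids to codons as the standard code (it differs in the start codons it permits), so a standard codon table is used here. The names `clean`, `translate`, and `gc_percent` are hypothetical helpers.

```python
# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean(seq: str) -> str:
    """Remove the whitespace used to group the sequence and upper-case it."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1 until the first stop codon (stop not included)."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base groups of the gene sequence above.
prefix = "ATGCGCTATT CGCTGTTCTC GCTTCTCGGC"
print(translate(prefix))  # MRYSLFSLLG
```

The translated prefix matches the first ten residues of the protein sequence row. Passing the full gene sequence to `translate` and `gc_percent` gives values that can be compared against the Protein sequence and GC content rows.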