Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Xaut_0867 |
Symbol | |
ID | 5421920 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Xanthobacter autotrophicus Py2 |
Kingdom | Bacteria |
Replicon accession | NC_009720 |
Strand | - |
Start bp | 965033 |
End bp | 968044 |
Gene Length | 3012 bp |
Protein Length | 1003 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640880109 |
Product | sarcosine oxidase alpha subunit family protein |
Protein accession | YP_001415775 |
Protein GI | 154244817 |
COG category | [E] Amino acid transport and metabolism [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0404] Glycine cleavage system T protein (aminomethyltransferase) [COG0492] Thioredoxin reductase |
TIGRFAM ID | [TIGR01372] sarcosine oxidase, alpha subunit family, heterotetrameric form |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCCAAT CCAATCGCCT CCCCACCGGC GGCCGCATCG ACCGCGCTCG GCCGCTGGCC TTCACCTTCA ACGGCAAGAC CGTTGAGGGC TTTGCCGGCG ACACGGTGGC CTCCGCCCTG CTCGCCAACG GCATCCGCCT GGTGGGCCGC TCGTTCAAGT ATCACCGCCC GCGCGGCATC CTCACCCACG GCGCCGACGA GCCCAACGCG CTGCTGAAGG TAGACCGCGG CCCCGGCCGC GCCGACCCCA ACAACCGCGC CACGGTGATC GAGGCGGTGG GCGGGCTGAA GCTGTCCTCG CAGAACCACT GGCCCTCGCT GGAGCATGAC ATCGGCGCAG TCAACGACCG CCTGTCGCCA GTATTGGTGG CGGGCTTCTA CTACAAGACC TTCATGTGGC CGCGCTCGTT CTGGGACAAG CTCTACGAGC CCAAGATCCG CGCCGCCGCC GGCCTCGGCG TAGCGCCGGA CGTGGCGGAC GCCGACCGCT ACGTGCACCG GAATGCCCAT TGCGACGTGC TGGTGGTGGG CGCGGGGCCG GCCGGCCTCG CCGCAGCGCT GGCCGCCTCC GAGACCGGCA AGCGCGTGAT CCTGGCCGAC GAACAGGCGG AGATGGGCGG CAGCCTCCTC GCGGAACGCA CCGCCACCAT CGACGGCGTC TCCGCCGCGC AATGGGTGGC GCAGGCGCTT TCGACGCTGG CTGGGCGCAC CAACGTCACC CTTCTGCCGC GCACTACCGC CTTCGGCTAT TACAACCACA ACCACGTGGT GCTGACCGAG CGCGTCACCG ACCATCTGGA AAAGCCCGAT CCCAAGCTGC CGCGCGAGCG GCTGTGGCAG GTGCGCCCGC AGGAAGTGGT GATCGCCACC GGCGCCCACG AGCGCCCGCT CTCCTTCGCC GACAATGACC GCCCCGGCAT CATGCTGGCG GAAAGCGTGC GCACCTACAT CAACCGCTAC GGGGTGGCGC CGGGCCGGCG CATCGTGTTC GCCACCAACG GCGCTTCCGC CTATGAAGCA GCGGCGGACG CCAAGGCCGC CGGGCTCGAT GTCACCATCG TGGACACCCG GCCCCTGAGC GCCATCGGCC CGGAGAAGGA GCGGGTCTCG GGCATCGAGA TCCTCGCCGG CCACACGGTG GTGGGCTCCA CCGGCGGCAA GGGGCTCACC GGCCTCATCG TCGCGCCCTT TACCGCTGGC AAGGGCGTCG GCGCGCGGCG CACCCTGCCG TGCGACTGCG CCGGCGTCTC GGGGGGCTGG ACGCCGTCCG TGCACCTGTT CTCCCAATCG CGCGGCAAGC TTGCCTTCCG CGCGGAGATC GACGCCTTCG TGCCCGGCAT CTCGGTGCAG AAAGAGCGCT CCGCCGGCGC CTGCGAGGGC ACCTACGACC TCGGCTCAAT CCTCGCCGAA GGCTGGGCGG CGGGCAGCGG CGCGGCGGGC GCGGGCGGTT CCCGGCTGTT CGCCGCCGCC CCGGCGACCC CCACGGGCTT TGCCCCGGCG CGGGTGCTGC CCACGGACGA TGATCCCAAG AAGGTACGCG CCTTCCTCGA CTTCCAGAAC GACGTGACGG CCAAGGACAT CCGCCTTGCC GTGCGCGAGG GCTTCGAGAG CGTGGAGCAT GTGAAGCGCT ACACCACCAA CGGCATGGCC AGCGACCAGG GCAAGACCTC CAACATGGCG GCGCTGGGCC TGCTGTCGGA GATCCTGGAA AAGCCCATCC CGCAGGTGGG CACCACCACC TTCCGCCCGC CCTATACGCC GGTGACCTTC GGCGCCATCA TCGGCTCGGC GCGCGGGGCC TTGTTCGACC CCATCCGCAC CGCCCCGCTG CATGAGTGGG CGGTGGAGCA TGGCGCCAGG TTCGAGAACG TCTCGCTATG GCGCCGCGCC TGGTATTTCC CCAAGGCGGG GGAAGACATG CATGCGGCGG TCAAGCGCGA ATGCAAGGCG GTGCGCGAAA GCGTCGGCAT CTTCGATGCC TCGACGCTCG GCAAGATCGA GGTGAAGGGG CCGGATGCGG CGGAATTCCT CAACCGCATC TATCCCAATG CCTGGCTGAA GCTGGAACCG GGCAAGTGCC GCTACGGTCT GATGCTGAAG GAAGACGGCT TCATCTTCGA CGATGGCGTG GTCGCGCGCC TTGCCCCCGA CCTGTTCCAT GTGACCACCA CCACCGGCGG CGCGCCGCGC GTGCTCGCCC ACATGGAGGA TTATCTCCAG ACCGAGTGGC CGGACCTCAA CGTCTTCGCC ACCTCCACCA CCGAGGAATG GGCGGTGATC GCCCTTCAGG GCCCCCGCGC CCGTGACGTG CTGGCGCCCT TCGTCACCGG CATCGACCTC GCGGGCGCGG CCTTCCCGCA CATGGCGGTG CGCGTCGGGG AGATCTGCGG CGTGCCGACA AGGCTCATGC GCGTCTCCTT CACCGGCGAG CTGGGCTTTG AGATCAACGT GCCGGCGGAC CACGCCCGCG CGGTGTGGGA GGCGCTGTTC GACTACGGCC AGCGCTTCGG CATCACCCCC TACGGCACCG AGGCCATGCA TGTTCTGCGC GCGGAACGCG GCTTCATCAT CGTCGGCCAG GAAACCGACG GCACCGTGAC GCCTGACGAT CTCGGCCTCT CCGGCATGGT CTCCAAGCTA AAGCGCGATT TCGTGGGCAA GCGCTCGCTG GCGCGCCCGG ACATGGCGCT GGCCGACCGC AAGCAACTGG TGGGCCTTGC CAGCGTCGAC GGGCGCACGG TGCTGGACGA GGGGGCGCAG ATCGTCGCCG ATCCGGCCCA AAAGGTGCCC ATGACCATGC TGGGGCACGT CACCTCGTCC TATCTCAGCC CGAGCCTCGA CCAGCCCATC GCGCTGGCGC TGGTTGCCGG CGGGCGGGCG CGCATGGGGC AGACCCTGCA CGTCACCACC CCCACCGGCT TCACGGCCGT GACCGTGACC GAGCCGGTGT TCGTCGATCC CGAAGGCAAG CGCGTGAATG GCGTCCAGAC GGAAGGAGCG CTCCATGCTT GA
|
Protein sequence | MAQSNRLPTG GRIDRARPLA FTFNGKTVEG FAGDTVASAL LANGIRLVGR SFKYHRPRGI LTHGADEPNA LLKVDRGPGR ADPNNRATVI EAVGGLKLSS QNHWPSLEHD IGAVNDRLSP VLVAGFYYKT FMWPRSFWDK LYEPKIRAAA GLGVAPDVAD ADRYVHRNAH CDVLVVGAGP AGLAAALAAS ETGKRVILAD EQAEMGGSLL AERTATIDGV SAAQWVAQAL STLAGRTNVT LLPRTTAFGY YNHNHVVLTE RVTDHLEKPD PKLPRERLWQ VRPQEVVIAT GAHERPLSFA DNDRPGIMLA ESVRTYINRY GVAPGRRIVF ATNGASAYEA AADAKAAGLD VTIVDTRPLS AIGPEKERVS GIEILAGHTV VGSTGGKGLT GLIVAPFTAG KGVGARRTLP CDCAGVSGGW TPSVHLFSQS RGKLAFRAEI DAFVPGISVQ KERSAGACEG TYDLGSILAE GWAAGSGAAG AGGSRLFAAA PATPTGFAPA RVLPTDDDPK KVRAFLDFQN DVTAKDIRLA VREGFESVEH VKRYTTNGMA SDQGKTSNMA ALGLLSEILE KPIPQVGTTT FRPPYTPVTF GAIIGSARGA LFDPIRTAPL HEWAVEHGAR FENVSLWRRA WYFPKAGEDM HAAVKRECKA VRESVGIFDA STLGKIEVKG PDAAEFLNRI YPNAWLKLEP GKCRYGLMLK EDGFIFDDGV VARLAPDLFH VTTTTGGAPR VLAHMEDYLQ TEWPDLNVFA TSTTEEWAVI ALQGPRARDV LAPFVTGIDL AGAAFPHMAV RVGEICGVPT RLMRVSFTGE LGFEINVPAD HARAVWEALF DYGQRFGITP YGTEAMHVLR AERGFIIVGQ ETDGTVTPDD LGLSGMVSKL KRDFVGKRSL ARPDMALADR KQLVGLASVD GRTVLDEGAQ IVADPAQKVP MTMLGHVTSS YLSPSLDQPI ALALVAGGRA RMGQTLHVTT PTGFTAVTVT EPVFVDPEGK RVNGVQTEGA LHA
|
| |