Gene Information |
Locus tag | Arth_3702 |
Symbol | |
ID | 4443703 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | - |
Start bp | 4165274 |
End bp | 4168228 |
Gene Length | 2955 bp |
Protein Length | 984 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 639691526 |
Product | sarcosine oxidase alpha subunit family protein |
Protein accession | YP_833177 |
Protein GI | 116672244 |
COG category | [E] Amino acid transport and metabolism; [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0404] Glycine cleavage system T protein (aminomethyltransferase); [COG0492] Thioredoxin reductase |
TIGRFAM ID | [TIGR01372] sarcosine oxidase, alpha subunit family, heterotetrameric form |
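The start/end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The following is a minimal sketch in Python, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; both are assumptions about this record's conventions, and the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 4165274
end_bp = 4168228

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
codon_count, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = codon_count - 1

print(gene_length)     # 2955
print(remainder)       # 0
print(protein_length)  # 984
```

The strand value (-) does not change this arithmetic; it only means the coding sequence is read from the reverse complement of the replicon between these coordinates.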
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | GTGACTTCCC AGAACGCCCG CCTCGCCGCC GGCGGACGCA TCGACCGCAG CATCTCCTGG CGTTTCACCG TGGACGGCGA GGAATTCACC GGCCACCCCG GCGACACGCT CGCCTCGGCC CTGCTCGCCA ATGGCCGGAT CGCCGCCGGC AACTCGCTGT ACGAGGACCG CCCCCGCGGC ATCATGTCCG CCGGCGTGGA GGAATCCAAC GCGCTGGTCC GGGTCGAAGC ACGGTTCCCG GGCCACGTGG CAGAGTCCAT GCTCCCCGCC ACCACCGTCA CCCTGGTGGA CGGCCTGAAG GCAGACCTGC TCAACGGCCT GGGCCGGCTT GACCCCGAGG AGGACCGCGC CGAGTACGAC AAGAAGTTCG TGCACACGGA CGTCCTGGTG ATCGGCGGCG GCCCCGCCGG CCTGGCCGCG GCCCGCGAGG CCGTGCGCAC CGGCGCCCGG GTGATGCTGC TGGACGACCA GCCCGAACTG GGCGGCACGC TCCTGTCCGG ATCCACCGCA CCTGACCTGG CCGAGGCCAT CGAAGGCAAG CCGTCCCTGG AATGGGTGGC TGATGTGGAA GCCGAGCTCG TCTCCGCAGC CGAATGCACC GTCCTGAACC GCACCACGGC CTTCGGCGCC TACGACGCCA ACTACATCGT CGCCGTCCAG AACCGCACCG ACCACCTCTC CAGCCCGGCC GCCCCCGGCG TGTCCCGGCA GCGGATTTGG CACATCCGTG CCAAGCAGGT GGTGGTTGCT CCCGGCGCCC ACGAGCGCCC GCTGGTCTTC GAGAACAACG ACCGCCCGGG CATCATGCTC GCCTCGGCCG TCCGCAGCTA CCTCAACCGC TACGCCGTGG CCGCCGGGCA GCGCGTCGTC ATCAGCACCA CCAACGACAG CGCCTACGCA CTGGCCTCGG ACCTGCGCGC CGCCGGCGTC AAGGTGGCGG CCGTCGTCGA CGCCCGTCCC CGCCTTACGG AAGTGGCAGC CGCCGCCGTC GAGTCCGGCA CCCGCGTGCT GATCGGCAGC GCCGTGGCCA ACACCTCCGC CTCAGGAGAA GGCGCCGCAG ACGGCCGGCT GGACAGCGTC ACCGTCCGCA GTATCAACGA CGACGGCGAA CTCACCTCCG GCATCGAAGA GATCGCCTGC GACCTGCTGG CAGTCTCCGG CGGCTGGAGC CCGCTGGTGC ACCTCCACTC GCAGCGGCAG GGAAAGCTGC GCTGGGACGA GGACCTGGCG GCGTTCGTAC CGAGCACCGT GGTCCCGAAC CAGCAGACCA TCGGCTCCGG CCGCGGCAGC TTCGAACTCG CCGACTGCCT CGCCGAAGGC ATCTCCGCCG GAGCTTCGGC GGCCATCGCC GCCGGCTTCA GTGCCGCCGT CGAACCTTCT GTCATCGGCG AGCCGAAGGC ATCCGCCCCG ACCCGCCAGC TGTGGCTGGT GCCCGGCCAG GCCGGTACCC CGGACGACTG GCACCACCAC TTCGTGGACT TCCAGCGCGA CCAGTCCGTG GCTGACGTGC TGCGGTCCAC CGGCGCCGGC ATGCGGTCCG TGGAGCACAT CAAGCGCTAC ACCTCGATCA GCACCGCCAA CGACCAGGGC AAGACTTCCG GCGTCAACGC CATCGGCGTG ATCGCCGCGG CTCTCCGCAC CGCCGGCGAA GCGTCCCGGG GCATCGGCGA CATCGGCACC ACCACGTACC GGGCACCGTT TACCCCGGTG GCGTTTGCGG CACTTGCCGG ACGCCAGCGC GGCGAACTGT TCGACCCCGC CCGTGTTACG TCGATCCACC CGTGGCACGT GGCCAAGGGT 
GCGCTGTTCG AGGACGTGGG GCAGTGGAAG CGCCCCTGGT ACTACCCGCA GGACGGGGAG GACATGGACA CCGCCGTGCT GCGCGAGTGC GCCGCCGTCC GCGAATCCGT GGGCTTCATG GACGCCACCA CGCTCGGCAA GATCGAAATC CGTGGCAAGG ACGCCGGTGA GTTCCTCAAC CGGATCTACA CCAACGCGTT CAAGAAGCTC GCCCCGGGTT CCGCCCGCTA CGGCGTGATG TGCATGGCGG ACGGCATGAT TTTCGACGAC GGCGTGACCC TGCGCCTCGA CGAGGACCGG TTCTTCATGA CCACCACCAC CGGCGGTGCC GCGAAGGTGC TGGACTGGCT GGAGGAATGG CTACAGACCG AATGGCCTGA GCTGGACGTG CACTGCACCT CGGTGACCGA ACAGTGGAGC ACCATTGCCG TCGTCGGGCC CAAATCCCGC GCGGTCCTCG CGAAGGTGGC ACCCGAACTC GCCGCCGGCG GCGGCCTGGA GGCGGAAGCC TTCCCGTTCA TGACCTTCCG CGAAACCACC CTCGCCTCCG GCGTGCAGGC CCGGATCTGC CGGATCTCGT TCTCCGGCGA ACTGGCCTAC GAAATTAACG TGCCGTCCTG GTACGGCCTG AACACCTGGG AAGCTGTTGC GGCCGCCGGG GCCGAATTCA ATATCACCCC CTACGGCACC GAAACCATGC ACGTGCTCCG CGCCGAAAAG GGCTACCCGA TCGTCGGGCA GGACACCGAC GGCACTGTCA CCCCGCAGGA TGCCGGGATG GAATGGGTTG TCTCCAAGGC CAAGGAGTTC ATCGGCAAGC GCTCCTACGC CCGTGCCGAT GCGAAGCGCG AGGACCGCAA GCACCTGGTC AGCGTCCTCC CCGTGGACGG AACGCTGCGG CTGCCGGAAG GCACCCAGCT CGTGGAAAAG GGCATCCCGA CCAACCCCGC CTACGGTCCC GTCCCGATGC AGGGTTTCGT GACCTCGAGT TACCACAGCG CCGCACTGGG CCGGTCCTTC GGCCTGGCCC TGATCAAGAA CGGCCGCAAC CGCATCGGCG AGACCCTCGT GGCCGCCGCC GGTGACCAGC TGGTTGATGT CGTTGTCGCC GAAACCGTAC TTTTTGACCC TGAAGGGACC CGCAAAGATG GCTAA
Protein sequence | MTSQNARLAA GGRIDRSISW RFTVDGEEFT GHPGDTLASA LLANGRIAAG NSLYEDRPRG IMSAGVEESN ALVRVEARFP GHVAESMLPA TTVTLVDGLK ADLLNGLGRL DPEEDRAEYD KKFVHTDVLV IGGGPAGLAA AREAVRTGAR VMLLDDQPEL GGTLLSGSTA PDLAEAIEGK PSLEWVADVE AELVSAAECT VLNRTTAFGA YDANYIVAVQ NRTDHLSSPA APGVSRQRIW HIRAKQVVVA PGAHERPLVF ENNDRPGIML ASAVRSYLNR YAVAAGQRVV ISTTNDSAYA LASDLRAAGV KVAAVVDARP RLTEVAAAAV ESGTRVLIGS AVANTSASGE GAADGRLDSV TVRSINDDGE LTSGIEEIAC DLLAVSGGWS PLVHLHSQRQ GKLRWDEDLA AFVPSTVVPN QQTIGSGRGS FELADCLAEG ISAGASAAIA AGFSAAVEPS VIGEPKASAP TRQLWLVPGQ AGTPDDWHHH FVDFQRDQSV ADVLRSTGAG MRSVEHIKRY TSISTANDQG KTSGVNAIGV IAAALRTAGE ASRGIGDIGT TTYRAPFTPV AFAALAGRQR GELFDPARVT SIHPWHVAKG ALFEDVGQWK RPWYYPQDGE DMDTAVLREC AAVRESVGFM DATTLGKIEI RGKDAGEFLN RIYTNAFKKL APGSARYGVM CMADGMIFDD GVTLRLDEDR FFMTTTTGGA AKVLDWLEEW LQTEWPELDV HCTSVTEQWS TIAVVGPKSR AVLAKVAPEL AAGGGLEAEA FPFMTFRETT LASGVQARIC RISFSGELAY EINVPSWYGL NTWEAVAAAG AEFNITPYGT ETMHVLRAEK GYPIVGQDTD GTVTPQDAGM EWVVSKAKEF IGKRSYARAD AKREDRKHLV SVLPVDGTLR LPEGTQLVEK GIPTNPAYGP VPMQGFVTSS YHSAALGRSF GLALIKNGRN RIGETLVAAA GDQLVDVVVA ETVLFDPEGT RKDG
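The gene sequence begins with GTG while the protein sequence begins with M, which is consistent with translation table 11 treating GTG as an initiator codon. The sketch below illustrates this in Python under stated assumptions: the codon assignments are the standard ones (which table 11 shares), and `START_CODONS` is a hypothetical, deliberately short subset of table 11 initiators rather than the full list. The function name `translate` is illustrative and not part of any library.

```python
# Standard codon assignments, indexed in T, C, A, G order for each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Assumption: a small subset of the initiator codons allowed by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna):
    """Translate a coding sequence, stopping at the first stop codon."""
    # Drop the spaces that group the sequence into blocks of ten.
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        # An initiator codon in first position is read as methionine.
        if pos == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence above.
print(translate("GTGACTTCCC AGAACGCCCG CCTCGCCGCC"))  # MTSQNARLAA
print(translate("CGCAAAGATG GCTAA"))                  # RKDG
```

Both fragments agree with the start (MTSQNARLAA) and end (RKDG) of the listed protein sequence. The second fragment is taken from the interior of the reading frame, so its first codon is translated normally rather than as an initiator.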