Gene Acry_0994 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcry_0994 
Symbol 
ID5161707 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidiphilium cryptum JF-5 
KingdomBacteria 
Replicon accessionNC_009484 
Strand
Start bp1099400 
End bp1102396 
Gene Length2997 bp 
Protein Length998 aa 
Translation table11 
GC content73% 
IMG OID640552911 
Productsarcosine oxidase alpha subunit family protein 
Protein accessionYP_001234130 
Protein GI148260003 
COG category[E] Amino acid transport and metabolism
[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0404] Glycine cleavage system T protein (aminomethyltransferase)
[COG0492] Thioredoxin reductase 
TIGRFAM ID[TIGR00528] glycine cleavage system T protein
[TIGR01372] sarcosine oxidase, alpha subunit family, heterotetrameric form 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCCAGA GCTTCCGGCT TGCCGAGGGC GGGCGGATCG ACCGCTCCCG CACCCTCGCC 
TTCCGTTTCG ACGGCCGCGC CTATGCCGGC CATCCGGGCG ACACGCTGGC CTCGGCCCTG
CTCGCCAACG GCGTCCACCT GGTCGGCCGT TCCTTCAAGT ATCACCGTCC GCGCGGCATC
CTGACCGCCG GCGCCGAGGA GCCGAACGCC CTGGTCGGCG TCGGCCGCGA CGAGGGGCAC
TACACCCCCA ACCTGCGCGC GACCCAGGTC GTCCTGCATG ACGGCCTGGT CGCCGAAAGC
CAGAACCGCT CCCCCTCGCT CGAACGCGAC TTCGCCGGCC TCACCGACCT GTTCTGGAAA
TTCATCCCCG CCGGCTTCTA CTACAAGACC TTCATGTGGC CGAAAGCGGC GTGGACGCGG
TTCTTCGAGC CGCGCATCCG CGCCATGGCC GGCCTCGGCC GCGCCCCGGA GGCGGCGGAC
GCCGCCTGCT ACGCCCAGCG CTACGCCCAT TGCGACGTGC TGGTGGTCGG CGCCGGCCCC
GCCGGCATCG AGGCCGCCCT CGCCGCCGCG TCCTCCGGCG CGCGGGTGAT CCTGTGCGAC
GAACAGGCCG AGCTTGGCGG CGCCCTCCTC GCCGAGACGG ATGCGACGCT CGATGGCACC
AATGCCGCGA GCTTCCTCGC CACCCGCCTG AGCTGGCTGC AGGCCGCCGG CCAGGTCACC
ATCCTGCCGC GCACCATCGC CTTCGGCTAT TTCCCGCACA ACATGATCGG CCTCGCCCAG
GACCTCACCG ATCATCTCGG CGAACCCGAC GACACCCAGC CCCGCGGCCG GCTCTGGCAG
GTCCGCGCCC GCGAGGTGGT GATCGCAACC GGCGCCATCG AGCGCCCCCT CGCCTTCCCC
GACAACGACC GCCCCGGCAT CATGCTGGCC GACGCCGCGC GCACCTACGT CACCCGCTAC
GGCGTGCTGC CGGGCCGCAA CGCCGTCGTC TTCACCGCGC ACGATTCCGC CTATGCCGCC
GCCCTCGCCC TGCACCGCGC CGGCGCCCGC ATCGCCGCGA TCGCCGATCT CCGCCCCGAT
CCGTCCGGCG AACTGGTCGA GGCCGCCCGC GCCGCCGGCC TGCCGGTCCG CCCCGGCTGC
ACCCTCACCG GCACCGAGGG GCGGCTGCGC GTCACCGCCG CCACCATCGC GCGGCGCGAT
GGCGGCGCCG GCGAGCGCCT CTCCTGCGAC CTCGTGCTGA TGTCCGGCGG CTTCACCCCG
AGCGTGCATC TCTTCTCGCA GAGCCGCGGC AAGCTCCGCT TCGATCCCGC GCTCGACGCC
TTCATCCCCG GCGAGCCGGC CGAGGCCTGC CGCGCCGCCG GTGCCGCCGC CGGCACCACC
TCGCTGGCCG ACGCCCTCGC CTCCGGCCGC GCCGCCGGCG AGGCGGCGGC CGCCGCCGCC
GGCTTCACCG CCCCGCCCGC CGCGGCGATC GCGGTCGCCA ACGCCCCCGC CGCCACCGGC
GGCTTCCTCG GCGCCACGCC GCACGGCCGC AACCCCGGCT CGGTCCGCGC CTTCATCGAC
TTCCAGAACG ACGTCACCGC GAAGGACATC TCGCTCGCCC TGCGCGAGGG CTTCCGCTCG
GTCGAGCACG TCAAGCGCTA CACCACCAAC GGCATGGCGA CCGACCAGGG CAAGCTCTCC
AACATGAACG CGCTCGGCAT CATGTCGGCC GAGCTCGGCC GCCCGATCCC CGAGATCGGC
ACCACCACCT TCCGCATGCC CTACACGCCG GTCCCCTTCG GCTATTTCGC CGGCTACGCC
CGCGGCGCGC TGTTCGAGCC CGAGCGCCAC ACCCCGATCC ACGACTGGGC GGCGGAACAG
GGCGCGGTGT TCGAGGATGT CGGCATCTGG AAACGCGCCC GCTACTTCCC GCGCGGCGGC
GAGACGATGC GCAGCGCCGT CGCCCGCGAA TGCCGCGCCG TGCGCGCGAG CGTCGGCATC
TTCGACGCCT CGACCCTCGG CAAGATCGAG GTCGTGGGGC CGGACGCCGC CGAGTTCCTC
AACCGGATGT ACGTCAACGC CTGGACCAAG CTGAAACCCG GCCGCCTGCG CTACGGCGTG
CTGCTGCGCG AGGACGGGTT CGTGATCGAC GACGGCGTGA TCGGCCGCCT GTCCGACACC
CGCTTCCACG TCACCACCAC CACCGGCGGC GCCCCGCGCG TGCTCGCCAT GATGGAGGAC
TACCTGCAGA CCGAGTTCCC CGAGCTCGAT GTCTGGCTCA CCTCCACCAC CGAGCAATAC
GCGGTCATCG CCGTCCAGGG CCCCCGCGCG CGCGACGTGA TCGCCCCGCT CGTCGAGGGG
GCGGACATCT CCGGCGCCGC CATGCCGCAC ATGAGCATGG TCGAATGCCG CGTCGCCGGC
ATCCCGGCCC GGCTCTTCCG CGTCAGCTTC ACCGGCGAGC TCGGCTTCGA GATCAACGTC
CCCGCCGATT ACGGCCGCGC CGTCTGGGAG GCGGTGTTCG ACGCCGGCCG CCGGCACGAC
ATCACCGCCT ACGGCACCGA GACGATGCAC GTTCTGCGCG CCGAGAAGGG CTACATCATC
GTCGGCCAGG AGACCGACGG CACGGCGACG CCCGACGATG TCGGCCTCGC CTGGGCGATC
GGCAAGGCGA AGCCCGATTT CGTCGGCAAG CGCGCGCTCG ACCGCGCCGC CTTCGCCGGC
CAGACCGGGC GCAAGCAGCT CGTCGGACTG TTCACCGAGC CCGGCGATAT CGTGCTCGAG
GAAGGATCCC ACCTCGTCGC CGATCCCAGC CGGCCGCCGC CCGCCGAGAT CCTCGGCCAC
GTCACCTCCG CCTACTGGAG CGAGACGCTC GGCCGCTCCA TCGCCCTCGC CCTGGTGCGC
GGCGGGCGCG ACCGCATCGG CGACACGCTG CACGTGAAAC TCGCGGACCG CGCGATCCCG
GTCCGCCTCA CCGATCCGGT ATTCTACGAC CGTGAAGGAG CCAGGCTCGA TGGCTGA
 
Protein sequence
MSQSFRLAEG GRIDRSRTLA FRFDGRAYAG HPGDTLASAL LANGVHLVGR SFKYHRPRGI 
LTAGAEEPNA LVGVGRDEGH YTPNLRATQV VLHDGLVAES QNRSPSLERD FAGLTDLFWK
FIPAGFYYKT FMWPKAAWTR FFEPRIRAMA GLGRAPEAAD AACYAQRYAH CDVLVVGAGP
AGIEAALAAA SSGARVILCD EQAELGGALL AETDATLDGT NAASFLATRL SWLQAAGQVT
ILPRTIAFGY FPHNMIGLAQ DLTDHLGEPD DTQPRGRLWQ VRAREVVIAT GAIERPLAFP
DNDRPGIMLA DAARTYVTRY GVLPGRNAVV FTAHDSAYAA ALALHRAGAR IAAIADLRPD
PSGELVEAAR AAGLPVRPGC TLTGTEGRLR VTAATIARRD GGAGERLSCD LVLMSGGFTP
SVHLFSQSRG KLRFDPALDA FIPGEPAEAC RAAGAAAGTT SLADALASGR AAGEAAAAAA
GFTAPPAAAI AVANAPAATG GFLGATPHGR NPGSVRAFID FQNDVTAKDI SLALREGFRS
VEHVKRYTTN GMATDQGKLS NMNALGIMSA ELGRPIPEIG TTTFRMPYTP VPFGYFAGYA
RGALFEPERH TPIHDWAAEQ GAVFEDVGIW KRARYFPRGG ETMRSAVARE CRAVRASVGI
FDASTLGKIE VVGPDAAEFL NRMYVNAWTK LKPGRLRYGV LLREDGFVID DGVIGRLSDT
RFHVTTTTGG APRVLAMMED YLQTEFPELD VWLTSTTEQY AVIAVQGPRA RDVIAPLVEG
ADISGAAMPH MSMVECRVAG IPARLFRVSF TGELGFEINV PADYGRAVWE AVFDAGRRHD
ITAYGTETMH VLRAEKGYII VGQETDGTAT PDDVGLAWAI GKAKPDFVGK RALDRAAFAG
QTGRKQLVGL FTEPGDIVLE EGSHLVADPS RPPPAEILGH VTSAYWSETL GRSIALALVR
GGRDRIGDTL HVKLADRAIP VRLTDPVFYD REGARLDG