Gene Xaut_4702 details


Gene Information

Locus tag: Xaut_4702
Symbol:
ID: 5423721
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Xanthobacter autotrophicus Py2
Kingdom: Bacteria
Replicon accession: NC_009720
Strand:
Start bp: 5213303
End bp: 5216272
Gene Length: 2970 bp
Protein Length: 989 aa
Translation table: 11
GC content: 70%
IMG OID: 640883966
Product: sarcosine oxidase alpha subunit family protein
Protein accession: YP_001419578
Protein GI: 154248620
COG category: [E] Amino acid transport and metabolism
              [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0404] Glycine cleavage system T protein (aminomethyltransferase)
        [COG0492] Thioredoxin reductase
TIGRFAM ID: [TIGR01372] sarcosine oxidase, alpha subunit family, heterotetrameric form
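
The coordinate and length fields in this record are related by simple arithmetic, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the gene information fields of this record.
start_bp = 5213303
end_bp = 5216272
gene_length_bp = 2970
protein_length_aa = 989


def span_length(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons in the CDS minus the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons should hold if the record is internally consistent.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```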


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.272798
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.7148
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCCGCC TCGCCCATGA CGGCCTCATC GACCGCAGCC GCACCCTCTC TTTCGCCTTC 
GACGGCAAGA CCTTCACTGG CCATCCCGGC GATACACTGG CCTCGGCGCT GCTCGCCAAC
GGCGTGCGGC TCGTCGGCCG CTCGTTCAAG TATCACCGCC CGCGCGGCGT GGTGACGGCG
GGCTCCGAGG AGCCCAACGC TCTGGTGGAG CTGCGCACCG GCGCCCGGCG CGAGCCCAAC
ACCCGCGCCA CCACGGTGGA GCTTTACGAC GGCCTCGAAG CCGCGAGCCA GAACCGCTGG
CCCTCGCTCG ACCACGACGT GCTGAGCGTG AACCGGCTGG TGTCGCCCTT CCTCGGCGCG
GGCTTCTACT ACAAGACCTT CATGTGGCCC GCCGCGTTCT GGGAGAAGGT CTACGAGCCG
GTCATCCGCC GCGCCGCGGG CCTCGGCCGC GCCGCCAACG CGCCGGACCC GGACCATTAT
GAGAAGGCCA CCGCCTTCTG CGACGTGCTC GTCATCGGCT CCGGCGCGGC CGGCCTGGCG
GCGGCGCTGG CGGCAGCGCG TTCCGGCGCG CGGGTGATCC TCGCGGACGA AGACTTCCGC
CTCGGTGGCC GGCTGCTCTC CGAGCGCGCG GTGATCAATG GCGGTTCCGC CCTCGATTTC
GTGGCGAGCG CGCAAGCCGA GCTTTCGAGC CTGCGCAATG TGCGGCTCAT ACCGCGCACC
ACCGTGTTCG GCGCCTATGA CGGCAGCGAA TATGGCGCCG TCGAGCGGGT GAGCGATCAC
CTTCCCGCCC CCCTGCCCTT CCAGCCGCGC CAGCGCCTGT GGCGGATCGT GGCCAAGCGC
TGCGTGCTGG CGGCGGGTGC CTTCGACCGG CCCATCGTGT TCCCCGGCAA CGACCGGCCG
GGTGTCATGT CGGCGCTGGC ATTGGCCACC TACGCCACCC GCTACGGCGC GGGCGCGGGG
GCAAATGCTG CCGTCTTCTC CACCAATGAC CATGCGGTGG CCGCCGCCCT CGACGCGGCG
GACGCGGGCC TGAAGGTCGA CGCCGTCATC GACGTGCGCC CCGCTTTGCC CGAACCGCTC
GCGGCGCGGG CCAAGGCGCT GGGCGTGCGC GTCATCACCG AAGGCGAGGT GGTGGCGACC
TCCGGCAAGT GCCTGAAATC CGTCACCGTG CGCACCCCGC GCGGCAGCGA GACGCTGGCG
GTGGAAGCCC TCGGCGTGTC GGGTGGCGCA ACGCCCAACC TCAACCTCAC CTGCCATCTG
GGTGGAAAGC CCGTGTGGCG CGAGGACATC GCCGCCTTCG TCCCTGGCGC GGTGCCGCCC
GGCATGGCAG TGGCCGGCGC GGCGGCGGGC ACCTTCGGTC TTGCCGATAT CCTCGCCGAA
GGCACCCGCC TCGGCGCATC GGCCGCGTCC GACGCCGGCT TCGCCGCATC CCCGGCGCCG
GCTCCGCAGG CAGAGGGCGC GCCCACCGGC TTCAAGGCGG TCTTCCATGT GAAGGGCAAG
GGGTCAAAGG GCGGTCCAGC CTTCGTGGAC CAGCAGAACG ACGTGACCGC CAAAGACGTG
GCCCTCGCCC ACCGCGAAGG CTTCCGCGCG GTGGAATTGC TGAAGCGCTA CACCACGCTG
GGCATGGCCA CCGATCAGGG CAAGACCTCC AACATGGCCG GTCTCGCGGT GATGGCGGAG
CTGACCGGAA AAGGCATTCC CGCCACCGGC ACCACCGTGT TCCGCCCGCC CTACACGCCG
GTGGCGCTGG GCGTGCTGGC CGGCCACCAT CGCGGCATCG ACTTCAAGCC GGCCCGTCCC
ACCCCGACCC ACGCGTGGGC GCAAGCGCAG GGCGCGGTCT TCGTGGAAAC CGGCCTGTGG
ATGCGCGCCG CCTATTTCCC CAAGCCCGGT GAAAAGGACT GGCTGGAAAG CGTGAACCGC
GAGGTGAAAG CCACGCGTGA AAGCGTGGGC GTCATCGACG TATCCACCTT CGGCAAGATC
GACCTTCAGG GCCCGGACGT GGGCATGCTG CTCGATCGCG TCTACATCAA CATGTTCTCG
ACGCTGGCCG TGGGCAAGGC GCGCTACGGC GTGATGCTGC GCGAGGACGG CCTGGTCATG
GACGACGGCA CCACCGCACG CCTCGCCGAC GACCATTATG TGATGACCAC CACCACGGCC
AACGCCGCCA AGGTCTACCA GCATCTGGAA TTCTGCCTGC AGGTGCTGTG GCCGGAGCTG
GATGTGTGCC TCGCCTCGGT GTCTGAGCAA TGGGCGCAGA TCGCCGTGTC CGGGCCGCGC
TCCCGCGAGG TGCTGGCCAA GATCGTGGAT GGGTTGGATG TGTCCAATGC CGGCCTCCCC
TTCATGGGGG TGGCGCAGGG CACGGTGATG GGCGGCGTTC AGGCGCGCAT CTTCCGCCTC
TCCTTCTCCG GGGAGCTGGG CTACGAGATC GCGGTGCCGG CCCGCCACGG CCCGGCGCTC
ATGCAGGCGC TGATGGCGGC GGGCGCGCCC TTCGGCATCA CGCCCTATGG GGTGGAGGCG
CTGGGCGTGC TGCGCATCGA GAAGGGCCAT GTCTCCGGCA GCGAGCTGAC GGGCCAGACC
TCGGCGCGCG ATCTCGGCCT CGGCAAGATG GCGTCCACCA AGAAGGACTA TATCGGCCGG
GTGATGGCCG GGCGGCCCGC CTTCACCGAC CCGGACCGGC CCAGCTTCGT CGGCTTCAAG
CCGGTGGACC GCACCGCGCG GCTGCGCGCC GGCGCCCATT TCCTGAAAGC CGGCGCGGCG
GCGTCGACCG AGAACGACGA GGGCTACATG ACCTCGACCG CCTTCTCACC CACCCTCGGC
CACTACATCG GCCTCGGCCT TTTGAAACGC GGGCCGGAGC GCATGGGCGA GAAGGTGCGC
GCCTATGACC CGCTGCGGGG CGGTGACATC GAGGTCGAGG TGTGCTCTCC CGCATTCATT
GACCCGCAAG GGGAGAAGCA GCGTGTCTGA
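
The gene sequence is printed in blocks of 10 bases, so it has to be joined before its length or GC content can be compared with the fields in the gene information table. A minimal sketch follows, using hypothetical helper names and only the first line of the sequence as a demonstration; pasting the full sequence into the same functions would give the whole-gene values.

```python
def clean_sequence(blocked):
    """Join a sequence printed in whitespace-separated blocks into one uppercase string."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, used here only as a short example.
first_line = "ATGACCCGCC TCGCCCATGA CGGCCTCATC GACCGCAGCC GCACCCTCTC TTTCGCCTTC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```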
 
Protein sequence
MTRLAHDGLI DRSRTLSFAF DGKTFTGHPG DTLASALLAN GVRLVGRSFK YHRPRGVVTA 
GSEEPNALVE LRTGARREPN TRATTVELYD GLEAASQNRW PSLDHDVLSV NRLVSPFLGA
GFYYKTFMWP AAFWEKVYEP VIRRAAGLGR AANAPDPDHY EKATAFCDVL VIGSGAAGLA
AALAAARSGA RVILADEDFR LGGRLLSERA VINGGSALDF VASAQAELSS LRNVRLIPRT
TVFGAYDGSE YGAVERVSDH LPAPLPFQPR QRLWRIVAKR CVLAAGAFDR PIVFPGNDRP
GVMSALALAT YATRYGAGAG ANAAVFSTND HAVAAALDAA DAGLKVDAVI DVRPALPEPL
AARAKALGVR VITEGEVVAT SGKCLKSVTV RTPRGSETLA VEALGVSGGA TPNLNLTCHL
GGKPVWREDI AAFVPGAVPP GMAVAGAAAG TFGLADILAE GTRLGASAAS DAGFAASPAP
APQAEGAPTG FKAVFHVKGK GSKGGPAFVD QQNDVTAKDV ALAHREGFRA VELLKRYTTL
GMATDQGKTS NMAGLAVMAE LTGKGIPATG TTVFRPPYTP VALGVLAGHH RGIDFKPARP
TPTHAWAQAQ GAVFVETGLW MRAAYFPKPG EKDWLESVNR EVKATRESVG VIDVSTFGKI
DLQGPDVGML LDRVYINMFS TLAVGKARYG VMLREDGLVM DDGTTARLAD DHYVMTTTTA
NAAKVYQHLE FCLQVLWPEL DVCLASVSEQ WAQIAVSGPR SREVLAKIVD GLDVSNAGLP
FMGVAQGTVM GGVQARIFRL SFSGELGYEI AVPARHGPAL MQALMAAGAP FGITPYGVEA
LGVLRIEKGH VSGSELTGQT SARDLGLGKM ASTKKDYIGR VMAGRPAFTD PDRPSFVGFK
PVDRTARLRA GAHFLKAGAA ASTENDEGYM TSTAFSPTLG HYIGLGLLKR GPERMGEKVR
AYDPLRGGDI EVEVCSPAFI DPQGEKQRV
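
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below builds a codon table by hand rather than relying on a library. It assumes the codon assignments of translation table 11 match the standard genetic code (the two tables differ in permitted start codons, which this sketch does not handle), and it is shown on the first three blocks and the last line of the gene sequence only.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard codon assignments).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds):
    """Translate a coding sequence in frame, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 30 bases of the gene sequence, whitespace removed.
first_block = "".join("ATGACCCGCC TCGCCCATGA CGGCCTCATC".split())
last_block = "".join("GACCCGCAAG GGGAGAAGCA GCGTGTCTGA".split())
print(translate(first_block))  # MTRLAHDGLI
print(translate(last_block))   # DPQGEKQRV
```

The two printed fragments correspond to the first block and the final residues of the protein sequence listed in this record.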