Gene Arth_3702 details


Gene Information

Locus tag: Arth_3702
Symbol:
ID: 4443703
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 4165274
End bp: 4168228
Gene Length: 2955 bp
Protein Length: 984 aa
Translation table: 11
GC content: 69%
IMG OID: 639691526
Product: sarcosine oxidase alpha subunit family protein
Protein accession: YP_833177
Protein GI: 116672244
COG category: [E] Amino acid transport and metabolism
              [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0404] Glycine cleavage system T protein (aminomethyltransferase)
        [COG0492] Thioredoxin reductase
TIGRFAM ID: [TIGR01372] sarcosine oxidase, alpha subunit family, heterotetrameric form
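
The coordinates and lengths listed above are mutually consistent, which can be checked with a few lines of arithmetic. The following is a minimal sketch in Python. It assumes 1-based inclusive coordinates (the record does not state its convention) and a CDS that includes its stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(4165274, 4168228)
print(length)                  # 2955
print(protein_length(length))  # 984
```

Both values agree with the Gene Length and Protein Length fields of this record.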


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGACTTCCC AGAACGCCCG CCTCGCCGCC GGCGGACGCA TCGACCGCAG CATCTCCTGG 
CGTTTCACCG TGGACGGCGA GGAATTCACC GGCCACCCCG GCGACACGCT CGCCTCGGCC
CTGCTCGCCA ATGGCCGGAT CGCCGCCGGC AACTCGCTGT ACGAGGACCG CCCCCGCGGC
ATCATGTCCG CCGGCGTGGA GGAATCCAAC GCGCTGGTCC GGGTCGAAGC ACGGTTCCCG
GGCCACGTGG CAGAGTCCAT GCTCCCCGCC ACCACCGTCA CCCTGGTGGA CGGCCTGAAG
GCAGACCTGC TCAACGGCCT GGGCCGGCTT GACCCCGAGG AGGACCGCGC CGAGTACGAC
AAGAAGTTCG TGCACACGGA CGTCCTGGTG ATCGGCGGCG GCCCCGCCGG CCTGGCCGCG
GCCCGCGAGG CCGTGCGCAC CGGCGCCCGG GTGATGCTGC TGGACGACCA GCCCGAACTG
GGCGGCACGC TCCTGTCCGG ATCCACCGCA CCTGACCTGG CCGAGGCCAT CGAAGGCAAG
CCGTCCCTGG AATGGGTGGC TGATGTGGAA GCCGAGCTCG TCTCCGCAGC CGAATGCACC
GTCCTGAACC GCACCACGGC CTTCGGCGCC TACGACGCCA ACTACATCGT CGCCGTCCAG
AACCGCACCG ACCACCTCTC CAGCCCGGCC GCCCCCGGCG TGTCCCGGCA GCGGATTTGG
CACATCCGTG CCAAGCAGGT GGTGGTTGCT CCCGGCGCCC ACGAGCGCCC GCTGGTCTTC
GAGAACAACG ACCGCCCGGG CATCATGCTC GCCTCGGCCG TCCGCAGCTA CCTCAACCGC
TACGCCGTGG CCGCCGGGCA GCGCGTCGTC ATCAGCACCA CCAACGACAG CGCCTACGCA
CTGGCCTCGG ACCTGCGCGC CGCCGGCGTC AAGGTGGCGG CCGTCGTCGA CGCCCGTCCC
CGCCTTACGG AAGTGGCAGC CGCCGCCGTC GAGTCCGGCA CCCGCGTGCT GATCGGCAGC
GCCGTGGCCA ACACCTCCGC CTCAGGAGAA GGCGCCGCAG ACGGCCGGCT GGACAGCGTC
ACCGTCCGCA GTATCAACGA CGACGGCGAA CTCACCTCCG GCATCGAAGA GATCGCCTGC
GACCTGCTGG CAGTCTCCGG CGGCTGGAGC CCGCTGGTGC ACCTCCACTC GCAGCGGCAG
GGAAAGCTGC GCTGGGACGA GGACCTGGCG GCGTTCGTAC CGAGCACCGT GGTCCCGAAC
CAGCAGACCA TCGGCTCCGG CCGCGGCAGC TTCGAACTCG CCGACTGCCT CGCCGAAGGC
ATCTCCGCCG GAGCTTCGGC GGCCATCGCC GCCGGCTTCA GTGCCGCCGT CGAACCTTCT
GTCATCGGCG AGCCGAAGGC ATCCGCCCCG ACCCGCCAGC TGTGGCTGGT GCCCGGCCAG
GCCGGTACCC CGGACGACTG GCACCACCAC TTCGTGGACT TCCAGCGCGA CCAGTCCGTG
GCTGACGTGC TGCGGTCCAC CGGCGCCGGC ATGCGGTCCG TGGAGCACAT CAAGCGCTAC
ACCTCGATCA GCACCGCCAA CGACCAGGGC AAGACTTCCG GCGTCAACGC CATCGGCGTG
ATCGCCGCGG CTCTCCGCAC CGCCGGCGAA GCGTCCCGGG GCATCGGCGA CATCGGCACC
ACCACGTACC GGGCACCGTT TACCCCGGTG GCGTTTGCGG CACTTGCCGG ACGCCAGCGC
GGCGAACTGT TCGACCCCGC CCGTGTTACG TCGATCCACC CGTGGCACGT GGCCAAGGGT
GCGCTGTTCG AGGACGTGGG GCAGTGGAAG CGCCCCTGGT ACTACCCGCA GGACGGGGAG
GACATGGACA CCGCCGTGCT GCGCGAGTGC GCCGCCGTCC GCGAATCCGT GGGCTTCATG
GACGCCACCA CGCTCGGCAA GATCGAAATC CGTGGCAAGG ACGCCGGTGA GTTCCTCAAC
CGGATCTACA CCAACGCGTT CAAGAAGCTC GCCCCGGGTT CCGCCCGCTA CGGCGTGATG
TGCATGGCGG ACGGCATGAT TTTCGACGAC GGCGTGACCC TGCGCCTCGA CGAGGACCGG
TTCTTCATGA CCACCACCAC CGGCGGTGCC GCGAAGGTGC TGGACTGGCT GGAGGAATGG
CTACAGACCG AATGGCCTGA GCTGGACGTG CACTGCACCT CGGTGACCGA ACAGTGGAGC
ACCATTGCCG TCGTCGGGCC CAAATCCCGC GCGGTCCTCG CGAAGGTGGC ACCCGAACTC
GCCGCCGGCG GCGGCCTGGA GGCGGAAGCC TTCCCGTTCA TGACCTTCCG CGAAACCACC
CTCGCCTCCG GCGTGCAGGC CCGGATCTGC CGGATCTCGT TCTCCGGCGA ACTGGCCTAC
GAAATTAACG TGCCGTCCTG GTACGGCCTG AACACCTGGG AAGCTGTTGC GGCCGCCGGG
GCCGAATTCA ATATCACCCC CTACGGCACC GAAACCATGC ACGTGCTCCG CGCCGAAAAG
GGCTACCCGA TCGTCGGGCA GGACACCGAC GGCACTGTCA CCCCGCAGGA TGCCGGGATG
GAATGGGTTG TCTCCAAGGC CAAGGAGTTC ATCGGCAAGC GCTCCTACGC CCGTGCCGAT
GCGAAGCGCG AGGACCGCAA GCACCTGGTC AGCGTCCTCC CCGTGGACGG AACGCTGCGG
CTGCCGGAAG GCACCCAGCT CGTGGAAAAG GGCATCCCGA CCAACCCCGC CTACGGTCCC
GTCCCGATGC AGGGTTTCGT GACCTCGAGT TACCACAGCG CCGCACTGGG CCGGTCCTTC
GGCCTGGCCC TGATCAAGAA CGGCCGCAAC CGCATCGGCG AGACCCTCGT GGCCGCCGCC
GGTGACCAGC TGGTTGATGT CGTTGTCGCC GAAACCGTAC TTTTTGACCC TGAAGGGACC
CGCAAAGATG GCTAA
 
Protein sequence
MTSQNARLAA GGRIDRSISW RFTVDGEEFT GHPGDTLASA LLANGRIAAG NSLYEDRPRG 
IMSAGVEESN ALVRVEARFP GHVAESMLPA TTVTLVDGLK ADLLNGLGRL DPEEDRAEYD
KKFVHTDVLV IGGGPAGLAA AREAVRTGAR VMLLDDQPEL GGTLLSGSTA PDLAEAIEGK
PSLEWVADVE AELVSAAECT VLNRTTAFGA YDANYIVAVQ NRTDHLSSPA APGVSRQRIW
HIRAKQVVVA PGAHERPLVF ENNDRPGIML ASAVRSYLNR YAVAAGQRVV ISTTNDSAYA
LASDLRAAGV KVAAVVDARP RLTEVAAAAV ESGTRVLIGS AVANTSASGE GAADGRLDSV
TVRSINDDGE LTSGIEEIAC DLLAVSGGWS PLVHLHSQRQ GKLRWDEDLA AFVPSTVVPN
QQTIGSGRGS FELADCLAEG ISAGASAAIA AGFSAAVEPS VIGEPKASAP TRQLWLVPGQ
AGTPDDWHHH FVDFQRDQSV ADVLRSTGAG MRSVEHIKRY TSISTANDQG KTSGVNAIGV
IAAALRTAGE ASRGIGDIGT TTYRAPFTPV AFAALAGRQR GELFDPARVT SIHPWHVAKG
ALFEDVGQWK RPWYYPQDGE DMDTAVLREC AAVRESVGFM DATTLGKIEI RGKDAGEFLN
RIYTNAFKKL APGSARYGVM CMADGMIFDD GVTLRLDEDR FFMTTTTGGA AKVLDWLEEW
LQTEWPELDV HCTSVTEQWS TIAVVGPKSR AVLAKVAPEL AAGGGLEAEA FPFMTFRETT
LASGVQARIC RISFSGELAY EINVPSWYGL NTWEAVAAAG AEFNITPYGT ETMHVLRAEK
GYPIVGQDTD GTVTPQDAGM EWVVSKAKEF IGKRSYARAD AKREDRKHLV SVLPVDGTLR
LPEGTQLVEK GIPTNPAYGP VPMQGFVTSS YHSAALGRSF GLALIKNGRN RIGETLVAAA
GDQLVDVVVA ETVLFDPEGT RKDG
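
The gene sequence begins with GTG while the protein begins with M. This is expected under translation table 11, where GTG is one of the permitted initiator codons and an initiator is read as methionine. The sketch below illustrates this on the first and last rows of the gene sequence. It is a simplified illustration, not the pipeline used to produce this record: the helper name translate and its first_is_start parameter are hypothetical, and the initiator set is the one listed for NCBI translation table 11.

```python
from itertools import product

# Standard amino-acid assignments in TCAG codon order; translation table 11
# shares these assignments and differs from table 1 in its initiator codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Initiator codons listed for NCBI translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(dna: str, first_is_start: bool = True) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon.

    Whitespace (the 10-base grouping used in this record) is ignored.
    If first_is_start is true, an initiator first codon is read as M.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and first_is_start and codon in START_CODONS:
            aa = "M"
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


first_row = "GTGACTTCCC AGAACGCCCG CCTCGCCGCC GGCGGACGCA TCGACCGCAG CATCTCCTGG"
last_row = "CGCAAAGATG GCTAA"

print(translate(first_row))                       # MTSQNARLAAGGRIDRSISW
print(translate(last_row, first_is_start=False))  # RKDG
```

The two outputs match the first 20 and last 4 residues of the protein sequence above.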