Gene Arth_3704 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_3704 
Symbol 
ID4443705 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp4168690 
End bp4169910 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content65% 
IMG OID639691528 
Productsarcosine oxidase beta subunit family protein 
Protein accessionYP_833179 
Protein GI116672246 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0665] Glycine/D-amino acid oxidases (deaminating) 
TIGRFAM ID[TIGR01373] sarcosine oxidase, beta subunit family, heterotetrameric form 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGCACCC AACAGCTCCC CGAGCACCCG GATTTCCTCT GGCGCAACCC GGAGCCCAAG 
TCCTCCTACG GCGCCGTGAT TGTGGGCGGT GGCGGGCACG GCCTGGCCAC CGCCTACTTC
CTGGCCAAGA ACCATGGCAT GACCAACATC GCCGTTCTGG AAAAGGGCTG GCTGGCCGGC
GGCAACATGG CACGCAACAC CACCATCATC CGGTCCAACT ACCTCTGGGA CGAGAGCGCT
GCCATCTACG AGCACGCCCT CAAGCTCTGG GAAATCCTGC CGGAAGAGCT CGAATACGAC
TTCCTCTTCA GCCAGCGAGG CGTCATGAAC CTCGCCCACA CCCTGGGCGA CGTCCGCGAA
AGCATCCGCC GCGTGGGCGC GAACAAGCTC AACGGCGTGG ACGCTGAGTG GCTGGAACCG
GACCAGGTCA AGGAGCTCTG CCCGATCCTG AACATCAGCG ACAACATCCG CTACCCGGTC
ATGGGCGCCA CCTACCAGCC GCGTGCCGGC ATCGCCAAGC ACGACCACGT GGCCTGGGCC
TTCGCCCGCA AGTGCGACGA ACTGGGCGTG GACATCATCC AGAACTGCGA AGTCACCGGC
TTCCTCAAGG ACGGCGACCG CGTGGTGGGC GTCAAGACCA ACCGCGGCAC CATCAACACC
GAAAAGGTGG GCCTGTGCGC TGCCGGCCAC AGCTCGGTCC TTGCGGAAAT GGCCGGCTTC
CGGCTCCCCA TCCAGTCCCA CCCGCTCCAG GCCCTGGTCT CCGAGCTGCA CGAACCGGTC
CACCCCACCG TGGTGATGTC CAACCACGTG CACGTCTATG TTTCCCAGGC CCACAAGGGC
GAACTGGTCA TGGGCGCCGG CGTCGACTCC TACAACGGCT ACGGCCAGCG CGGGTCCTTC
CACGTAATTG AGAACCAGAT GGCCGCCGCC GTTGAACTGT TCCCCATCTT TGCGCGGGCA
CATGTGCTCC GGACCTGGGG CGGGATCGTG GACACCACGC TGGATGCCTC GCCGATCGTG
GGCACCACCC CGGTGGAGAA CATGTTCGTG AACTGTGGCT GGGGCACCGG CGGCTTCAAG
GCCACCCCGG CTGCCGGACT CACGTTCGCG CACACCATCG CCACGGGCAC CCCGCACAAG
CTGAACAAGC CGTTTGCGCT GGAACGCTTC GAAACCGGCG CCCTGATCGA CGAGCACGGC
GCAGCCGCAG TAGCCCACTA G
 
Protein sequence
MSTQQLPEHP DFLWRNPEPK SSYGAVIVGG GGHGLATAYF LAKNHGMTNI AVLEKGWLAG 
GNMARNTTII RSNYLWDESA AIYEHALKLW EILPEELEYD FLFSQRGVMN LAHTLGDVRE
SIRRVGANKL NGVDAEWLEP DQVKELCPIL NISDNIRYPV MGATYQPRAG IAKHDHVAWA
FARKCDELGV DIIQNCEVTG FLKDGDRVVG VKTNRGTINT EKVGLCAAGH SSVLAEMAGF
RLPIQSHPLQ ALVSELHEPV HPTVVMSNHV HVYVSQAHKG ELVMGAGVDS YNGYGQRGSF
HVIENQMAAA VELFPIFARA HVLRTWGGIV DTTLDASPIV GTTPVENMFV NCGWGTGGFK
ATPAAGLTFA HTIATGTPHK LNKPFALERF ETGALIDEHG AAAVAH