Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_3704 |
Symbol | |
ID | 4443705 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | - |
Start bp | 4168690 |
End bp | 4169910 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 639691528 |
Product | sarcosine oxidase beta subunit family protein |
Protein accession | YP_833179 |
Protein GI | 116672246 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0665] Glycine/D-amino acid oxidases (deaminating) |
TIGRFAM ID | [TIGR01373] sarcosine oxidase, beta subunit family, heterotetrameric form |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGCACCC AACAGCTCCC CGAGCACCCG GATTTCCTCT GGCGCAACCC GGAGCCCAAG TCCTCCTACG GCGCCGTGAT TGTGGGCGGT GGCGGGCACG GCCTGGCCAC CGCCTACTTC CTGGCCAAGA ACCATGGCAT GACCAACATC GCCGTTCTGG AAAAGGGCTG GCTGGCCGGC GGCAACATGG CACGCAACAC CACCATCATC CGGTCCAACT ACCTCTGGGA CGAGAGCGCT GCCATCTACG AGCACGCCCT CAAGCTCTGG GAAATCCTGC CGGAAGAGCT CGAATACGAC TTCCTCTTCA GCCAGCGAGG CGTCATGAAC CTCGCCCACA CCCTGGGCGA CGTCCGCGAA AGCATCCGCC GCGTGGGCGC GAACAAGCTC AACGGCGTGG ACGCTGAGTG GCTGGAACCG GACCAGGTCA AGGAGCTCTG CCCGATCCTG AACATCAGCG ACAACATCCG CTACCCGGTC ATGGGCGCCA CCTACCAGCC GCGTGCCGGC ATCGCCAAGC ACGACCACGT GGCCTGGGCC TTCGCCCGCA AGTGCGACGA ACTGGGCGTG GACATCATCC AGAACTGCGA AGTCACCGGC TTCCTCAAGG ACGGCGACCG CGTGGTGGGC GTCAAGACCA ACCGCGGCAC CATCAACACC GAAAAGGTGG GCCTGTGCGC TGCCGGCCAC AGCTCGGTCC TTGCGGAAAT GGCCGGCTTC CGGCTCCCCA TCCAGTCCCA CCCGCTCCAG GCCCTGGTCT CCGAGCTGCA CGAACCGGTC CACCCCACCG TGGTGATGTC CAACCACGTG CACGTCTATG TTTCCCAGGC CCACAAGGGC GAACTGGTCA TGGGCGCCGG CGTCGACTCC TACAACGGCT ACGGCCAGCG CGGGTCCTTC CACGTAATTG AGAACCAGAT GGCCGCCGCC GTTGAACTGT TCCCCATCTT TGCGCGGGCA CATGTGCTCC GGACCTGGGG CGGGATCGTG GACACCACGC TGGATGCCTC GCCGATCGTG GGCACCACCC CGGTGGAGAA CATGTTCGTG AACTGTGGCT GGGGCACCGG CGGCTTCAAG GCCACCCCGG CTGCCGGACT CACGTTCGCG CACACCATCG CCACGGGCAC CCCGCACAAG CTGAACAAGC CGTTTGCGCT GGAACGCTTC GAAACCGGCG CCCTGATCGA CGAGCACGGC GCAGCCGCAG TAGCCCACTA G
|
Protein sequence | MSTQQLPEHP DFLWRNPEPK SSYGAVIVGG GGHGLATAYF LAKNHGMTNI AVLEKGWLAG GNMARNTTII RSNYLWDESA AIYEHALKLW EILPEELEYD FLFSQRGVMN LAHTLGDVRE SIRRVGANKL NGVDAEWLEP DQVKELCPIL NISDNIRYPV MGATYQPRAG IAKHDHVAWA FARKCDELGV DIIQNCEVTG FLKDGDRVVG VKTNRGTINT EKVGLCAAGH SSVLAEMAGF RLPIQSHPLQ ALVSELHEPV HPTVVMSNHV HVYVSQAHKG ELVMGAGVDS YNGYGQRGSF HVIENQMAAA VELFPIFARA HVLRTWGGIV DTTLDASPIV GTTPVENMFV NCGWGTGGFK ATPAAGLTFA HTIATGTPHK LNKPFALERF ETGALIDEHG AAAVAH
|
| |