Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_3407 |
Symbol | |
ID | 4444137 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 3832711 |
End bp | 3834288 |
Gene Length | 1578 bp |
Protein Length | 525 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 639691231 |
Product | oxidoreductase, molybdopterin binding |
Protein accession | YP_832882 |
Protein GI | 116671949 |
COG category | [R] General function prediction only |
COG ID | [COG2041] Sulfite oxidase and related enzymes |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGAAGC TTACGAACTG GCTCAAGGGT CCCACTGCGC TGGCCGCGCT GGCAGGCGTG GCTGCGGCCG CCGTCGTACT TTCCGTTGCG GAACTGATCG GCGCCTTCTT CACGGCGCGG GCAACGCCGC TCATTGCCCT CGGCTCAACC TTTATCGACT TCACGCCGCC GTGGCTGAAG GATTTCGCCA TTGCCACGTT CGGCACCAAT GACAAGGCGG CACTGTTCGC GGGGATGGGA CTGACCATCT TCCTGCTCGC CTGCGTGCTC GGCGTCGTGG CGTACCGGAA GTGGTCGCTG GGCGTGGCCG GCGTCCTCCT GATGGGCGCG GTCATTGTTG CCAGCGTTGT GACCCGGGCC AGCGTGGAAC CGCTGGACGC CATCCCTTCG CTGATCGGCA CCGCGGCCGG GCTGGTGGTG CTGCGGCTGC TGATCACGCG GCTGTGGCGG ATGCGTTCAT GGCCCGGCGT CGCCGCGGAT GTCGCCGCCA AGGACACCGA ACGCCCGGCC ACCACCCGCC GCGCCTTCTT CGCCGCCAGC GGGATCACCG CAGTGGCGGC TGCCATCGCG GCAACCGGCG GCCGGCTGCT CAGCGCGGCA CGGAGCAATA TTGCCCAGGC CCGCGAATCG CTCCAGCTGC CTTCCCCGGC CAAGGCAGCC CCCGCTGTAC CGGCGGGTGT CCAGTCCGCC GCCCCGGGGG TCACGCCGTG GATTACCCCC AACAACGAGT TCTACCGGAT CGACACGGCC CTGAGCGTGC CGGAGATCAA CGCCGAGGAA TGGGAGCTTC GCGTCCACGG CCTCGTGGAG CAGGAAGTCA CCCTGACGTT CCAGGACCTG CTCGACGCCG AACTGATTGA GTCCCACGTG ACACTCACGT GCGTGTCGAA TCCCGTGGGC GGCAACCTGG CGGGCAACGC CAAATGGCTG GGACTGCCCC TCCGTGAGGT CCTCAAGATG GCCCGCCCCA AGGACGGCGC AGACATGGTG CTGTCCACCT CCGAGGATGG CTTCAGCGCG TCGACTCCGC TCGAGGTGCT GCAGGATGAC CGCGACGCCA TGCTGGCGAT CGGCATGAAC GGCGAGCCCC TGCCGCTGGA ACACGGCTAC CCCGTCCGCA TGGTGGTCCC GGGCCTATAC GGTTTTGTCT CCGCCACCAA ATGGGTGGTG GACCTTGAAG TGACCCGCTT CGCTGACAGC AAGGCCTACT GGACGGACCG CGGCTGGTCC GAGCGCGGTC CCATCAAGAC CATGGCCCGG GTGGAGGTGC CCAAGTCCTT CGCGCAGGTC CCGGTCGGCA AAGTGGCCAT CGGCGGCACC GCCTGGGCGC AGACGCGCGG CATTACCAAG GTGGAGGTGC AGATCGACAA CGGCCCCTGG ACCGAGGCGG TGCTGTCCAC CGAAGCATCC GTGGTGACGT GGCGCCAGTG GTCGTTCGAA TGGGACGCCA CCCCCGGCCC GCATTACATC AAGGCCCGGG CCACGGACGG TACCGGCGAG GTCCAGACGG ACAAGCGCGC CGATCCCGTG CCCGACGGCG CTTCCGGCTG GCAGTCGGTT ATGGTGACCG TGCAATAG
|
Protein sequence | MKKLTNWLKG PTALAALAGV AAAAVVLSVA ELIGAFFTAR ATPLIALGST FIDFTPPWLK DFAIATFGTN DKAALFAGMG LTIFLLACVL GVVAYRKWSL GVAGVLLMGA VIVASVVTRA SVEPLDAIPS LIGTAAGLVV LRLLITRLWR MRSWPGVAAD VAAKDTERPA TTRRAFFAAS GITAVAAAIA ATGGRLLSAA RSNIAQARES LQLPSPAKAA PAVPAGVQSA APGVTPWITP NNEFYRIDTA LSVPEINAEE WELRVHGLVE QEVTLTFQDL LDAELIESHV TLTCVSNPVG GNLAGNAKWL GLPLREVLKM ARPKDGADMV LSTSEDGFSA STPLEVLQDD RDAMLAIGMN GEPLPLEHGY PVRMVVPGLY GFVSATKWVV DLEVTRFADS KAYWTDRGWS ERGPIKTMAR VEVPKSFAQV PVGKVAIGGT AWAQTRGITK VEVQIDNGPW TEAVLSTEAS VVTWRQWSFE WDATPGPHYI KARATDGTGE VQTDKRADPV PDGASGWQSV MVTVQ
|
| |