Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_3420 |
Symbol | |
ID | 4444150 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | - |
Start bp | 3847557 |
End bp | 3850409 |
Gene Length | 2853 bp |
Protein Length | 950 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 639691244 |
Product | xanthine dehydrogenase, molybdenum binding subunit apoprotein |
Protein accession | YP_832895 |
Protein GI | 116671962 |
COG category | [C] Energy production and conversion |
COG ID | [COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs [COG2080] Aerobic-type carbon monoxide dehydrogenase, small subunit CoxS/CutS homologs |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCAACG ACACCACCCG CGCCATTGAA ATCAACGGCG TCCAGGCCGA AGCCGCGCCG CGCCCCGGCC AGTGCCTGCG CACCTTCCTG CGCGAACAGG GCAACTTCGG CGTCAAGAAG GGCTGCGACG GTGGCGATTG CGGTGCCTGC ACCGTGCACG TGGACGGCAC TCCCGTGCAC AGCTGCATCT ACCCGGCCGT TCGGGCCGAA GGGCACTCCG TCACCACGGT CGAGGGCCTG GCCGGCACCT GTGGAACTGC CGGCGCCCTC CACCCGATGC AGCAGCAGTT CCTGGACCGC CAGGGATTCC AGTGCGGATT CTGCACCGCC GGCATGGTGA TGACGGCGGC CACGTTCGAC GAGGAGCAGA AGGAAAACCT GCCCCGGAAC CTCAAGGGAA ACCTGTGTCG CTGCACCGGC TACCGGGCCA TCGAGGATGC CGTGTGCGGC CAGGAAGGAC ACCCGGACCC CAGGGGCCCG GGTTCGGGCA TCGCCGGCGA AGGCCAGCCC GAGCCCAAGC CGGGCCAGCT CGGAGACGAC GTTCCCGCCC CTGCCAGCCT TGCGGTGGTC ACCGGGAAAG CCCGCTATAC CCTGGACGTT CCGGCAGAGG AGCTGCAAGG GCTACTGCAC CTGAAGCTCC TGCGGTCACC CCACGCCCAT GCGCGGGTCG TCTCCATTGA CTCGGGCGCG GCGCTGCAGG TTCCAGGCGT CGTGGCCGTC CTCACCCACG AGGACGCTCC GGCGCAGCTT TTCTCGACCG CCCAGCACGA GCTTTACACC GACGATCCGG ACGATACCCG CGTCCTGGAC AACGTCGTGC GGTTCATCGG ACAGAAGGTG GCTGCCGTCG TCGCCGAATC CGTGGCGGCG GCGGAAGCCG GCGTCCGCGC CCTCACAGTC GAGTACGAGG TGCTCGACGC CGTCTTCACC CCCGAGGACG CGATGCGCCC AGGTGCCCCC GCGATCCACG GGGAAAAGGA CGCGGTCACG GCCCGCATCA GCCGGCCCGG GCAGAACGTC GTGGCAGAGC TGCATTCGGA ACTCGGCAGC GTGGCAGCCG GGTTCGCCGA GGCCGACTTC ATCCACGAAC AGACATACCG GACCCAGCGG GTGCAGCACA CCGCACTGGA GACGCACGCC GCGATCGCAT CCGTGGACGG GGAAGGCCGG CTGCAGGTCC GCACCTCCAG CCAGGTCCCG TTCCTGGTCC GGCGCACCCT GTCCCGGGTC TTCGATATCC CGGAAGAACA GATCCGCGTG GTGGCCGGCC GCGTGGGCGG CGGCTTCGGC GGCAAGCAGG AGGTGCTCAC GGAGGACGTG GTGGCACTGG CTGCGATGAA GCTGAAGCGG CCGGTGCAGC TCGAATTCAC CCGGACGGAG CAGTTCACGG CCGCCACCAC CCGGCACCCG TTCACCATCC ACCTCAAGGC CGGCGCCAGG AGGGACGGCA GGGTCACCGC GCTGCAGCTG GACGTGCTCA CCAACACGGG CGCCTACGGC AACCATGCCC CGGGCGTGAT GTTCCACGGC TGCGGCGAAT CACTCGGCGT GTACAAGTGC GCCAACAAGA AAGTGGACGC CCACGCCGTG TACACCAACA CAGTCCCGTC GGGCGCGTTC CGGGGCTACG GCCTCAGCCA GATGATCTTC GCCATCGAAT CCGGCATTGA CGAGCTGGCC ACCGGGATCG GGATGGATCC GCTGGAGTTC CGCCGCCGCA ACATGGTCCT GGAGGGCGAC GACATGCTCT CCACGCATCC CGAGCCCGAG GAAGACGTGC ACTATGGCAG CTACGGGCTG GACCAGTGCA CGCGGCTGGT GCAGGGTGCC CTGGCCCGCG GGTTGGAACG GTACCGTGCC GCCGGACTGG ACGACCTGGG CCCGGACTGG GTCACGGGCG AGGGGACCGC GCTGTCCATG ATCGACACCG TCCCGCCGCG CGGCCACTTT GCCCACTCAA AGCTCCGGCT CCTTCCGGAC GGAACGTACG CTGCCGACGT CGGGACCGCC GAATTCGGCA ACGGCACCAC CACCGTCCAT GCGCAGCTCG CGGCCACCGC GCTGTCCACG ACGGCGGGGC GCATCACGGT GCGGCAGTCG GACACGGACC TGGTGCAGCA CGATACCGGC GCGTTCGGCT CGGCCGGAAC CGTGGTGGCA GGCAAGGCCA CGCTGGCCGC GGCCGAAGAA CTCGCCATCC GGATCCGGGC GTTCGCTGCC GGCATCCGTC AGGTCCAGTC CTCCGGCTGC GTGCTGGAGG ATGACGCTGT GGTGTGCGAA GGAACGCGGG TGCCGCTGAC GGAAATTTTC GACGCCGCCG AGGCCGCCGG CGTCGAACTC GCCACCGAGG GACACTGGGG CGGCACGCCG CGCTCGGTGG CATTCAATGT CCAGGGCTTC CGCGTGGCCG TAAATACGGG CACCGGCGAA CTGAAGATCC TGCAGAGCGT CCAGGCCGCC GACGCCGGCG TGGTGGTCAA CCCGCGGCAG TGCCGCGGGC AGATCGAAGG CGGGATCGCG CAGGCCCTCG GCGCTGCGCT GTACGAGGAA GTGAAGGTGG ACGCCGGCGG AAAGGTCACC ACGGACATCC TGCGGCAGTA CCACATCCCC ACTTTCGCGG ACGTGCCCCG CAGCGAGGTC TATTTCGCCG AGACCAGCGA CAAACTGGGC CCGCTGGGGG CGAAATCGAT GAGCGAAAGC CCGTTCAACC CAGTGGCGCC GGCGCTCGCC AATGCCATCC GGAACGCCAC CGGAGTGAGG TTCGCCGAGC TGCCCATCGC CAGGGACAAG ATCTACCTTG GTCTGAAGGA AGCAGGCGTC GCCTCGCGAT TTGAGAGCGC TAACGGCCTA TAG
|
Protein sequence | MANDTTRAIE INGVQAEAAP RPGQCLRTFL REQGNFGVKK GCDGGDCGAC TVHVDGTPVH SCIYPAVRAE GHSVTTVEGL AGTCGTAGAL HPMQQQFLDR QGFQCGFCTA GMVMTAATFD EEQKENLPRN LKGNLCRCTG YRAIEDAVCG QEGHPDPRGP GSGIAGEGQP EPKPGQLGDD VPAPASLAVV TGKARYTLDV PAEELQGLLH LKLLRSPHAH ARVVSIDSGA ALQVPGVVAV LTHEDAPAQL FSTAQHELYT DDPDDTRVLD NVVRFIGQKV AAVVAESVAA AEAGVRALTV EYEVLDAVFT PEDAMRPGAP AIHGEKDAVT ARISRPGQNV VAELHSELGS VAAGFAEADF IHEQTYRTQR VQHTALETHA AIASVDGEGR LQVRTSSQVP FLVRRTLSRV FDIPEEQIRV VAGRVGGGFG GKQEVLTEDV VALAAMKLKR PVQLEFTRTE QFTAATTRHP FTIHLKAGAR RDGRVTALQL DVLTNTGAYG NHAPGVMFHG CGESLGVYKC ANKKVDAHAV YTNTVPSGAF RGYGLSQMIF AIESGIDELA TGIGMDPLEF RRRNMVLEGD DMLSTHPEPE EDVHYGSYGL DQCTRLVQGA LARGLERYRA AGLDDLGPDW VTGEGTALSM IDTVPPRGHF AHSKLRLLPD GTYAADVGTA EFGNGTTTVH AQLAATALST TAGRITVRQS DTDLVQHDTG AFGSAGTVVA GKATLAAAEE LAIRIRAFAA GIRQVQSSGC VLEDDAVVCE GTRVPLTEIF DAAEAAGVEL ATEGHWGGTP RSVAFNVQGF RVAVNTGTGE LKILQSVQAA DAGVVVNPRQ CRGQIEGGIA QALGAALYEE VKVDAGGKVT TDILRQYHIP TFADVPRSEV YFAETSDKLG PLGAKSMSES PFNPVAPALA NAIRNATGVR FAELPIARDK IYLGLKEAGV ASRFESANGL
|
| |