Gene Arth_3420 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_3420 
Symbol 
ID4444150 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp3847557 
End bp3850409 
Gene Length2853 bp 
Protein Length950 aa 
Translation table11 
GC content69% 
IMG OID639691244 
Productxanthine dehydrogenase, molybdenum binding subunit apoprotein 
Protein accessionYP_832895 
Protein GI116671962 
COG category[C] Energy production and conversion 
COG ID[COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs
[COG2080] Aerobic-type carbon monoxide dehydrogenase, small subunit CoxS/CutS homologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCAACG ACACCACCCG CGCCATTGAA ATCAACGGCG TCCAGGCCGA AGCCGCGCCG 
CGCCCCGGCC AGTGCCTGCG CACCTTCCTG CGCGAACAGG GCAACTTCGG CGTCAAGAAG
GGCTGCGACG GTGGCGATTG CGGTGCCTGC ACCGTGCACG TGGACGGCAC TCCCGTGCAC
AGCTGCATCT ACCCGGCCGT TCGGGCCGAA GGGCACTCCG TCACCACGGT CGAGGGCCTG
GCCGGCACCT GTGGAACTGC CGGCGCCCTC CACCCGATGC AGCAGCAGTT CCTGGACCGC
CAGGGATTCC AGTGCGGATT CTGCACCGCC GGCATGGTGA TGACGGCGGC CACGTTCGAC
GAGGAGCAGA AGGAAAACCT GCCCCGGAAC CTCAAGGGAA ACCTGTGTCG CTGCACCGGC
TACCGGGCCA TCGAGGATGC CGTGTGCGGC CAGGAAGGAC ACCCGGACCC CAGGGGCCCG
GGTTCGGGCA TCGCCGGCGA AGGCCAGCCC GAGCCCAAGC CGGGCCAGCT CGGAGACGAC
GTTCCCGCCC CTGCCAGCCT TGCGGTGGTC ACCGGGAAAG CCCGCTATAC CCTGGACGTT
CCGGCAGAGG AGCTGCAAGG GCTACTGCAC CTGAAGCTCC TGCGGTCACC CCACGCCCAT
GCGCGGGTCG TCTCCATTGA CTCGGGCGCG GCGCTGCAGG TTCCAGGCGT CGTGGCCGTC
CTCACCCACG AGGACGCTCC GGCGCAGCTT TTCTCGACCG CCCAGCACGA GCTTTACACC
GACGATCCGG ACGATACCCG CGTCCTGGAC AACGTCGTGC GGTTCATCGG ACAGAAGGTG
GCTGCCGTCG TCGCCGAATC CGTGGCGGCG GCGGAAGCCG GCGTCCGCGC CCTCACAGTC
GAGTACGAGG TGCTCGACGC CGTCTTCACC CCCGAGGACG CGATGCGCCC AGGTGCCCCC
GCGATCCACG GGGAAAAGGA CGCGGTCACG GCCCGCATCA GCCGGCCCGG GCAGAACGTC
GTGGCAGAGC TGCATTCGGA ACTCGGCAGC GTGGCAGCCG GGTTCGCCGA GGCCGACTTC
ATCCACGAAC AGACATACCG GACCCAGCGG GTGCAGCACA CCGCACTGGA GACGCACGCC
GCGATCGCAT CCGTGGACGG GGAAGGCCGG CTGCAGGTCC GCACCTCCAG CCAGGTCCCG
TTCCTGGTCC GGCGCACCCT GTCCCGGGTC TTCGATATCC CGGAAGAACA GATCCGCGTG
GTGGCCGGCC GCGTGGGCGG CGGCTTCGGC GGCAAGCAGG AGGTGCTCAC GGAGGACGTG
GTGGCACTGG CTGCGATGAA GCTGAAGCGG CCGGTGCAGC TCGAATTCAC CCGGACGGAG
CAGTTCACGG CCGCCACCAC CCGGCACCCG TTCACCATCC ACCTCAAGGC CGGCGCCAGG
AGGGACGGCA GGGTCACCGC GCTGCAGCTG GACGTGCTCA CCAACACGGG CGCCTACGGC
AACCATGCCC CGGGCGTGAT GTTCCACGGC TGCGGCGAAT CACTCGGCGT GTACAAGTGC
GCCAACAAGA AAGTGGACGC CCACGCCGTG TACACCAACA CAGTCCCGTC GGGCGCGTTC
CGGGGCTACG GCCTCAGCCA GATGATCTTC GCCATCGAAT CCGGCATTGA CGAGCTGGCC
ACCGGGATCG GGATGGATCC GCTGGAGTTC CGCCGCCGCA ACATGGTCCT GGAGGGCGAC
GACATGCTCT CCACGCATCC CGAGCCCGAG GAAGACGTGC ACTATGGCAG CTACGGGCTG
GACCAGTGCA CGCGGCTGGT GCAGGGTGCC CTGGCCCGCG GGTTGGAACG GTACCGTGCC
GCCGGACTGG ACGACCTGGG CCCGGACTGG GTCACGGGCG AGGGGACCGC GCTGTCCATG
ATCGACACCG TCCCGCCGCG CGGCCACTTT GCCCACTCAA AGCTCCGGCT CCTTCCGGAC
GGAACGTACG CTGCCGACGT CGGGACCGCC GAATTCGGCA ACGGCACCAC CACCGTCCAT
GCGCAGCTCG CGGCCACCGC GCTGTCCACG ACGGCGGGGC GCATCACGGT GCGGCAGTCG
GACACGGACC TGGTGCAGCA CGATACCGGC GCGTTCGGCT CGGCCGGAAC CGTGGTGGCA
GGCAAGGCCA CGCTGGCCGC GGCCGAAGAA CTCGCCATCC GGATCCGGGC GTTCGCTGCC
GGCATCCGTC AGGTCCAGTC CTCCGGCTGC GTGCTGGAGG ATGACGCTGT GGTGTGCGAA
GGAACGCGGG TGCCGCTGAC GGAAATTTTC GACGCCGCCG AGGCCGCCGG CGTCGAACTC
GCCACCGAGG GACACTGGGG CGGCACGCCG CGCTCGGTGG CATTCAATGT CCAGGGCTTC
CGCGTGGCCG TAAATACGGG CACCGGCGAA CTGAAGATCC TGCAGAGCGT CCAGGCCGCC
GACGCCGGCG TGGTGGTCAA CCCGCGGCAG TGCCGCGGGC AGATCGAAGG CGGGATCGCG
CAGGCCCTCG GCGCTGCGCT GTACGAGGAA GTGAAGGTGG ACGCCGGCGG AAAGGTCACC
ACGGACATCC TGCGGCAGTA CCACATCCCC ACTTTCGCGG ACGTGCCCCG CAGCGAGGTC
TATTTCGCCG AGACCAGCGA CAAACTGGGC CCGCTGGGGG CGAAATCGAT GAGCGAAAGC
CCGTTCAACC CAGTGGCGCC GGCGCTCGCC AATGCCATCC GGAACGCCAC CGGAGTGAGG
TTCGCCGAGC TGCCCATCGC CAGGGACAAG ATCTACCTTG GTCTGAAGGA AGCAGGCGTC
GCCTCGCGAT TTGAGAGCGC TAACGGCCTA TAG
 
Protein sequence
MANDTTRAIE INGVQAEAAP RPGQCLRTFL REQGNFGVKK GCDGGDCGAC TVHVDGTPVH 
SCIYPAVRAE GHSVTTVEGL AGTCGTAGAL HPMQQQFLDR QGFQCGFCTA GMVMTAATFD
EEQKENLPRN LKGNLCRCTG YRAIEDAVCG QEGHPDPRGP GSGIAGEGQP EPKPGQLGDD
VPAPASLAVV TGKARYTLDV PAEELQGLLH LKLLRSPHAH ARVVSIDSGA ALQVPGVVAV
LTHEDAPAQL FSTAQHELYT DDPDDTRVLD NVVRFIGQKV AAVVAESVAA AEAGVRALTV
EYEVLDAVFT PEDAMRPGAP AIHGEKDAVT ARISRPGQNV VAELHSELGS VAAGFAEADF
IHEQTYRTQR VQHTALETHA AIASVDGEGR LQVRTSSQVP FLVRRTLSRV FDIPEEQIRV
VAGRVGGGFG GKQEVLTEDV VALAAMKLKR PVQLEFTRTE QFTAATTRHP FTIHLKAGAR
RDGRVTALQL DVLTNTGAYG NHAPGVMFHG CGESLGVYKC ANKKVDAHAV YTNTVPSGAF
RGYGLSQMIF AIESGIDELA TGIGMDPLEF RRRNMVLEGD DMLSTHPEPE EDVHYGSYGL
DQCTRLVQGA LARGLERYRA AGLDDLGPDW VTGEGTALSM IDTVPPRGHF AHSKLRLLPD
GTYAADVGTA EFGNGTTTVH AQLAATALST TAGRITVRQS DTDLVQHDTG AFGSAGTVVA
GKATLAAAEE LAIRIRAFAA GIRQVQSSGC VLEDDAVVCE GTRVPLTEIF DAAEAAGVEL
ATEGHWGGTP RSVAFNVQGF RVAVNTGTGE LKILQSVQAA DAGVVVNPRQ CRGQIEGGIA
QALGAALYEE VKVDAGGKVT TDILRQYHIP TFADVPRSEV YFAETSDKLG PLGAKSMSES
PFNPVAPALA NAIRNATGVR FAELPIARDK IYLGLKEAGV ASRFESANGL