Gene Arth_2039 details


Gene Information

Locus tag: Arth_2039
Symbol:
ID: 4445448
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 2298190
End bp: 2300583
Gene Length: 2394 bp
Protein Length: 797 aa
Translation table: 11
GC content: 65%
IMG OID: 639689847
Product: carbon monoxide dehydrogenase, large subunit apoprotein
Protein accession: YP_831519
Protein GI: 116670586
COG category: [C] Energy production and conversion
COG ID: [COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs
TIGRFAM ID: [TIGR02416] carbon-monoxide dehydrogenase, large subunit
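
The start and end coordinates, gene length, and protein length listed above are mutually consistent, and the arithmetic can be checked with a minimal Python sketch. It assumes the coordinates are 1-based and inclusive, and that the CDS ends with a stop codon that is not counted in the protein length. Neither convention is stated in the record itself, and the function names are hypothetical, chosen only for this illustration.

```python
# Values copied from the Gene Information fields above.
START_BP = 2298190
END_BP = 2300583


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count, assuming the final codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 2394 797
```

Under these assumptions the computed values match the listed Gene Length (2394 bp) and Protein Length (797 aa).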


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0349521
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACTGTTA CCCACCATCG GCCCGGCAAT CCGGCGGCCG GTGACGCCGA CCGCCCGATC 
GGGTACGGGC GCATCCAGCG CAAGGAAGAC CCCCGGTTCG TCCGTGGGAT GGGCCACTAC
GTCGACGACA TTGTGCTGCC GGGAATGCTG CACGGCGCCA TCCTGCGCGC CCCGGTCGCC
CATGCGCGGC TGGTCTCGAT CGACACCACC AACGCGTTGG CCCACCCGAA AGTACTCGCC
GTGATCACCG GCAAGGACCT GCTGGCGCTC AATCTGGCCT GGGCGCCCAC GCTTTCAGCG
GATGTGCAGG CCGTGCTCGT GACCGACAAA GTGCGCTTCC AGGGGCAGGA GGTTGCTTTC
GTCGTCGCGG AGAACCGCTA CGCCGCCCGC GACGCCTTGG AACTGATCGA CGTCGAATAT
GACCTGCTCC CTCCGGTGAT CGATGCGCGC AAAGCCCTGG ACCCGGACGC CCCCCTGATC
CGGGATGATC TCGAAGGCCG GACGGACAAC CGGATTTTCG ACTGGGAGAT GGGCGATGAG
GCGGAAACCG AGGCGGTATT CGCCGCCGCC GACGTCGTAG TGGCACAGGA GGTCGTCTAC
CCTCGAGTCC ATCCGGCACC CATGGAGACC TGCGGCGCCG TCGCGGACTT CGATGCCGTC
TCCGGAAAGC TGACGCTCTA TGAGACGACC CAGGCCCCGC ACGCGCACCG CACCCTGTTC
GCGATTGTGG CCGGCATCCC CGAACACAAG ATCCGTATAG TCTCTCCTGA TATCGGCGGC
GGGTTCGGTA ACAAGGTGGG CATCTATCCC GGCTACATCC TCGCCGTCGT CGGATCGATC
GTCACCGGCA AGCCGGTGAA GTGGGTGGAG GACCGCTCGG AAAACCTGAT GTCGACGTCG
TTCGCCCGGG ACTACATCAT GCAGGGCGAG ATCGCCGCCA CCAAAGACGG CAAGATCCTC
GCTCTCAGGA CCAGCGTGCT GGCCGATCAC GGCGCGTTCA ACGCCACCGC GCAGCCCACC
AAGAACCCCG CCGGCTTCTT CTCGATCTTC ACTGGCAGCT ACGACCTGAA GGCCGCGTTC
TGCAAGGTCA GAGGCGTCTA CACCAACAAG GCTCCGGGCG GCGTTGCCTA CGCGTGCTCG
TTCCGGGTGA CGGAAGCCGT CTACCTGGTG GAGCGGATGG TGGACATCCT GGCCCGAAAG
CTGGACATGG ACCCGGCGGA ACTCCGGCTG AAGAACTTCA TCAAGCCCGA ACAGTTCCCC
TACGCGAACA AAACCGGCTG GGTCTATGAC TCAGGCAACT ATGAAGAGGC CATGCGCCTG
TCGATGAAGA TGGCCGGCTA CGAGGCGCTC CGGCGCGAGC AGGTAGAAAA ACGCGAACGT
GGCGAACTCA TGGGCATCGG CGTCGCCTTC TTCACTGAGG TCGTGGGAGC CGGCCCCCGC
AAGCACTTCG ACATCGTGGG TCTGGGCATG GCCGACGGCG CCGAGTTGCG CGTCCACCCC
ACCGGCAAGG CCGTCGTGCG GCTTTCCGTC CAGAGCCAGG GGCAGGGCCA CGAGACCACG
TTCGCGCAGA TCGTCGCGGA AGAGCTCGGC ATTCCGCCGG AGAACATCGA CGTCGTCCAC
GGCGACACGG ACCAGACGCC CTTCGGCCTG GGCACGTACG GGAGCCGGTC GACACCGGTC
AGCGGCGGGG CGGTGGCACT CGTTGCGCGG AAGGTCCGCG AAAAGGCGAA GCTTATCGCC
GCAGCCATGC TCGAAACCCG GCCCGAAGAC CTCGAGTGGG AGAAGGGCCG CTGGTTCGTC
AAGGGCGATC CCGGCGCCGG GAAGACCATC GAGGAAATCG CCATGGCCGC CCACGGCACA
ATGACGCTCC CCGAGGGAAT CGACGGCAAC CTCGACGCAG AGGTCACCTA CGACCCGCCG
AACCTGACGT TCCCCTTCGG TGCCTACATC TGCGTAGTGG ACATCGATCC GGGTACAGGC
CACGTCAAGG TGCGGCGTTT CATCGCGGTG GATGACTGCG GGACCCGGAT CAACCCGATG
ATTATCGAAG GCCAGGTGCA CGGCGGCCTG ACCGACGGCG TCGGCATGGC CCTCATGGAA
ATCATTGAGT TCGATGAGGC GGGCAACTGC CTGGGCGGCT CCTTTATGGA CTACCTGATC
CCCACGGCGA TGGAGGTACC GGACTGGGAG ACCGGATTTA CAGTGACGCC GTCACCGCAC
CACCCCATCG GCGCCAAGGG CATCGGAGAG TCCGCCACAG TCGGCTCGCC CCCGGCCATC
GTGAACGCGA TCGTCGACGC CCTGGCACCT TACGGGGTCG TCCACATGGA CATGCCGTGC
ACGCCCGCCC GGGTATGGGA GGCCATGCAG GGCCGGCCAA GGCCACCGAT CTGA
 
Protein sequence
MTVTHHRPGN PAAGDADRPI GYGRIQRKED PRFVRGMGHY VDDIVLPGML HGAILRAPVA 
HARLVSIDTT NALAHPKVLA VITGKDLLAL NLAWAPTLSA DVQAVLVTDK VRFQGQEVAF
VVAENRYAAR DALELIDVEY DLLPPVIDAR KALDPDAPLI RDDLEGRTDN RIFDWEMGDE
AETEAVFAAA DVVVAQEVVY PRVHPAPMET CGAVADFDAV SGKLTLYETT QAPHAHRTLF
AIVAGIPEHK IRIVSPDIGG GFGNKVGIYP GYILAVVGSI VTGKPVKWVE DRSENLMSTS
FARDYIMQGE IAATKDGKIL ALRTSVLADH GAFNATAQPT KNPAGFFSIF TGSYDLKAAF
CKVRGVYTNK APGGVAYACS FRVTEAVYLV ERMVDILARK LDMDPAELRL KNFIKPEQFP
YANKTGWVYD SGNYEEAMRL SMKMAGYEAL RREQVEKRER GELMGIGVAF FTEVVGAGPR
KHFDIVGLGM ADGAELRVHP TGKAVVRLSV QSQGQGHETT FAQIVAEELG IPPENIDVVH
GDTDQTPFGL GTYGSRSTPV SGGAVALVAR KVREKAKLIA AAMLETRPED LEWEKGRWFV
KGDPGAGKTI EEIAMAAHGT MTLPEGIDGN LDAEVTYDPP NLTFPFGAYI CVVDIDPGTG
HVKVRRFIAV DDCGTRINPM IIEGQVHGGL TDGVGMALME IIEFDEAGNC LGGSFMDYLI
PTAMEVPDWE TGFTVTPSPH HPIGAKGIGE SATVGSPPAI VNAIVDALAP YGVVHMDMPC
TPARVWEAMQ GRPRPPI
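
The gene and protein sequences above are printed in space-separated blocks of ten. The sketch below shows one way to strip that formatting, translate the DNA, and compute a GC fraction. It relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, and only the first and last printed lines of the gene sequence are used here rather than the full record.

```python
from itertools import product

# Codon-to-amino-acid assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last printed lines of the gene sequence (each line is in frame).
first_line = "ATGACTGTTA CCCACCATCG GCCCGGCAAT CCGGCGGCCG GTGACGCCGA CCGCCCGATC"
last_line = "ACGCCCGCCC GGGTATGGGA GGCCATGCAG GGCCGGCCAA GGCCACCGAT CTGA"

print(translate(first_line))  # MTVTHHRPGNPAAGDADRPI
print(translate(last_line))   # TPARVWEAMQGRPRPPI
```

Both outputs agree with the corresponding first and last lines of the protein sequence. Passing the full gene sequence to `gc_fraction` would give a value to compare against the listed GC content.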