Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_2039 |
Symbol | |
ID | 4445448 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | - |
Start bp | 2298190 |
End bp | 2300583 |
Gene Length | 2394 bp |
Protein Length | 797 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 639689847 |
Product | carbon monoxide dehydrogenase, large subunit apoprotein |
Protein accession | YP_831519 |
Protein GI | 116670586 |
COG category | [C] Energy production and conversion |
COG ID | [COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs |
TIGRFAM ID | [TIGR02416] carbon-monoxide dehydrogenase, large subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0349521 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTGTTA CCCACCATCG GCCCGGCAAT CCGGCGGCCG GTGACGCCGA CCGCCCGATC GGGTACGGGC GCATCCAGCG CAAGGAAGAC CCCCGGTTCG TCCGTGGGAT GGGCCACTAC GTCGACGACA TTGTGCTGCC GGGAATGCTG CACGGCGCCA TCCTGCGCGC CCCGGTCGCC CATGCGCGGC TGGTCTCGAT CGACACCACC AACGCGTTGG CCCACCCGAA AGTACTCGCC GTGATCACCG GCAAGGACCT GCTGGCGCTC AATCTGGCCT GGGCGCCCAC GCTTTCAGCG GATGTGCAGG CCGTGCTCGT GACCGACAAA GTGCGCTTCC AGGGGCAGGA GGTTGCTTTC GTCGTCGCGG AGAACCGCTA CGCCGCCCGC GACGCCTTGG AACTGATCGA CGTCGAATAT GACCTGCTCC CTCCGGTGAT CGATGCGCGC AAAGCCCTGG ACCCGGACGC CCCCCTGATC CGGGATGATC TCGAAGGCCG GACGGACAAC CGGATTTTCG ACTGGGAGAT GGGCGATGAG GCGGAAACCG AGGCGGTATT CGCCGCCGCC GACGTCGTAG TGGCACAGGA GGTCGTCTAC CCTCGAGTCC ATCCGGCACC CATGGAGACC TGCGGCGCCG TCGCGGACTT CGATGCCGTC TCCGGAAAGC TGACGCTCTA TGAGACGACC CAGGCCCCGC ACGCGCACCG CACCCTGTTC GCGATTGTGG CCGGCATCCC CGAACACAAG ATCCGTATAG TCTCTCCTGA TATCGGCGGC GGGTTCGGTA ACAAGGTGGG CATCTATCCC GGCTACATCC TCGCCGTCGT CGGATCGATC GTCACCGGCA AGCCGGTGAA GTGGGTGGAG GACCGCTCGG AAAACCTGAT GTCGACGTCG TTCGCCCGGG ACTACATCAT GCAGGGCGAG ATCGCCGCCA CCAAAGACGG CAAGATCCTC GCTCTCAGGA CCAGCGTGCT GGCCGATCAC GGCGCGTTCA ACGCCACCGC GCAGCCCACC AAGAACCCCG CCGGCTTCTT CTCGATCTTC ACTGGCAGCT ACGACCTGAA GGCCGCGTTC TGCAAGGTCA GAGGCGTCTA CACCAACAAG GCTCCGGGCG GCGTTGCCTA CGCGTGCTCG TTCCGGGTGA CGGAAGCCGT CTACCTGGTG GAGCGGATGG TGGACATCCT GGCCCGAAAG CTGGACATGG ACCCGGCGGA ACTCCGGCTG AAGAACTTCA TCAAGCCCGA ACAGTTCCCC TACGCGAACA AAACCGGCTG GGTCTATGAC TCAGGCAACT ATGAAGAGGC CATGCGCCTG TCGATGAAGA TGGCCGGCTA CGAGGCGCTC CGGCGCGAGC AGGTAGAAAA ACGCGAACGT GGCGAACTCA TGGGCATCGG CGTCGCCTTC TTCACTGAGG TCGTGGGAGC CGGCCCCCGC AAGCACTTCG ACATCGTGGG TCTGGGCATG GCCGACGGCG CCGAGTTGCG CGTCCACCCC ACCGGCAAGG CCGTCGTGCG GCTTTCCGTC CAGAGCCAGG GGCAGGGCCA CGAGACCACG TTCGCGCAGA TCGTCGCGGA AGAGCTCGGC ATTCCGCCGG AGAACATCGA CGTCGTCCAC GGCGACACGG ACCAGACGCC CTTCGGCCTG GGCACGTACG GGAGCCGGTC GACACCGGTC AGCGGCGGGG CGGTGGCACT CGTTGCGCGG AAGGTCCGCG AAAAGGCGAA GCTTATCGCC GCAGCCATGC TCGAAACCCG GCCCGAAGAC CTCGAGTGGG AGAAGGGCCG CTGGTTCGTC AAGGGCGATC CCGGCGCCGG GAAGACCATC GAGGAAATCG CCATGGCCGC CCACGGCACA ATGACGCTCC CCGAGGGAAT CGACGGCAAC CTCGACGCAG AGGTCACCTA CGACCCGCCG AACCTGACGT TCCCCTTCGG TGCCTACATC TGCGTAGTGG ACATCGATCC GGGTACAGGC CACGTCAAGG TGCGGCGTTT CATCGCGGTG GATGACTGCG GGACCCGGAT CAACCCGATG ATTATCGAAG GCCAGGTGCA CGGCGGCCTG ACCGACGGCG TCGGCATGGC CCTCATGGAA ATCATTGAGT TCGATGAGGC GGGCAACTGC CTGGGCGGCT CCTTTATGGA CTACCTGATC CCCACGGCGA TGGAGGTACC GGACTGGGAG ACCGGATTTA CAGTGACGCC GTCACCGCAC CACCCCATCG GCGCCAAGGG CATCGGAGAG TCCGCCACAG TCGGCTCGCC CCCGGCCATC GTGAACGCGA TCGTCGACGC CCTGGCACCT TACGGGGTCG TCCACATGGA CATGCCGTGC ACGCCCGCCC GGGTATGGGA GGCCATGCAG GGCCGGCCAA GGCCACCGAT CTGA
|
Protein sequence | MTVTHHRPGN PAAGDADRPI GYGRIQRKED PRFVRGMGHY VDDIVLPGML HGAILRAPVA HARLVSIDTT NALAHPKVLA VITGKDLLAL NLAWAPTLSA DVQAVLVTDK VRFQGQEVAF VVAENRYAAR DALELIDVEY DLLPPVIDAR KALDPDAPLI RDDLEGRTDN RIFDWEMGDE AETEAVFAAA DVVVAQEVVY PRVHPAPMET CGAVADFDAV SGKLTLYETT QAPHAHRTLF AIVAGIPEHK IRIVSPDIGG GFGNKVGIYP GYILAVVGSI VTGKPVKWVE DRSENLMSTS FARDYIMQGE IAATKDGKIL ALRTSVLADH GAFNATAQPT KNPAGFFSIF TGSYDLKAAF CKVRGVYTNK APGGVAYACS FRVTEAVYLV ERMVDILARK LDMDPAELRL KNFIKPEQFP YANKTGWVYD SGNYEEAMRL SMKMAGYEAL RREQVEKRER GELMGIGVAF FTEVVGAGPR KHFDIVGLGM ADGAELRVHP TGKAVVRLSV QSQGQGHETT FAQIVAEELG IPPENIDVVH GDTDQTPFGL GTYGSRSTPV SGGAVALVAR KVREKAKLIA AAMLETRPED LEWEKGRWFV KGDPGAGKTI EEIAMAAHGT MTLPEGIDGN LDAEVTYDPP NLTFPFGAYI CVVDIDPGTG HVKVRRFIAV DDCGTRINPM IIEGQVHGGL TDGVGMALME IIEFDEAGNC LGGSFMDYLI PTAMEVPDWE TGFTVTPSPH HPIGAKGIGE SATVGSPPAI VNAIVDALAP YGVVHMDMPC TPARVWEAMQ GRPRPPI
|
| |