Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dret_1808 |
Symbol | |
ID | 8419649 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfohalobium retbaense DSM 5692 |
Kingdom | Bacteria |
Replicon accession | NC_013223 |
Strand | + |
Start bp | 2076118 |
End bp | 2078832 |
Gene Length | 2715 bp |
Protein Length | 904 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 645038392 |
Product | aldehyde oxidase and xanthine dehydrogenase molybdopterin binding |
Protein accession | YP_003198670 |
Protein GI | 258405928 |
COG category | [C] Energy production and conversion |
COG ID | [COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.406388 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTCAAAA AGACTCTGTG GATCAATGGA ATGGAAAAAA AGATTGTCGC TGAAGCCGAC GAAAGTCTGG CCAACGTGCT GCGCAAACAG CTCCACCTGA CCGGCACTAA GGTCGGCTGC GACCAGGGCC AGTGCGGCGC CTGCAGTGTC ATCATGGATG GCAAGGTTGT CCGTTCCTGC ATCACCAAAA TGCGTCGAGT TCCGGACAAC GCGGCAGTAA CCACAATCGA AGGTATCGGT CAGCCAGGCA ATCTCCATGC CCTGCAATTG GCATGGATGG TCCACGGCGG CGCCCAATGT GGTTTCTGCA CACCCGGCTT TATCGTCGCC GCCAAGGGAT TGTTGGACAC CAATCCCAAT CCGAGCCGCG AGGACGTCCG CGACTGGTTT CAGAAGCACC GCAATGCCTG CCGTTGCACC GGGTATAAAC AGCTCGTTGA TGCGGTCATG GACGCGGCCA AGGTCGTTCG TGGCGATATG AGCATGCAAG ACTTGGCCTT CAAACTCCCA GAAGACGGAC ACGTCTGGGG CGGCTCCATG CCCCGGCCCA GCGCCGAGGC CAAAGTCACC GGCACCTGGG ATTTCGGGCG CGATTTGGGA CTGTTCATGC CGGAAAACAC CTTGCAACTC GCCCTGGTCC AGGCCGAGGT CTCCCACGCG AACATCAAAT CCATCGACAC GTCAGAGGCC GAAAAAATGC CCGGTGTGCA TGCGGTCCTG ACCCACAAGG ACGTCAAGGG CAAAAACCGC ATCACCGGCC TGATCACCTT CCCCACAAAT AAGGGCGACG GCTGGGACCG TCCCATCCTC TGCGACACCA AGATCCACCA ATACGGCGAC GCCATGGCCA TTGTCTGCGC CGACACCGAA GCCAACGCCA GGGCGGCGGC CAAGAAAGTC AAAGTCGATC TTGAAGAGCT GCCGGCCTAT ATGAGCGCAC CGGAAGCCAT GGCCGAAGAC GCTATTGAAA TCCACCCCGG CACGCCCAAT GTCTATTTCG AACAAAAAAT CGCCAAGGGC GAAGACACCG CATCGGTTTT CGAAAAAGCC GAAGCTGTGG TCGAGGGCGA CTATTATGTC GGCCGTCAGC CCCATATGCC CATCGAGCCG GATGTCGGCT TCGCCTACCT CAATGAGAAC AACAAATTGG TCATCCAATC CAAATCCATC GGACTCAATC TCCATCTCGC CATGATTGCC CCGGGCATGG GCGTGGAGCT CGAAGACGTC ATCATGGTCC AAAACCCCAC CGGCGGCACC TTCGGCTACA AATTCAGCCC GACCATGGAA GCCTTGGTCG GCGTCGCCGC CTTAGCCACC GGCCGTCCGG TCTTTTTGTC CTACGACTAC CACCAACAGC AGACCTACAC CGGGAAACGC TCCCCGTTTA TCACCAATGT CCGTCTGGCG GCGAACAAGG AAGGCAAATT CCTGGGCATG GAGACCGACT GGAGCGTGGA CCACGGGCCG TACTCTGAAT TCGGCGACCT GTTGACCCTG CGCGGCGCCC AGTACATCGG GGCTGGCTAC GACATCGCCA ATATTCGCGG TGAAGGCCGT ACCGTGTGCA CCAACCACGC CTGGGGCTCG GCTTTCCGGG GCTACGGTTC CCCGGAATCG GAATTCCCGT CGGAAGTCCT GATTGACGAA CTGGCTGAGA AACTCGGCAT GGACCCCTTC GAACTGCGCT ACAAAAACGT CTATCGCCCC GGCAGCACGA CCCCGACCGG GCAGGAGCCC GAAGTCTATA GTCTTCCGGA GATGATGGAC AAATTGCGTC CCAAATATGA AGAAGCCTGC AAACGCGCCA AGGCGAACTC CACAAACGAC GTCAAGCGTG GCGTGGGGAT CTCGGTGGGC GTTTACGGCG CCGGCCTGGA CGGCCCGGAC ACCGCTCAAG TCGACCTCGA ACTCAATGAA GACAATTCAG TAACCGCCTA CACTACTTGG CACGATCACG GTCAGGGCGC GGACATGGGC CTTCTGGGGA CCGTGCACGA AGCCTTGCGG CCGCTTGGAT TGTCTGCGGA ACAGATCCAC TTGGTCATGA ACGATACGGA AAAATGCCCC GACGGCGGCC CCGCCGGCGG CAGCCGCTCT CAGGGCGTCA TTGGACGCGC CGCTATCGCT GCGGCAGAGA ACTTGCTCAG CGCTATGCGC AAGGACAATG GATTCATGAC CTACGAAGAG ATGAAGGCTG CCGGCCGCGA AATGCGCTAC AGCGGCTCCT GGAGCGCCCC GGCCGCCAAT TGCGACGAAA ACGGCCAGGG CAACCCCTTT GCCCTCTACA TGTACGCCGT GTTCATGTCC GAAGTCGCAG TGGAAGTGGC TACCGGCAAG ACTGAGGTCG AACGGATGGT CATGGTCGCC GATCCCGGCG TGGTCAACAA CCGCCTCGTC GTCGACGGGC AGAACTACGG CGGCTTGGCC CAGGGCGTCG GCCTGGCCTT GAGTGAAGAC TACGAGGACA TTCAGAAACA CTCGACCTTG ATCGGGGCTG GGTTCCCGTA CATCAAACAG ATTCCTGACG ATATCGAATT GATGTATGTG GAAAGTCCGC GGCCGGAGGG GCCATTTGGC GCCTCTGGGG TCGGGGAACT TCCGCTGACC AGTCCGCACG CCTCTATCAT CAATGCCATC GCCAATGCCT GCGGCGCCCG GGTCCACGAA CTCCCGGCCC GCCCGGAAAA AGTCTTGGCG GCCATGCCCA AATAA
|
Protein sequence | MLKKTLWING MEKKIVAEAD ESLANVLRKQ LHLTGTKVGC DQGQCGACSV IMDGKVVRSC ITKMRRVPDN AAVTTIEGIG QPGNLHALQL AWMVHGGAQC GFCTPGFIVA AKGLLDTNPN PSREDVRDWF QKHRNACRCT GYKQLVDAVM DAAKVVRGDM SMQDLAFKLP EDGHVWGGSM PRPSAEAKVT GTWDFGRDLG LFMPENTLQL ALVQAEVSHA NIKSIDTSEA EKMPGVHAVL THKDVKGKNR ITGLITFPTN KGDGWDRPIL CDTKIHQYGD AMAIVCADTE ANARAAAKKV KVDLEELPAY MSAPEAMAED AIEIHPGTPN VYFEQKIAKG EDTASVFEKA EAVVEGDYYV GRQPHMPIEP DVGFAYLNEN NKLVIQSKSI GLNLHLAMIA PGMGVELEDV IMVQNPTGGT FGYKFSPTME ALVGVAALAT GRPVFLSYDY HQQQTYTGKR SPFITNVRLA ANKEGKFLGM ETDWSVDHGP YSEFGDLLTL RGAQYIGAGY DIANIRGEGR TVCTNHAWGS AFRGYGSPES EFPSEVLIDE LAEKLGMDPF ELRYKNVYRP GSTTPTGQEP EVYSLPEMMD KLRPKYEEAC KRAKANSTND VKRGVGISVG VYGAGLDGPD TAQVDLELNE DNSVTAYTTW HDHGQGADMG LLGTVHEALR PLGLSAEQIH LVMNDTEKCP DGGPAGGSRS QGVIGRAAIA AAENLLSAMR KDNGFMTYEE MKAAGREMRY SGSWSAPAAN CDENGQGNPF ALYMYAVFMS EVAVEVATGK TEVERMVMVA DPGVVNNRLV VDGQNYGGLA QGVGLALSED YEDIQKHSTL IGAGFPYIKQ IPDDIELMYV ESPRPEGPFG ASGVGELPLT SPHASIINAI ANACGARVHE LPARPEKVLA AMPK
|
| |