Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Csal_0997 |
Symbol | |
ID | 4026220 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | + |
Start bp | 1121376 |
End bp | 1124237 |
Gene Length | 2862 bp |
Protein Length | 953 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637966174 |
Product | formate dehydrogenase, alpha subunit |
Protein accession | YP_573053 |
Protein GI | 92113125 |
COG category | [R] General function prediction only |
COG ID | [COG3383] Uncharacterized anaerobic dehydrogenase |
TIGRFAM ID | [TIGR01591] formate dehydrogenase, alpha subunit, archaeal-type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCAAGC AATGCTCGAA CACAAGCTTC ACCCTGACCC TGGACGGGGT GGCAGTAAGC GCCACTGCCG GGGAGACGCT CTGGCAGGTG GCCAAGCGTG CCGGCGAGAG CATTCCTCAC CTGTGCTTCA AGGATGCCAG TGGCTACCGC GCCGACGGCA ATTGCCGCGC CTGCATGGTG GAGATCGAGG GCGAGCGTGC CCTGGCCGCC AGTTGCCTGC GCGAGGCCGC CCCCGGCATG GTGGTCAAGA GCGCCAGTTC AGAGCGGGCC CGCACGGCGC GGGAAGGCGT CATGGAGCTG CTGCTGGTCG ACCAGCCGGT GCGTGACGAC AGTCCCGACC GTTCCAGCCA CTTCTGGGCC ATGGCCGATC AACTGGCCAT CGATGCCACG GGGGTGCGCC GACGGCTACC GGCGCGGGCC GAGAGAGAGG CGACCACCGT GCACCATGTG CACCTTCCCG AGCATTCGCG GCGCGCCTCG TCGACCCGTG ACGCCAGCCA TTCGGCCATG AACGTCAACC TCGACGCCTG CATCGAGTGC AATCTCTGCG TGCGTGCCTG TCGCGAGGTC CAGGGCAATG ACGTCATCGG CATGGCGCAT CGCGGCGCGG CTTCCAAGAT CGTCTTCGAT TTCGACGACC CCATGGGCGA TAGCACCTGC GTGGCCTGCG GCGAGTGCGT TCAGGCCTGC CCGACCGGGG CACTGATGCC GGCGACCCTG GTCGACGAAG CGGGGCGCGG CGATTCCCGG GTGGCCGACC GCAGCGTCGA TTCCGTCTGC CCCTACTGCG GCGTGGGCTG CCAGCTCACC TACCATGTCA AGGACGACGC GATTCTCTTC GTCGAAGGGC GCGAGGGCCC TTCCAACCAG AACCGGTTAT GCGTCAAGGG CCGCTTCGGC TTCGATTATC CGCGGCACCC GTCACGCTTG ACCACGCCAC TGATCCGCAA ACCGGGGGTG CTCAAGGGGC TCGATCCCGA TTTCGATCCG GCTCAGCCAC TGACCCATTT CGTCGAAGCG AGCTGGGAAG AAGCGCTGGA GCTTGCCGCC GGCGGACTCG CCGACCTCAA GGCAGCGCAC GGTCCCGATG CCCTGGCTGG CTTCGGCAGC GCCAAATGCT CCAACGAAGA AGCCTGGCTG TTTCAGAAGC TGGTACGCAC CGGTTTCGGC TCCAACAATG TCGACCATTG CACCCGGCTG TGCCATGCCA GTTCGGTGGC GGCACTGATG GAATGCATCG GCTCGGGCGC GGTGACCGCC TCCTTCATGC AGGCGCTTGA GGCCGATGTC GTGATCCTCA CCGGCTGCAA CCCGACGATC AACCATCCCG TGGCGGCCAC CTACTTCAAG CAGGCGGCCA GAAACGGCAC CAAACTGATC GTCCTCGACC CACGCGGCCA GGCACTGGAT GCCTACGCGC ATCGGAGCGT GCGTTTCACG CCCGGCGGCG ATGTCTCGCT CTACAACGCC ATGCTCAACG TCATCATCAG CGAGGCGCTC TATGACCAGG CCTATATCGA TGCCCACACC GAGGGCTTCG AGGCACTCAA GGCCTATGTG CGTGACATGA CGCCCGAAGC GATGTCGCCC GCCTGCGGCG TCGAGCCCGA GCTGATTCGC GAGCTTGCAC GGCTGTATGC CCAGGCCGAG CGGGCGATGA TCTTCTGGGG CATGGGCATT TCCCAGCACG TGCATGGCAC CGACAACGCC CGCTGCCTGA TCTCGCTGGC GCTGGCCTGT GGCCAGACCG GCCGTCCCGG CACCGGCCTG CATCCGCTGC GTGGCCAGAA CAACGTCCAG GGCGCCTCCG ATGCTGGCCT GATTCCCATG GTGCTGCCCG ACTATCAACC GGTGGGTGAC GCCCAGATGC GCAGTGCCTT CGAGGAACTG TGGAACTCGC CGCTGGATGC GACGCCTGGG CTCACGGTGG TGGAGATCAT GAATGCCATC GCTGCCGGTA CCATCAAGGG GATGTACATC CTCGGCGAGA ATCCCGCGAT GTCCGACCCG GACCTCGATC ACGCCCGAGA AGCACTGGCC AAGCTCGAGC ATCTGGTAGT GCAGGACCTG TTCGTCACCG AGACCGCGCA GTTCGCCGAT GTGATCCTGC CGGCCTCGAG CTGGCCAGAA AAGGACGGCA GCGTGACCAA CACCAACCGT CAGGTGCAGC TCGGTCGCGC TGCGGTGCCG CTACCCGGTG AGGCGAAGCC CGACTGGTGG ATCATCCAGC AGCTCGCCAA TCGCCTGGGC CTGGACTGGC ATTACACGCA CCCGCGCGAG GTGTTCGACG AGATGAAGCA GGGCATGGCG TCGCTCGATC ACATTTCGTG GGCCCGTCTG GAACGCGAGA GCTCGGTGAC CTATCCCTGC CCCGCCGAGG ATGCTCCCGG CGCTGATGTG GTGTTCTCCG ACGCCTTCCC CACCGCGAGC GGGCGGGCGA CGTTCACGCC GACCCGGCCA CTGCCGCCGG ATGAGCCCAT CGATGACGAC TATCCCACGG TGCTGATCAC CGGGCGCCAG CTGGAACACT GGCACACCGG CTCCATGACC CGGCGCACCC GGGTACTCGA CGACCTCGAG CCCGAGGCGG TTGCCAGCCT CGCGCCGTCG GAATTGGGGC GCCTGGGACT GTCGCCGGGC GAGGCGGTCA CCATCGCCAC TCGGCGTGGC AGCATCACCC TGAAGACGCG CGCCGATCCC TTGATGCAGC CAGGCATGGT GTTCGTGCCG TTCTGCTATC TGGAGGCGGC GGCCAATATC CTGACCAATC CGGCGCTCGA TCCGTTCGGC AAGATTCCCG AGTTCAAGTA CGCCGCCAGT CGGTTGCATC GCGCCGAGGT GGCGATGGCG CTCGACGGCT GA
|
Protein sequence | MSKQCSNTSF TLTLDGVAVS ATAGETLWQV AKRAGESIPH LCFKDASGYR ADGNCRACMV EIEGERALAA SCLREAAPGM VVKSASSERA RTAREGVMEL LLVDQPVRDD SPDRSSHFWA MADQLAIDAT GVRRRLPARA EREATTVHHV HLPEHSRRAS STRDASHSAM NVNLDACIEC NLCVRACREV QGNDVIGMAH RGAASKIVFD FDDPMGDSTC VACGECVQAC PTGALMPATL VDEAGRGDSR VADRSVDSVC PYCGVGCQLT YHVKDDAILF VEGREGPSNQ NRLCVKGRFG FDYPRHPSRL TTPLIRKPGV LKGLDPDFDP AQPLTHFVEA SWEEALELAA GGLADLKAAH GPDALAGFGS AKCSNEEAWL FQKLVRTGFG SNNVDHCTRL CHASSVAALM ECIGSGAVTA SFMQALEADV VILTGCNPTI NHPVAATYFK QAARNGTKLI VLDPRGQALD AYAHRSVRFT PGGDVSLYNA MLNVIISEAL YDQAYIDAHT EGFEALKAYV RDMTPEAMSP ACGVEPELIR ELARLYAQAE RAMIFWGMGI SQHVHGTDNA RCLISLALAC GQTGRPGTGL HPLRGQNNVQ GASDAGLIPM VLPDYQPVGD AQMRSAFEEL WNSPLDATPG LTVVEIMNAI AAGTIKGMYI LGENPAMSDP DLDHAREALA KLEHLVVQDL FVTETAQFAD VILPASSWPE KDGSVTNTNR QVQLGRAAVP LPGEAKPDWW IIQQLANRLG LDWHYTHPRE VFDEMKQGMA SLDHISWARL ERESSVTYPC PAEDAPGADV VFSDAFPTAS GRATFTPTRP LPPDEPIDDD YPTVLITGRQ LEHWHTGSMT RRTRVLDDLE PEAVASLAPS ELGRLGLSPG EAVTIATRRG SITLKTRADP LMQPGMVFVP FCYLEAAANI LTNPALDPFG KIPEFKYAAS RLHRAEVAMA LDG
|
| |