Gene Csal_0997 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_0997 
Symbol 
ID4026220 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp1121376 
End bp1124237 
Gene Length2862 bp 
Protein Length953 aa 
Translation table11 
GC content66% 
IMG OID637966174 
Productformate dehydrogenase, alpha subunit 
Protein accessionYP_573053 
Protein GI92113125 
COG category[R] General function prediction only 
COG ID[COG3383] Uncharacterized anaerobic dehydrogenase 
TIGRFAM ID[TIGR01591] formate dehydrogenase, alpha subunit, archaeal-type 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCAAGC AATGCTCGAA CACAAGCTTC ACCCTGACCC TGGACGGGGT GGCAGTAAGC 
GCCACTGCCG GGGAGACGCT CTGGCAGGTG GCCAAGCGTG CCGGCGAGAG CATTCCTCAC
CTGTGCTTCA AGGATGCCAG TGGCTACCGC GCCGACGGCA ATTGCCGCGC CTGCATGGTG
GAGATCGAGG GCGAGCGTGC CCTGGCCGCC AGTTGCCTGC GCGAGGCCGC CCCCGGCATG
GTGGTCAAGA GCGCCAGTTC AGAGCGGGCC CGCACGGCGC GGGAAGGCGT CATGGAGCTG
CTGCTGGTCG ACCAGCCGGT GCGTGACGAC AGTCCCGACC GTTCCAGCCA CTTCTGGGCC
ATGGCCGATC AACTGGCCAT CGATGCCACG GGGGTGCGCC GACGGCTACC GGCGCGGGCC
GAGAGAGAGG CGACCACCGT GCACCATGTG CACCTTCCCG AGCATTCGCG GCGCGCCTCG
TCGACCCGTG ACGCCAGCCA TTCGGCCATG AACGTCAACC TCGACGCCTG CATCGAGTGC
AATCTCTGCG TGCGTGCCTG TCGCGAGGTC CAGGGCAATG ACGTCATCGG CATGGCGCAT
CGCGGCGCGG CTTCCAAGAT CGTCTTCGAT TTCGACGACC CCATGGGCGA TAGCACCTGC
GTGGCCTGCG GCGAGTGCGT TCAGGCCTGC CCGACCGGGG CACTGATGCC GGCGACCCTG
GTCGACGAAG CGGGGCGCGG CGATTCCCGG GTGGCCGACC GCAGCGTCGA TTCCGTCTGC
CCCTACTGCG GCGTGGGCTG CCAGCTCACC TACCATGTCA AGGACGACGC GATTCTCTTC
GTCGAAGGGC GCGAGGGCCC TTCCAACCAG AACCGGTTAT GCGTCAAGGG CCGCTTCGGC
TTCGATTATC CGCGGCACCC GTCACGCTTG ACCACGCCAC TGATCCGCAA ACCGGGGGTG
CTCAAGGGGC TCGATCCCGA TTTCGATCCG GCTCAGCCAC TGACCCATTT CGTCGAAGCG
AGCTGGGAAG AAGCGCTGGA GCTTGCCGCC GGCGGACTCG CCGACCTCAA GGCAGCGCAC
GGTCCCGATG CCCTGGCTGG CTTCGGCAGC GCCAAATGCT CCAACGAAGA AGCCTGGCTG
TTTCAGAAGC TGGTACGCAC CGGTTTCGGC TCCAACAATG TCGACCATTG CACCCGGCTG
TGCCATGCCA GTTCGGTGGC GGCACTGATG GAATGCATCG GCTCGGGCGC GGTGACCGCC
TCCTTCATGC AGGCGCTTGA GGCCGATGTC GTGATCCTCA CCGGCTGCAA CCCGACGATC
AACCATCCCG TGGCGGCCAC CTACTTCAAG CAGGCGGCCA GAAACGGCAC CAAACTGATC
GTCCTCGACC CACGCGGCCA GGCACTGGAT GCCTACGCGC ATCGGAGCGT GCGTTTCACG
CCCGGCGGCG ATGTCTCGCT CTACAACGCC ATGCTCAACG TCATCATCAG CGAGGCGCTC
TATGACCAGG CCTATATCGA TGCCCACACC GAGGGCTTCG AGGCACTCAA GGCCTATGTG
CGTGACATGA CGCCCGAAGC GATGTCGCCC GCCTGCGGCG TCGAGCCCGA GCTGATTCGC
GAGCTTGCAC GGCTGTATGC CCAGGCCGAG CGGGCGATGA TCTTCTGGGG CATGGGCATT
TCCCAGCACG TGCATGGCAC CGACAACGCC CGCTGCCTGA TCTCGCTGGC GCTGGCCTGT
GGCCAGACCG GCCGTCCCGG CACCGGCCTG CATCCGCTGC GTGGCCAGAA CAACGTCCAG
GGCGCCTCCG ATGCTGGCCT GATTCCCATG GTGCTGCCCG ACTATCAACC GGTGGGTGAC
GCCCAGATGC GCAGTGCCTT CGAGGAACTG TGGAACTCGC CGCTGGATGC GACGCCTGGG
CTCACGGTGG TGGAGATCAT GAATGCCATC GCTGCCGGTA CCATCAAGGG GATGTACATC
CTCGGCGAGA ATCCCGCGAT GTCCGACCCG GACCTCGATC ACGCCCGAGA AGCACTGGCC
AAGCTCGAGC ATCTGGTAGT GCAGGACCTG TTCGTCACCG AGACCGCGCA GTTCGCCGAT
GTGATCCTGC CGGCCTCGAG CTGGCCAGAA AAGGACGGCA GCGTGACCAA CACCAACCGT
CAGGTGCAGC TCGGTCGCGC TGCGGTGCCG CTACCCGGTG AGGCGAAGCC CGACTGGTGG
ATCATCCAGC AGCTCGCCAA TCGCCTGGGC CTGGACTGGC ATTACACGCA CCCGCGCGAG
GTGTTCGACG AGATGAAGCA GGGCATGGCG TCGCTCGATC ACATTTCGTG GGCCCGTCTG
GAACGCGAGA GCTCGGTGAC CTATCCCTGC CCCGCCGAGG ATGCTCCCGG CGCTGATGTG
GTGTTCTCCG ACGCCTTCCC CACCGCGAGC GGGCGGGCGA CGTTCACGCC GACCCGGCCA
CTGCCGCCGG ATGAGCCCAT CGATGACGAC TATCCCACGG TGCTGATCAC CGGGCGCCAG
CTGGAACACT GGCACACCGG CTCCATGACC CGGCGCACCC GGGTACTCGA CGACCTCGAG
CCCGAGGCGG TTGCCAGCCT CGCGCCGTCG GAATTGGGGC GCCTGGGACT GTCGCCGGGC
GAGGCGGTCA CCATCGCCAC TCGGCGTGGC AGCATCACCC TGAAGACGCG CGCCGATCCC
TTGATGCAGC CAGGCATGGT GTTCGTGCCG TTCTGCTATC TGGAGGCGGC GGCCAATATC
CTGACCAATC CGGCGCTCGA TCCGTTCGGC AAGATTCCCG AGTTCAAGTA CGCCGCCAGT
CGGTTGCATC GCGCCGAGGT GGCGATGGCG CTCGACGGCT GA
 
Protein sequence
MSKQCSNTSF TLTLDGVAVS ATAGETLWQV AKRAGESIPH LCFKDASGYR ADGNCRACMV 
EIEGERALAA SCLREAAPGM VVKSASSERA RTAREGVMEL LLVDQPVRDD SPDRSSHFWA
MADQLAIDAT GVRRRLPARA EREATTVHHV HLPEHSRRAS STRDASHSAM NVNLDACIEC
NLCVRACREV QGNDVIGMAH RGAASKIVFD FDDPMGDSTC VACGECVQAC PTGALMPATL
VDEAGRGDSR VADRSVDSVC PYCGVGCQLT YHVKDDAILF VEGREGPSNQ NRLCVKGRFG
FDYPRHPSRL TTPLIRKPGV LKGLDPDFDP AQPLTHFVEA SWEEALELAA GGLADLKAAH
GPDALAGFGS AKCSNEEAWL FQKLVRTGFG SNNVDHCTRL CHASSVAALM ECIGSGAVTA
SFMQALEADV VILTGCNPTI NHPVAATYFK QAARNGTKLI VLDPRGQALD AYAHRSVRFT
PGGDVSLYNA MLNVIISEAL YDQAYIDAHT EGFEALKAYV RDMTPEAMSP ACGVEPELIR
ELARLYAQAE RAMIFWGMGI SQHVHGTDNA RCLISLALAC GQTGRPGTGL HPLRGQNNVQ
GASDAGLIPM VLPDYQPVGD AQMRSAFEEL WNSPLDATPG LTVVEIMNAI AAGTIKGMYI
LGENPAMSDP DLDHAREALA KLEHLVVQDL FVTETAQFAD VILPASSWPE KDGSVTNTNR
QVQLGRAAVP LPGEAKPDWW IIQQLANRLG LDWHYTHPRE VFDEMKQGMA SLDHISWARL
ERESSVTYPC PAEDAPGADV VFSDAFPTAS GRATFTPTRP LPPDEPIDDD YPTVLITGRQ
LEHWHTGSMT RRTRVLDDLE PEAVASLAPS ELGRLGLSPG EAVTIATRRG SITLKTRADP
LMQPGMVFVP FCYLEAAANI LTNPALDPFG KIPEFKYAAS RLHRAEVAMA LDG