Gene GSU2098 details

Gene Information

Locus tag: GSU2098
Symbol: cooS
ID: 2687851
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sulfurreducens PCA
Kingdom: Bacteria
Replicon accession: NC_002939
Strand:
Start bp: 2305590
End bp: 2307512
Gene Length: 1923 bp
Protein Length: 640 aa
Translation table: 11
GC content: 67%
IMG OID: 637126789
Product: carbon monoxide dehydrogenase subunit
Protein accession: NP_953147
Protein GI: 39997196
COG category: [C] Energy production and conversion
COG ID: [COG1151] 6Fe-6S prismane cluster-containing protein
TIGRFAM ID: [TIGR01702] carbon-monoxide dehydrogenase, catalytic subunit
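
The coordinate and length fields above are mutually consistent, which a little arithmetic confirms. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based, inclusive coordinates and that the CDS ends in a single untranslated stop codon. The variable names are not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 2305590
end_bp = 2307512

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1923 640
```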


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGGACCAGG CACGCAACGG TCATGACAGC CGCAGTATCG ATCCGGCCGC GAAGGAGATG 
CTGCGTATCG CCGACCGCGA AGGATACGCA ACCATCTGGG AGCGTTACGA ACAGCAGCAG
CCCCAATGCA GCTACGGCCA GCTCGGCACC TGCTGCCGGA TCTGCTCCAT GGGACCGTGC
CGGATCGACC CCTTCGGCGA CGGCCCCACC CGCGGGGTCT GCGGGGCCAC CGCCGATACG
ATGGTGGCGC GCAACTTGGC CCGGATGGCC GCGGTTGGTT CCTCCTCCCA CTCCGATCAC
GGCCGGAAGG TGGCGCTGCT CCTCAAGGCG GTGGCCAACG GCAGCAACAC CGACTACCAC
ATTGCCGACC CGGACAAGCT CACGGCGGTT GCGGAGCGCC TGGGCATCCC CACGGCCGGC
CGGTCGACCG CCGAGATAGC CGGCGACGTG GCAGCGGTGG CCATCGACTG TTTCGGCAAC
CAGGGGGAAG AGCCCATCGT CTTCATGGAG AAGTACATGC CCAAGAAGCG GTTCCAGCGC
CTGCGGGAGC TGGAGGAGAC CCTCTACCGC ACCACGGGCG CGAAGACCGG GCTCCTGCCG
CGGGCCATCG ACCGGGAGGC CGTTGACATC CTGCACCGCA CCCACTTCGG CTGCGACCAC
GATCCCCTTT CCCTGGTGGC CCAATCTGTC CGCTGCTCCC TTTCCGACGG CTGGGGGGGC
TCCCTGATCG CCACCGAGCT GCAGGACATC CTGCTCGGCT CGCCCATCAT CAGGCCGGTG
AAGGCCAATC TCGGCGTGCT GGAGGCGGAG AGCGTCAACG TGGTGGTCCA CGGCCACGAG
CCCATCCTGT CGGCCAAGGT GGTGGAGATG GCCCAGTCCC CCGAATGTCG CGCCGCGGCC
GAGGCCGTGG GGGCCAAGCG GGTCAACGTG GTGGGGCTCT GCTGCACCGG CAACGAGGTG
CTGCTGCGCC AGGGGGTCGG CATGGCGGGG AACGAGTCCC ACAGCGAGCT GGCCATCATG
ACCGGTGCCG TCGACGCCAT GGTGGTGGAC GTGCAGTGCA TCTACCCGGC CCTGGCCGAT
CTCGCCTCCT GCTTCCACAC CAAGTTCGTC ACCACGAGCG AACAGGCCAA GATCCCCGGC
GCGCTCCACA TCCAGTTCGA AGAGCACGAG GCCGACGCCA TCGCCACCCG CATCATCAAG
ACCGCCATCG ACGCCTTCCC GAACCGCAAC AAGGCCCGTG TCTACATCCC GCAGCACACC
AGCACCGCCA TTGTCGGCTT CACCGTGGAG GAGATCCTCA AGGCGCTCGG CGGAACGCCC
CAGCCCCTGA TCGACCTGAT TGTCACGGGG ACCATCAAGG GGGTCGCCGG CATCGTCGGC
TGCAACAACG TGAAGGTGCA GCAGGATTTC TTCCACCGCA CCCTGACCGA GGAGCTGATC
AAGCGCGACA TCCTCGTGAT CGGCACCGGC TGCTGGGCCA TTGCCGCGGC AAAGTCGGGG
CTCATGGACC TGCCCGCCCG CGAGCTGGCC GGACCGGGGC TCCAGGCGGT ATGCGGCCAG
CTGGGGATTC CACCGGTCCT CCACATGGGC TCGTGCGTCG ACTGCTCGCG GATGCTCAAC
CTGGCCGGGG CCCTGGCGGA TCACCTGCAG GTGGACATTT CCGATCTGCC CCTGGTCGGG
TCCGCGCCCG AATGGACCAC GGAGAAGGCG GTCGCCATCG GCACCTATTT CGTCGGCTCG
GGCATTCCCG TGCACCTGTG GCCGCTGCCG CCCATCCTGG GCGGACCGCA GGTAACGAAG
ATCCTCACCA GCGACGCCAA GGATGTCCTG GGCGGGTGGT TCTTCGTGGA GGAAGACCCG
AAGGCCACGG CCGACCGGAT GGAGCAGATC ATCATGGAGC GGCGCGCCGC CCTCGGGATC
TGA
 
Protein sequence
MDQARNGHDS RSIDPAAKEM LRIADREGYA TIWERYEQQQ PQCSYGQLGT CCRICSMGPC 
RIDPFGDGPT RGVCGATADT MVARNLARMA AVGSSSHSDH GRKVALLLKA VANGSNTDYH
IADPDKLTAV AERLGIPTAG RSTAEIAGDV AAVAIDCFGN QGEEPIVFME KYMPKKRFQR
LRELEETLYR TTGAKTGLLP RAIDREAVDI LHRTHFGCDH DPLSLVAQSV RCSLSDGWGG
SLIATELQDI LLGSPIIRPV KANLGVLEAE SVNVVVHGHE PILSAKVVEM AQSPECRAAA
EAVGAKRVNV VGLCCTGNEV LLRQGVGMAG NESHSELAIM TGAVDAMVVD VQCIYPALAD
LASCFHTKFV TTSEQAKIPG ALHIQFEEHE ADAIATRIIK TAIDAFPNRN KARVYIPQHT
STAIVGFTVE EILKALGGTP QPLIDLIVTG TIKGVAGIVG CNNVKVQQDF FHRTLTEELI
KRDILVIGTG CWAIAAAKSG LMDLPARELA GPGLQAVCGQ LGIPPVLHMG SCVDCSRMLN
LAGALADHLQ VDISDLPLVG SAPEWTTEKA VAIGTYFVGS GIPVHLWPLP PILGGPQVTK
ILTSDAKDVL GGWFFVEEDP KATADRMEQI IMERRAALGI
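
Both sequences above are printed in blocks of ten characters, so the whitespace has to be removed before they can be used elsewhere. The following is a minimal sketch, not part of the record: `join_blocks` and `gc_fraction` are hypothetical helper names, and only the first line of the gene sequence is used as sample input.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace (spaces, newlines) from a block-formatted sequence."""
    return "".join(text.split())


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    seq = seq.upper()
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, copied as displayed.
first_line = "ATGGACCAGG CACGCAACGG TCATGACAGC CGCAGTATCG ATCCGGCCGC GAAGGAGATG"

seq = join_blocks(first_line)
print(len(seq))  # 60
print(round(gc_fraction(seq), 2))
```

Running the same two helpers over the complete gene sequence gives a length and a GC fraction that can be compared with the Gene Length and GC content fields of the record.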