Gene Cagg_2476 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_2476 
Symbol 
ID7269322 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp3011377 
End bp3012798 
Gene Length1422 bp 
Protein Length473 aa 
Translation table11 
GC content54% 
IMG OID643567303 
Productnickel-dependent hydrogenase large subunit 
Protein accessionYP_002463784 
Protein GI219849351 
COG category[C] Energy production and conversion 
COG ID[COG3259] Coenzyme F420-reducing hydrogenase, alpha subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.795621 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.410999 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCCAAC GGATTGTGAT CGATCCGGTC ACACGGATCG AAGGGCATGC CAAAATCAGC 
ATTCACCTCG ACGACAACGG TAATGTCGCC GAGACACGCT TTCACGTTAC CGAGTTTCGC
GGTTTTGAAC GATTCTGCAT TGGGCGGCCC TTTTGGGAGA TGCCCGGTAT TACTGCGCGG
ATCTGTGGTA TCTGTCCGGT GAGCCACCTC CTTGCCTCGG CCCGTACCGG CGATGCCTTA
CTGGCAGTGC AGATCCCTCC GGCTGCAGAA AAACTCCGCC GCCTGATGAA TCTCGGTCAG
ATAATTCAAT CGCATTCACT CAGTTTCTTT CACTTGAGTG GTCCTGATCT CATGCTCGGC
TTTGACAGTG AGCCGACACA GCGCAACATT TTTGGCTTGA TCGCCGCCGA GCCAGACGTT
GCCCGGAAGG GGATCCGTTT ACGTCAGTTT GGTCAAGAGG TGATTGAGCT GCTCGGTGGA
CGTAAAATCC ACCCGGCGTG GGCAGTACCG GGCGGAGTCC GTAGTGGCCT TACCGAGAGT
AGTCGCGACC GAATTCGGTC TGCCGTACCC GAAATGCTGA CGATTGCCCT CGATGCGATG
GCTCGCCTCA AACGGCTCAC CGAGCAATAC CAACGTGAAG TTACAACCTT CGGTGTCTTC
CCCAGCTTAT ACCTCGGCAT GGTTGGGCCT GGTGGCCGCT GGATGCATTA TGGCGGTAAA
CTGCGAATTA TTGACCATAC CGGCAAAATG TTGATCGACG ATCTCGATCC GATTGACTAC
CGCGACGTGA TTGGCGAAGC GGTTGAGCCA TGGAGCTATC TGAAGTTTCC CTACATTCGT
GCATTCGGTT ATCCACAGGG TATGTACCGA GTTGGACCGC TCGCACGGCT TAACGTATGT
GCGTCAATAG GTACACCGCG AGCCGACGCT GAACTACAGG AACTTCGTGA ACAGGTCGGA
CCGGTGATCG AAGGGAGTTT CTACTACCAT CATGCCCGGC TCATCGAGAT CATCGCTGCC
CTTGAACACA TCGAAATGCT GATCGAAGAT GATGATCTCC TTTCACCTAA CCTTCGCGCC
GACGCTGGGG TCAATCGTTA CGAGGCGGTT GGAGTAAGTG AAGCACCTCG CGGTACGCTC
TTCCATCACT ATCGGGTTGA TGAGAAGGGG TTGATTACTG ACGTTAATCT CATTATTGCC
ACCGGTCAAA ATAATCTGGC GATAAACCGC ACAGTTGGTC AGATCGCCCA AAACTACATT
TGCAACGGTC AGTTTGATGA AGGCATTCTC AACCGGATCG AAGCAGGTGT ACGTGCGTAT
GATCCATGCT TAAGTTGTTC GACGCACGCT ATCGGACAGA TGGCGATGGA AGTTGAGCTA
TACGATTCGC ACGGCACACT GGTGGGTCGG CTCAGTCGTT GA
 
Protein sequence
MGQRIVIDPV TRIEGHAKIS IHLDDNGNVA ETRFHVTEFR GFERFCIGRP FWEMPGITAR 
ICGICPVSHL LASARTGDAL LAVQIPPAAE KLRRLMNLGQ IIQSHSLSFF HLSGPDLMLG
FDSEPTQRNI FGLIAAEPDV ARKGIRLRQF GQEVIELLGG RKIHPAWAVP GGVRSGLTES
SRDRIRSAVP EMLTIALDAM ARLKRLTEQY QREVTTFGVF PSLYLGMVGP GGRWMHYGGK
LRIIDHTGKM LIDDLDPIDY RDVIGEAVEP WSYLKFPYIR AFGYPQGMYR VGPLARLNVC
ASIGTPRADA ELQELREQVG PVIEGSFYYH HARLIEIIAA LEHIEMLIED DDLLSPNLRA
DAGVNRYEAV GVSEAPRGTL FHHYRVDEKG LITDVNLIIA TGQNNLAINR TVGQIAQNYI
CNGQFDEGIL NRIEAGVRAY DPCLSCSTHA IGQMAMEVEL YDSHGTLVGR LSR