Gene Cagg_0471 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_0471 
Symbol 
ID7266639 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp581974 
End bp583611 
Gene Length1638 bp 
Protein Length545 aa 
Translation table11 
GC content60% 
IMG OID643565334 
Productnickel-dependent hydrogenase large subunit 
Protein accessionYP_002461848 
Protein GI219847415 
COG category[C] Energy production and conversion 
COG ID[COG0374] Ni,Fe-hydrogenase I large subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.980581 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.166741 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCAATA GTATCGAGAA TCAGGCAACG ATCACCGGGC GCGATCTCCG CATCAGCCCG 
CTTGGCCGCG TCGAGGGAGA CCTCGACTTG CGGGTCACAA TCCGCGATGG GGTTGTGACC
AGCGCATGGA CTGAGGCCTC GATGTTTCGC GGCTTTGAGA TCATTCTGAA GGGGAAGGAT
CCGCAGGCCG GCTTGATCGT TACGCCGCGT ATTTGCGGCA TCTGCGGCGG CAGCCACTTG
ACCAAAGCGG TCTATGCGCT CGATACGGCA TGGCAAACCG AATTACCGCC CAATGCGACC
CTAATTCGCA ATATTGCGCA AGCTTGCGAG ACGCTGCAAA GCATTCCGCG CTGGTTCTAC
GCTCTGTTTG CGATCGACCT AACCAACAAG AAGTACGCCC ACCTGCCCGA ATACGATGAG
GCGGTGCGCC GGTTCGCCCC CTTTGTCGGC ACGAGCTATG AGCACGGCGT GACTCTCTCG
AATAAGCCGG TCGAGATTTA CGCCATCTTC GGCGGCCAGT GGCCCCACTC CAGCTTCATG
ATCCCCGGCG GCGTGATGTG CGCGCCGACG CTGGCTGATG TCACCCGAGC CATCGCCATC
CTCGAATACT GGAAGGATGA ATGGCTGGAG AAGAAGTGGC TCGGTTGCTC GGTCGATCGC
TGGCTGCAGA ACAAGAGCTG GGCCGATGTG ATGGAGTGGA TGAACGAGAA CGAGCGCCAC
TACAACTCTG ACTGCGGCTT CTTCATCCGT TTTGCGATGG CCGCTGGTCT TGACAAGTAT
GGCGCCGGTT GGAATAACTA CATTGCCACC GGTACCTACT TCCACCCCGA ACTGTACGCC
CGCCCAACCA TCGAGGGGAG AAATGCGGCG CTGATCGCGC GCTCCGGCGT GTATGTCAAC
GGCCAGTTCT ACGATTTCGA TCAGGCCAAC GTGCGCGAAG ACGTGACCCA CTCGTTCTAC
GAGGGGAATC ACGCGCTGCA CCCGTTTGAG GGACGCACTG AGCCAATTGA TCCGGCGATC
GGACATCGAC AAGGGAAGTA CTCGTGGGCG AAAGCGCCAC GCTATCTCAT CCCCGGCGTT
GGCAGTCAGC CGGTCGAGGC CGGCCCACTG GCCCGCCAAG TCATCGCCGG TCGGCCCGGC
GCCGCCAATT GGCAAGACTA CGACCCGCTC TTCCTCGATG CGGTGACGAC GGTCGGGCCG
AGTGTGCTGG TGCGCGTGAT GGCCCGGATG CACGAAGCGC CCAAGTATTA CAAGCTGGTG
CGCAAGTGGC TTGACCAGAT CAATCTGCAC GAGAAGTTTT ATATCAAACC GAAGGAGCTG
CCCGAAGGGC GTGGGTTTGG TTCGACCGAA GCCGCTCGCG GCAGCCTCTC GGACTGGATC
GTGCTTAAGG ATGGTAAGAT CGAGAACTAT CAGGTGGTGA CGCCGACGGC ATGGAACATC
GGGCCGCGCG ATGGCCGCGA TGTCAATGGG CCGATGGAGC AAGCCTTCCT CGGCGCGCCG
ATTGCCGATC CCAACGATCC GGTCGAACTC GGCCATGTGG CGCGGAGCTA CGACTCGTGC
CTCGTCTGTA CAGTGCATGC CTACGACGAA AAGACCGGCA AGGAGTTGGC GCGGTTCCGC
ATTGGTGAAG GCGCGTAG
 
Protein sequence
MVNSIENQAT ITGRDLRISP LGRVEGDLDL RVTIRDGVVT SAWTEASMFR GFEIILKGKD 
PQAGLIVTPR ICGICGGSHL TKAVYALDTA WQTELPPNAT LIRNIAQACE TLQSIPRWFY
ALFAIDLTNK KYAHLPEYDE AVRRFAPFVG TSYEHGVTLS NKPVEIYAIF GGQWPHSSFM
IPGGVMCAPT LADVTRAIAI LEYWKDEWLE KKWLGCSVDR WLQNKSWADV MEWMNENERH
YNSDCGFFIR FAMAAGLDKY GAGWNNYIAT GTYFHPELYA RPTIEGRNAA LIARSGVYVN
GQFYDFDQAN VREDVTHSFY EGNHALHPFE GRTEPIDPAI GHRQGKYSWA KAPRYLIPGV
GSQPVEAGPL ARQVIAGRPG AANWQDYDPL FLDAVTTVGP SVLVRVMARM HEAPKYYKLV
RKWLDQINLH EKFYIKPKEL PEGRGFGSTE AARGSLSDWI VLKDGKIENY QVVTPTAWNI
GPRDGRDVNG PMEQAFLGAP IADPNDPVEL GHVARSYDSC LVCTVHAYDE KTGKELARFR
IGEGA