Gene Rcas_3147 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_3147 
Symbol 
ID5540645 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp4078132 
End bp4079844 
Gene Length1713 bp 
Protein Length570 aa 
Translation table11 
GC content60% 
IMG OID640895268 
Productnickel-dependent hydrogenase large subunit 
Protein accessionYP_001433219 
Protein GI156743090 
COG category[C] Energy production and conversion 
COG ID[COG0374] Ni,Fe-hydrogenase I large subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAAAAA TTGCTATTGA TCCGATCACC CGCATTGAAG GACATCTGCG TATCGAGGCG 
CAGATCGAGC GCGGGCGCGT AGTGGATGCC TGGAGCAGTT CGACGATGTT CCGCGGCATG
GAGATCGTCC TGCGCGGACG CGATCCGCGC GACGCCTGGG TGTTTGCGCA GCGCATCTGC
GGCGTCTGCA CGACCGTCCA TGCGCTTGCA TCGGTGCGCG CCGTTGAAAA CGCGCTCGAT
ATTCAGATAC CCGACAATGC CCGGCTTATC CGCAACATCA TCGCAGGCGC CCAATATGTG
CAAGACCATG TCATCCACTT CTACCACCTG CACGCCTTAG ACTGGGTAGA TATCGTGAGT
GCGCTCAAGG CCGATCCGGT CAAAACATCG GAACTGGCGC AGAGCATTTC CGACTGGCCC
AAATCGTCGC CCGCCTACTT CAAGGGTGTC CAGGACCGAT TGCAAAAGTT CGTTGACAGC
GGGCAATTGG GGATTTTTGG CAACGCCTAT TGGGGGCATC CGGCGTATGC GCTGCCGCCT
GAAGCCAATC TGATGGCCGT GGCGCACTAC CTGGAAGCGC TGGAGTGGCA GAAGGACGTC
ATCAGAATTC ACGCGATTCT GGGCGGTAAG AATCCCCACC CGCAGACATA TCTCGTCGGC
GGGATGGCAG CGCCGCTCGA CCCGAATGCG CAGCAGGCGA TCAACACCAT CCGCATCGCG
CAGTTGAAAA TGCTCGCCGA TCAGGTGCGC ACGTTTGTGG GCAAGGTCTA CATTCCCGAT
ATTCTGGCTA TCGCATCATT CTACAAAGAC TGGGCAGGGC TTGGCGCTGG CGTGGGCAAC
TATCTGTCGT ATGGCGACTT CCCGGCTGCG AAAGATGGCA ACGTCGCCAG TTACTGGCTG
CCGCGCGGTG TGATTGTGAA CAAGAATATC GACCAGAAGC CGCAACCGGT GAACCACGAG
CGCGTGACCG AATATGTTGC GCATTCCTGG TTCCGCTATG GCGAGGGCGA TCAGCAGGCG
CTCCATCCCT GGAAAGGTGA GACCATCCCG AATTACACCG GTCCTCAACC GCCTTACGAC
TGGCTCAACA CCGATGGCAA ATATTCCTGG CTCAAAACGC CGCGCTACGA CGACATGCCG
ATGGAAGTCG GTCCATTGGC GCGTATGCTC GTCGGATACG CTTCCGGTCA GCAACGCATT
CAGGAGTTGG TCAACGCTGC ACTCAAACAG TTGGGAGTTG GTCCGGCGGC GCTCTTCTCG
ACGCTGGGGC GCACAGCGGC GCGCGCCATC GAAACCGCGT TGATCGCCGA ACTGTTGCCG
GGATGGATCG ACGAACTGGC GGCAAATATG GCGGCCGGCA ACCTGGTAGT TCACAACAGC
GCCAAATGGA GTCCGGCGAA CTGGCCCCAG GAAGCGGTTG GTTGGGGATC GATGGAAGCG
CCGCGCGGCT CGCTTGGGCA CTGGGTGCGG ATCAGAGATG GCAAGATCGT CAACTATCAA
GCAGTGGTTC CGACGACGTG GAACGGCTCG CCACGTGATG CGCGCGATGT GCGCGGACCT
TACGAAGCCG CACTGATCGA CACGCCGATT GCCGACCCGG AGCAGCCGAT TGAAATCCTG
CGCACCATTC ATTCATTCGA CCCGTGCATG GCGTGCGCGG TTCACCTGGT GGATGCCCGT
GGCATTGAGA TTACCCGCGT CCGGGTGCAG TGA
 
Protein sequence
MAKIAIDPIT RIEGHLRIEA QIERGRVVDA WSSSTMFRGM EIVLRGRDPR DAWVFAQRIC 
GVCTTVHALA SVRAVENALD IQIPDNARLI RNIIAGAQYV QDHVIHFYHL HALDWVDIVS
ALKADPVKTS ELAQSISDWP KSSPAYFKGV QDRLQKFVDS GQLGIFGNAY WGHPAYALPP
EANLMAVAHY LEALEWQKDV IRIHAILGGK NPHPQTYLVG GMAAPLDPNA QQAINTIRIA
QLKMLADQVR TFVGKVYIPD ILAIASFYKD WAGLGAGVGN YLSYGDFPAA KDGNVASYWL
PRGVIVNKNI DQKPQPVNHE RVTEYVAHSW FRYGEGDQQA LHPWKGETIP NYTGPQPPYD
WLNTDGKYSW LKTPRYDDMP MEVGPLARML VGYASGQQRI QELVNAALKQ LGVGPAALFS
TLGRTAARAI ETALIAELLP GWIDELAANM AAGNLVVHNS AKWSPANWPQ EAVGWGSMEA
PRGSLGHWVR IRDGKIVNYQ AVVPTTWNGS PRDARDVRGP YEAALIDTPI ADPEQPIEIL
RTIHSFDPCM ACAVHLVDAR GIEITRVRVQ