Gene Rcas_1463 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_1463 
Symbol 
ID5538937 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp1865472 
End bp1868504 
Gene Length3033 bp 
Protein Length1010 aa 
Translation table11 
GC content60% 
IMG OID640893601 
ProductFe-S-cluster-containing hydrogenase components 1-like protein 
Protein accessionYP_001431576 
Protein GI156741447 
COG category[C] Energy production and conversion 
COG ID[COG0437] Fe-S-cluster-containing hydrogenase components 1 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.210969 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACTATGA GCACCACTTC CCAGGATTTG AACGCGCTGC GCGCGCGTCT GGCACAAGCC 
GAAGGGCGAG AATTCTGGCG TAGCCTCGAT GAGTTGGCTG ATACGCCTGC GTTCAATGAA
CTGCTGAAGC GCGAATTTCC GCGCGGCGCC GCTGAATGGC GCGACCCGGC GAGCCGGCGC
AATTTCCTCA AACTTATGGG CGCCTCGCTG GCGCTCGCCG GTCTGTCGGG GTGTCAGTTC
GCGCTTAAGC AGCCACAGGA AAAGATCGTT CCTTATGTGC GTCAGCCGGA GGAGATTATC
CACGGTAGAC CCCTTTTCTT CGCTACTGCC GTCACATTCG CCGGCTTCGG CGTCGGTCTG
CTTGTTGAAA GCCACGAGGG GCGCCCGACG AAAATCGAGG GAAACCCCGA TCATCCGGCG
TCACTCGGTT CGACCGACCT GATCACGCAG GCGATGATTC TGACGATGTA CGACCCGGAT
CGCTCGCAGG CGCCGACCAA CGCCGGACAG GAGACGACAT GGGATGCCTT CGTCGCCGCT
GCAACGGCTG CGATGCAGGC GCAGACGGCA AAACAGGGCG CCGGGTTGCG CGTCCTCTCC
GGGTCGCTTA CCTCGCCAAC GCTGATTGCG CAAAAGCAAC AATTGCTGAC GCAGTTTCCG
CAGGCGAAGT GGTATGAGTA TGAACCGGTC GGGCGCGACA ATGCCAATGC CGGCGCACGG
CTGGCGTTTG GCGCGGATGT GCATACGATC TATCGCCTCG ATACGGCGAA GGTGATCGTC
GGGTTCGATG CCGATTTTAC CGCCCCGTCG CCGACGGGGG TGCGCATGGC GCGCCAGCTT
GCCGATGGCC GCCGCATTCG CAAAGGGACG AAAGAGGTCA ATCGGTTGTA CCTCGCCGAG
AGCACGCCGT CGATCACCGG TCTGCTCGCC GATCATCGCC TGCCGGTGCG CTCGTCGCAG
ATTGAACATC TGGTGCGCGC GCTGGCGACC CTCGTCGGTG TGCCGAATGT GGCGGCCGGC
GCTCCTCTGA GCGATACGGA GAAGAAATGG GTTGAGGCAG CGGCAAAGGA CCTTCAGGCG
AATCGCGGGG CGTGCGTGGT GCTGGTTGGC GAAAGTCAGC CGCCGGTCGT CCACGCGCTC
GGTCACGCGA TCAATGCGCA ACTCGGCAAT GTCGGCAGCA CAGTGGTGTA CACCGAGCCG
GTTGAGGACG ATCCATCTGG CGGTATTGCC GCCCTGAGCG CCTTGACGCA GGAAATGAAC
GCCGGGACGG TCGAGGTGTT GCTGATGATC GAGAGCAACC CGGTGTACAA TGCGCCTGCC
GACATTCCGT TTGCTGAGGC GCTGGCGAAA GTGCCGCTCA GCATGCACGT CGGTCTCTAC
CGTGATGAAA CCGCGCAGCA GAGCGTTTGG CACATCAATG GCGCGCACTT CCTGGAAGCC
TGGGGCGATG TGCGCGCTTT CGATGGGACG ACGACGATTG TGCAACCGCT GATAGCTCCG
CTGTACAACG GCAAGTCGGC AATCGAAGTG CTCAATGTGC TGCTCGGCAA GCCGCAGGAG
ACCGGTTATC AGACGCTGAC CGCCTACTGG CAGACGCAGG ATGCGAGCGG CAATTTCCGC
GTCTTCTGGA ATACGGCGTT GCACGATGGT GTGATTACTG CTACACAGGC GCGCAGTCGC
CAGGTGACGC TCCAGCAGGG TTTTGCCGAT GCTGCGCCGC CGGCGCCGAC GCAGGGATTG
GAAATTGTGT TTCGCCCCGA TCCGTCGCTG TGGGACGGTG CGTTCGCCAA TAATGCCTGG
CTCCAGGAGA CCCCTAAGCC GTATACCAAA TTGACGTGGG ATAATGTCGC GCTGATGAGC
GTTCGCACCG CAAACGCGCT TGGGCTTAAG AATGGTGATG TGGTGCGGTT GACGTACCAG
GGGCGCTCGG TGGATGCACC GGTTTGGGTG CAGCCGGGGC ACGCCGACGA TTCGGTGACG
GTGCATTTCG GATTTGGGCG CACGGCTGCC GGAAGAGTTG GCAACAATGT TGGGTTCAAC
GCTTATCGCC TGCGCACCAG CGCAACGCCG TGGTTCGGTG TTGGGTTGGA GGTGGCGAAA
GTCGGCGAGA ACTATAAACT GGCAAGCACC CAGGGGCACT TCCTGATGGA AGGGCGCAAG
AAGGACCTGG TGCGCTATGG CACGCTCGCC GAGTATGTCG AGGACGAGAA GTTCCTTCAG
GTCGAAAAGG AAGAGCCAAT CTCGCTGATC GGCGAGTATG AGTACAACGG CTATAAGTGG
GGCATGTCGA TCGACCTGAA TGTGTGTAAC TCGTGCAACG CCTGTGTGGT CGCATGCCAG
TCGGAGAACA ACATTCCGGT GGTCGGCAAA GACGAAGTCT GGCTTGGGCG CGAAATGCAC
TGGATCCGTA TCGACCAGTA TTACGTCGGT GATGAGCATA CTCCGAACGT CTATAACATG
GTGATGCTCT GCCAGCAGTG CGAGCACGCG CCGTGCGAAA TTGTCTGCCC GGTCGCTGCG
ACCGTCCACG ACGCGGAAGG GTTGAACAAT ATGGTGTATA ACCGCTGCGT CGGCACCAAG
TACTGCTCGA ACAACTGCCC GTACAAAGTG CGTCGGTTCA ATTTCCTTCA GTATCAGGAC
GTGCCATACC GTTCGCCGAT CGACGCCTCG ACCGAGAATG ACAGCATCCC GGTGCTCAAA
ATGATGCGCA ACCCGGATGT GACGGTGCGC GCGCGCGGTG TGATGGAAAA ATGCACGTTC
TGCGTCCAGC GCATCAATGA GGCGCGCATC CAGGCGCGCA CAGAGAATCG ACGCATCGCC
GACGGCGAGA TTATGACTGC GTGCCAGCAG GTGTGCCCGA CGCAGGCAAT TGTCTTCGGC
GACCTGAACG ATCCGCAGGC GCGGGTTGTG GACCTGAAGG AACAACCGCT GAAGTATACC
TCGCTCGATA AACTGAACAC CAAACCACGG GTCAGTTATC TGGCGAAGAT CAAGAATCTG
AACCCCGATC TCGCAGAGGA GAAAACGGCA TAA
 
Protein sequence
MTMSTTSQDL NALRARLAQA EGREFWRSLD ELADTPAFNE LLKREFPRGA AEWRDPASRR 
NFLKLMGASL ALAGLSGCQF ALKQPQEKIV PYVRQPEEII HGRPLFFATA VTFAGFGVGL
LVESHEGRPT KIEGNPDHPA SLGSTDLITQ AMILTMYDPD RSQAPTNAGQ ETTWDAFVAA
ATAAMQAQTA KQGAGLRVLS GSLTSPTLIA QKQQLLTQFP QAKWYEYEPV GRDNANAGAR
LAFGADVHTI YRLDTAKVIV GFDADFTAPS PTGVRMARQL ADGRRIRKGT KEVNRLYLAE
STPSITGLLA DHRLPVRSSQ IEHLVRALAT LVGVPNVAAG APLSDTEKKW VEAAAKDLQA
NRGACVVLVG ESQPPVVHAL GHAINAQLGN VGSTVVYTEP VEDDPSGGIA ALSALTQEMN
AGTVEVLLMI ESNPVYNAPA DIPFAEALAK VPLSMHVGLY RDETAQQSVW HINGAHFLEA
WGDVRAFDGT TTIVQPLIAP LYNGKSAIEV LNVLLGKPQE TGYQTLTAYW QTQDASGNFR
VFWNTALHDG VITATQARSR QVTLQQGFAD AAPPAPTQGL EIVFRPDPSL WDGAFANNAW
LQETPKPYTK LTWDNVALMS VRTANALGLK NGDVVRLTYQ GRSVDAPVWV QPGHADDSVT
VHFGFGRTAA GRVGNNVGFN AYRLRTSATP WFGVGLEVAK VGENYKLAST QGHFLMEGRK
KDLVRYGTLA EYVEDEKFLQ VEKEEPISLI GEYEYNGYKW GMSIDLNVCN SCNACVVACQ
SENNIPVVGK DEVWLGREMH WIRIDQYYVG DEHTPNVYNM VMLCQQCEHA PCEIVCPVAA
TVHDAEGLNN MVYNRCVGTK YCSNNCPYKV RRFNFLQYQD VPYRSPIDAS TENDSIPVLK
MMRNPDVTVR ARGVMEKCTF CVQRINEARI QARTENRRIA DGEIMTACQQ VCPTQAIVFG
DLNDPQARVV DLKEQPLKYT SLDKLNTKPR VSYLAKIKNL NPDLAEEKTA