Gene Rcas_3148 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_3148 
Symbol 
ID5540646 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp4079875 
End bp4081128 
Gene Length1254 bp 
Protein Length417 aa 
Translation table11 
GC content63% 
IMG OID640895269 
Producthydrogenase (NiFe) small subunit HydA 
Protein accessionYP_001433220 
Protein GI156743091 
COG category[C] Energy production and conversion 
COG ID[COG1740] Ni,Fe-hydrogenase I small subunit 
TIGRFAM ID[TIGR00391] hydrogenase (NiFe) small subunit (hydA)
[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTGCTC GTCCTTTGCC GTTGGAGGAA CGCCTTGCAG CGCGCGGTGT TTCGCGCCGT 
CAATTCCTCA AGTTCTGCGC TGCGATGAGC GCCGCGCTCG CCCTGCCTTC CACCTTTACC
CCACGTATCG CCAGGGCGCT GAACACTGCC GAACGTCTTC CGGTTGTCTG GTTGGAGTTT
CAAGATTGCG CCGGCAACAC CGAATCCTTC CTTCGCGCCG AGTCGCCTGG CGTTGCCGAC
ATTGTGCTGG AGCAGATCAG CCTGGAGTAC CACGAGACGA TCATGGCGCC TGCCGGCCAT
CGCGCCGAAC ATTCGCTCGA TGCGGTCGTG GAAAACTATC CGGGGCAATA CATCGCCATC
GTCGAAGGGT CGATCCCCAT TGCCAATGGC GGGGTGTATT GCACGATCGG CGGTCGCACC
GCGTTGAGCA TCGCAGAGCG TGTGTGCTCG AATGCGCTGG CAACGATTGC GGTTGGCGCA
TGTGCCTGGG ATGGCGGTTG GCCCGCCGCC AGCCCGAATC CGACCGGTGC GGTTGGTGTG
CGCCAGGCGG TGCCGGGTCT CAAGAATCTG ATCAACCTGC CGGGCTGCCC GATGAATGTG
ATTAATCTAA CCGCCGTCAT TGTCCACTAC CTGACATTCA AACAACTGCC GGCAACCGAC
GAGCAGGGAC GCCCCTTCTT CGCCTATGGG CAGCTCATTC ACAACAACTG TGAGCGTCGG
GGGCACTTCG ACTCCGGTCG CTTCGTCGAG CGTTGGGGCG ACGAGGGGCA TCGCCTGGGA
TGGTGCCTGT ATAAGATGGG GTGCAAAGGA CCGCAAACGC TCTCGAACTG TCCTGCGGTC
GGCTGGAACG GCACGTCCTA CTGGCCCATC GGCGCCGGTC ACGGATGCGT TGGCTGCATG
TCGCCGCGTT TCTGGGATAC CATGTCGCCT TTCTATGAGC GACTGCCCAA TGTCGAAGGC
TTTGGCATCG AGGTGACCGC CGATACGCTG GGCGCCATTG CGGTTGGCGC CGTGGCGGCG
GCCGGTGTAG TCCACGGCGT TGCCAGCGCA ATCCGGGCGA GTCGTCATCC GATTGCGGCG
CATGGGGGTG AGACGCTGGT GGAAGCCGCA GAGCGCGCCG TTCAGGTGGT AGAGCAGATT
GCAGGACCGG TCAAAGCGGA GGAAAAGCCG GCAGAAGAGG CAGGGAAGCC GATGGCGTCG
GGTGACGCCA GGGAGAGCGT TCGTCCCGAT CGGGATGGAC CGTCCGCCTC CTGA
 
Protein sequence
MPARPLPLEE RLAARGVSRR QFLKFCAAMS AALALPSTFT PRIARALNTA ERLPVVWLEF 
QDCAGNTESF LRAESPGVAD IVLEQISLEY HETIMAPAGH RAEHSLDAVV ENYPGQYIAI
VEGSIPIANG GVYCTIGGRT ALSIAERVCS NALATIAVGA CAWDGGWPAA SPNPTGAVGV
RQAVPGLKNL INLPGCPMNV INLTAVIVHY LTFKQLPATD EQGRPFFAYG QLIHNNCERR
GHFDSGRFVE RWGDEGHRLG WCLYKMGCKG PQTLSNCPAV GWNGTSYWPI GAGHGCVGCM
SPRFWDTMSP FYERLPNVEG FGIEVTADTL GAIAVGAVAA AGVVHGVASA IRASRHPIAA
HGGETLVEAA ERAVQVVEQI AGPVKAEEKP AEEAGKPMAS GDARESVRPD RDGPSAS