Gene RoseRS_4140 details


Gene Information

Locus tag: RoseRS_4140
Symbol:
ID: 5211124
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus sp. RS-1
Kingdom: Bacteria
Replicon accession: NC_009523
Strand:
Start bp: 5183855
End bp: 5186887
Gene Length: 3033 bp
Protein Length: 1010 aa
Translation table: 11
GC content: 61%
IMG OID: 640597729
Product: Fe-S-cluster-containing hydrogenase components 1-like protein
Protein accession: YP_001278434
Protein GI: 148658229
COG category: [C] Energy production and conversion
COG ID: [COG0437] Fe-S-cluster-containing hydrogenase components 1
TIGRFAM ID:
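
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 5183855
end_bp = 5186887

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS includes its stop codon, which does not encode an amino acid.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, codon_count, protein_length_aa)
```

Under these assumptions the computed values agree with the listed Gene Length (3033 bp) and Protein Length (1010 aa).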


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACTATGA ACACCAAGTC CCAGGATGTG AACGCATTGC GTGCGCGCCT TGCGAATGCC 
GAGGGGCGCG AGTTCTGGCG CAGCCTCGAT GAACTGGCTG ATACGCCGGA GTTCAACGAA
CTGCTGAAGC GTGAGTTCCC GCACGGCGCC GCTGAATGGC GCGACCCGGT GAGCCGGCGC
AATTTTCTCA AGTTGATGGG TGCGTCGCTG GCGCTCGCGG GTCTGTCGGG GTGTCAGTTC
GCGCTGAAGC AACCGCAGGA AAAGATCGTT CCCTATGTGC GCCAGCCGGA GGAGATTATT
CACGGCAAAC CGCTGTTTTT TGCCACCGCG GCGACGTTCG CTGGCTTTGG CACGGGGGTG
CTGGTCGAAA GCCACGAGGG GCGCCCGACT AAGATTGAGG GGAATCCCGA CCATCCTGCG
TCGCTTGGGG CGACTGACCT GATCACGCAG GCGATGATTC TGACCATGTA CGATCCGGAT
CGGTCGCAGG TGCCGACAAA TGCCGGGCAG GAGGCGACGT GGGAGTCGTT TGTTGCTGCT
GCAACTGCTG CGATTCAGGC GCAGGCGGCG AAACAGGGCG CCGGGTTGCG CATCCTCTCC
GGGGCGATCA CCTCACCGAC GCTGATTGCG CAAAAGCAGC AACTGCTGAC GCAATTCCCG
CAGGCGAAGT GGTATCAGTA CGAACCGGTC GGTCGTGAGA ATGTCAACGC CGGCGCACGC
CTGGCGTTCG GCGAGGATGT GCAGACGATC TATCGCCTCG ATGCGGCGAA GGTGATCGTC
GGTTTCGACT CCGACTTTAC GGCGCCGTCG CCGACCGGCG TGCGCATGGC GCGTCAACTC
GCCGATGGAC GTCGCATCCG CAAGGGGACG AAGGAGGTCA ACCGGTTGTA CCTGGCGGAA
AGCACGCCAT CGATCACCGG TTTGCTTGCC GACCATCGTT TGCCGGTGCG CTCGTCGCAG
ATCGAACATC TGGTGCGCGC GCTGGCGATC CTCGTCGGTG TCCCTGATGT GGCTGCGGGC
GCTGCTCTCA ACGAGGCGGA GAAGAAATGG ATCGAGGCGG TTGCAAAAGA TGTACAGGCG
CACCGCGGAG CGTGCGTCGT GCTGGTCGGC GAAAATCAAC CACCGGTCGT TCATGCGCTC
GGTCACGCGA TCAACGCGCA ACTCGGCAAT GTTGGGAGCA CGGTGGTCTA CACTGATCCG
GTGGAGAGCG ATCCGTCGGG CGGTATTGCC GCCCTCGGCG CGCTCACCCA GGAAATGAAC
GCCGGTACGG TCGAGATGCT GGTAATGCTC GACAGCAACC CGGTCTACAA TGCGCCAGCC
GATATTCCGT TCGCCGAGGC GCTGGCAAAA GTGCCGTTGA GCGTTCACGT TGGACTGTAC
CGCGATGAGA CCGCGCAGCA GAGCACCTGG CATATCAATG GAACGCACTT CCTCGAAGCG
TGGGGGGATG TGCGCGCCTT CGACGGCACG GTGACAATTG TTCAACCGCT GATTGCCCCG
CTGTACAACG GAAAGTCGGC GATTGAGACG CTCAATGTGC TGCTCGGCAA GCCGCAGGAG
ACCGGCTACC AGACGCTGAC CGCGTACTGG CAGACGCAGG ATTCGAGCGG CAATTTCCGC
GTCTTCTGGA ATCAGGCGTT GCATGATGGC ATTATTCCCG GCACCCAGGC GCAGGCACGC
CAGGTGACGC TCCAGCGCGG GTTTGCCAGC GCTGCGCCAT CAGCGCCAGC GCAGGGGCTG
GAAATCGTGT TCCGCCCCGA TCCGTCGATA TGGGACGGTG CGTTTGCCAA CAATGCCTGG
TTGCAGGAAG TCCCCAAGCC ATATACCAAA CTGACGTGGG ATAATGTCGC TATGATGAGT
GCGCGCACCG CGAATGCGCT ACGCCTCAAG AATGGCGATG TCGTGCGGTT GACGTACCAG
GGGCGCTCGG TGGATGCGCC GGTGTGGGTG CAACCGGGCC ACGCCGACGA CTCGGTGACG
GTGCATCTCG GCTTCGGGCG CACGGCTGCC GGGCGGGTCG GCAATAATGT CGGGTTCAAC
GCCTATCGCC TGCGTACCAG CGCGACGCCC TGGTTCGGCG TCGGTCTGGA AGTGGCGAAG
GTGGGTGAAA ACTACAAGCT GGCGAGCACC CAGGGTCACT TCCTGATGGA GGGACGCAAG
AAAGACCTGG TGCGGTACGG GACGCTCGCT GAGTATGTGG AAAACGAGAA GTTCCTGCAG
GTCGAAAAGA AGGAGCCAAT CTCGCTCATC GGCGAGTATG AGTACAACGG CTACAAGTGG
GGTATGTCGA TCGACCTGAA TGTGTGCAAC TCGTGCAATG CCTGTGTCGT TGCCTGCCAG
TCGGAGAACA ACATTCCGGT CGTCGGCAAG GACGAAGTCT GGCTTGGGCG CGAAATGCAC
TGGATCCGGA TCGACCAGTA CTACGTCGGC GACGAGCACA CGCCAAACGT CTACAACATG
GTCATGCTGT GCCAGCAGTG TGAACATGCG CCGTGCGAGA TCGTCTGCCC CGTCGCTGCC
ACAGTTCACG ATGCCGAAGG GTTGAACAAC ATGGTGTACA ACCGCTGCGT CGGCACCAAG
TACTGCTCGA ACAACTGCCC GTACAAAGTG CGCCGGTTCA ACTTCCTGCA ATATCAGGAT
GTGCCGTACC GCTCGCCAAT CGATGCGTCG ACCGAGAACG ACAGCATTCC GGTGCTCAAG
ATGATGCGCA ACCCGGATGT GACCGTGCGT GCGCGTGGTG TGATGGAAAA ATGCTCGTTC
TGCGTCCAGC GCATCAACGA AGCGCGCATC GAGGCGCGCA AGGAGAATCG ACGCATCACC
GACGGCGAGG TTGTGACGGC ATGCCAGCAG GTCTGCCCGA CACAGGCGAT CGTCTTTGGC
GACCTGAATG ATCCGCAGGC GCGGGTTGTG ACCTTGAAGG ATCAACCGCT GAAGTACACA
TCGCTCGATA AACTCAATAC AAAACCGCGT GTGAGTTATC TGGCAAAGAT CAAGAATCTG
AACCCCGACC TCGCAGAGGA AAAGAAGGCA TAG
 
Protein sequence
MTMNTKSQDV NALRARLANA EGREFWRSLD ELADTPEFNE LLKREFPHGA AEWRDPVSRR 
NFLKLMGASL ALAGLSGCQF ALKQPQEKIV PYVRQPEEII HGKPLFFATA ATFAGFGTGV
LVESHEGRPT KIEGNPDHPA SLGATDLITQ AMILTMYDPD RSQVPTNAGQ EATWESFVAA
ATAAIQAQAA KQGAGLRILS GAITSPTLIA QKQQLLTQFP QAKWYQYEPV GRENVNAGAR
LAFGEDVQTI YRLDAAKVIV GFDSDFTAPS PTGVRMARQL ADGRRIRKGT KEVNRLYLAE
STPSITGLLA DHRLPVRSSQ IEHLVRALAI LVGVPDVAAG AALNEAEKKW IEAVAKDVQA
HRGACVVLVG ENQPPVVHAL GHAINAQLGN VGSTVVYTDP VESDPSGGIA ALGALTQEMN
AGTVEMLVML DSNPVYNAPA DIPFAEALAK VPLSVHVGLY RDETAQQSTW HINGTHFLEA
WGDVRAFDGT VTIVQPLIAP LYNGKSAIET LNVLLGKPQE TGYQTLTAYW QTQDSSGNFR
VFWNQALHDG IIPGTQAQAR QVTLQRGFAS AAPSAPAQGL EIVFRPDPSI WDGAFANNAW
LQEVPKPYTK LTWDNVAMMS ARTANALRLK NGDVVRLTYQ GRSVDAPVWV QPGHADDSVT
VHLGFGRTAA GRVGNNVGFN AYRLRTSATP WFGVGLEVAK VGENYKLAST QGHFLMEGRK
KDLVRYGTLA EYVENEKFLQ VEKKEPISLI GEYEYNGYKW GMSIDLNVCN SCNACVVACQ
SENNIPVVGK DEVWLGREMH WIRIDQYYVG DEHTPNVYNM VMLCQQCEHA PCEIVCPVAA
TVHDAEGLNN MVYNRCVGTK YCSNNCPYKV RRFNFLQYQD VPYRSPIDAS TENDSIPVLK
MMRNPDVTVR ARGVMEKCSF CVQRINEARI EARKENRRIT DGEVVTACQQ VCPTQAIVFG
DLNDPQARVV TLKDQPLKYT SLDKLNTKPR VSYLAKIKNL NPDLAEEKKA
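
The relationship between the two sequences can be illustrated by translating the first 60 and last 33 nucleotides of the gene sequence with translation table 11. This is a minimal sketch, not the database's own method: the helper `translate` and its `is_gene_start` parameter are hypothetical names introduced here, and the alternative start codon set is the one listed for NCBI table 11 (the bacterial, archaeal and plant plastid code). Under table 11 the amino acid assignments match the standard code, but an initiating GTG is read as methionine, which is why the protein begins with M although the gene begins with GTG.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G); table 11 uses the
# same amino acid assignments and differs only in its permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons listed for NCBI translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna, is_gene_start=True):
    """Translate a DNA fragment codon by codon, stopping at the first stop codon.

    Hypothetical helper: if is_gene_start is True and the first codon is a
    table 11 start codon, it is translated as methionine (M).
    """
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and is_gene_start and codon in START_CODONS:
            aa = "M"
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 60 and last 33 nucleotides, copied from the gene sequence above.
first_60 = "GTGACTATGA ACACCAAGTC CCAGGATGTG AACGCATTGC GTGCGCGCCT TGCGAATGCC"
last_33 = "AACCCCGACC TCGCAGAGGA AAAGAAGGCA TAG"

print(translate(first_60))                      # compare with the start of the protein sequence
print(translate(last_33, is_gene_start=False))  # compare with the end of the protein sequence
```

The two outputs correspond to the first 20 and last 10 residues of the protein sequence; the final TAG codon is a stop and contributes no residue.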