Gene Information |
Locus tag | RoseRS_4140 |
Symbol | |
ID | 5211124 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | + |
Start bp | 5183855 |
End bp | 5186887 |
Gene Length | 3033 bp |
Protein Length | 1010 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640597729 |
Product | Fe-S-cluster-containing hydrogenase components 1-like protein |
Protein accession | YP_001278434 |
Protein GI | 148658229 |
COG category | [C] Energy production and conversion |
COG ID | [COG0437] Fe-S-cluster-containing hydrogenase components 1 |
TIGRFAM ID | |
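The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon that is not counted in the protein length; the function names are illustrative and are not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values copied from the record for RoseRS_4140.
length = gene_length_bp(5183855, 5186887)
print(length)                     # 3033
print(protein_length_aa(length))  # 1010
```

Both results agree with the Gene Length (3033 bp) and Protein Length (1010 aa) fields.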
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACTATGA ACACCAAGTC CCAGGATGTG AACGCATTGC GTGCGCGCCT TGCGAATGCC GAGGGGCGCG AGTTCTGGCG CAGCCTCGAT GAACTGGCTG ATACGCCGGA GTTCAACGAA CTGCTGAAGC GTGAGTTCCC GCACGGCGCC GCTGAATGGC GCGACCCGGT GAGCCGGCGC AATTTTCTCA AGTTGATGGG TGCGTCGCTG GCGCTCGCGG GTCTGTCGGG GTGTCAGTTC GCGCTGAAGC AACCGCAGGA AAAGATCGTT CCCTATGTGC GCCAGCCGGA GGAGATTATT CACGGCAAAC CGCTGTTTTT TGCCACCGCG GCGACGTTCG CTGGCTTTGG CACGGGGGTG CTGGTCGAAA GCCACGAGGG GCGCCCGACT AAGATTGAGG GGAATCCCGA CCATCCTGCG TCGCTTGGGG CGACTGACCT GATCACGCAG GCGATGATTC TGACCATGTA CGATCCGGAT CGGTCGCAGG TGCCGACAAA TGCCGGGCAG GAGGCGACGT GGGAGTCGTT TGTTGCTGCT GCAACTGCTG CGATTCAGGC GCAGGCGGCG AAACAGGGCG CCGGGTTGCG CATCCTCTCC GGGGCGATCA CCTCACCGAC GCTGATTGCG CAAAAGCAGC AACTGCTGAC GCAATTCCCG CAGGCGAAGT GGTATCAGTA CGAACCGGTC GGTCGTGAGA ATGTCAACGC CGGCGCACGC CTGGCGTTCG GCGAGGATGT GCAGACGATC TATCGCCTCG ATGCGGCGAA GGTGATCGTC GGTTTCGACT CCGACTTTAC GGCGCCGTCG CCGACCGGCG TGCGCATGGC GCGTCAACTC GCCGATGGAC GTCGCATCCG CAAGGGGACG AAGGAGGTCA ACCGGTTGTA CCTGGCGGAA AGCACGCCAT CGATCACCGG TTTGCTTGCC GACCATCGTT TGCCGGTGCG CTCGTCGCAG ATCGAACATC TGGTGCGCGC GCTGGCGATC CTCGTCGGTG TCCCTGATGT GGCTGCGGGC GCTGCTCTCA ACGAGGCGGA GAAGAAATGG ATCGAGGCGG TTGCAAAAGA TGTACAGGCG CACCGCGGAG CGTGCGTCGT GCTGGTCGGC GAAAATCAAC CACCGGTCGT TCATGCGCTC GGTCACGCGA TCAACGCGCA ACTCGGCAAT GTTGGGAGCA CGGTGGTCTA CACTGATCCG GTGGAGAGCG ATCCGTCGGG CGGTATTGCC GCCCTCGGCG CGCTCACCCA GGAAATGAAC GCCGGTACGG TCGAGATGCT GGTAATGCTC GACAGCAACC CGGTCTACAA TGCGCCAGCC GATATTCCGT TCGCCGAGGC GCTGGCAAAA GTGCCGTTGA GCGTTCACGT TGGACTGTAC CGCGATGAGA CCGCGCAGCA GAGCACCTGG CATATCAATG GAACGCACTT CCTCGAAGCG TGGGGGGATG TGCGCGCCTT CGACGGCACG GTGACAATTG TTCAACCGCT GATTGCCCCG CTGTACAACG GAAAGTCGGC GATTGAGACG CTCAATGTGC TGCTCGGCAA GCCGCAGGAG ACCGGCTACC AGACGCTGAC CGCGTACTGG CAGACGCAGG ATTCGAGCGG CAATTTCCGC GTCTTCTGGA ATCAGGCGTT GCATGATGGC ATTATTCCCG GCACCCAGGC GCAGGCACGC CAGGTGACGC TCCAGCGCGG GTTTGCCAGC GCTGCGCCAT CAGCGCCAGC GCAGGGGCTG GAAATCGTGT TCCGCCCCGA TCCGTCGATA TGGGACGGTG CGTTTGCCAA CAATGCCTGG 
TTGCAGGAAG TCCCCAAGCC ATATACCAAA CTGACGTGGG ATAATGTCGC TATGATGAGT GCGCGCACCG CGAATGCGCT ACGCCTCAAG AATGGCGATG TCGTGCGGTT GACGTACCAG GGGCGCTCGG TGGATGCGCC GGTGTGGGTG CAACCGGGCC ACGCCGACGA CTCGGTGACG GTGCATCTCG GCTTCGGGCG CACGGCTGCC GGGCGGGTCG GCAATAATGT CGGGTTCAAC GCCTATCGCC TGCGTACCAG CGCGACGCCC TGGTTCGGCG TCGGTCTGGA AGTGGCGAAG GTGGGTGAAA ACTACAAGCT GGCGAGCACC CAGGGTCACT TCCTGATGGA GGGACGCAAG AAAGACCTGG TGCGGTACGG GACGCTCGCT GAGTATGTGG AAAACGAGAA GTTCCTGCAG GTCGAAAAGA AGGAGCCAAT CTCGCTCATC GGCGAGTATG AGTACAACGG CTACAAGTGG GGTATGTCGA TCGACCTGAA TGTGTGCAAC TCGTGCAATG CCTGTGTCGT TGCCTGCCAG TCGGAGAACA ACATTCCGGT CGTCGGCAAG GACGAAGTCT GGCTTGGGCG CGAAATGCAC TGGATCCGGA TCGACCAGTA CTACGTCGGC GACGAGCACA CGCCAAACGT CTACAACATG GTCATGCTGT GCCAGCAGTG TGAACATGCG CCGTGCGAGA TCGTCTGCCC CGTCGCTGCC ACAGTTCACG ATGCCGAAGG GTTGAACAAC ATGGTGTACA ACCGCTGCGT CGGCACCAAG TACTGCTCGA ACAACTGCCC GTACAAAGTG CGCCGGTTCA ACTTCCTGCA ATATCAGGAT GTGCCGTACC GCTCGCCAAT CGATGCGTCG ACCGAGAACG ACAGCATTCC GGTGCTCAAG ATGATGCGCA ACCCGGATGT GACCGTGCGT GCGCGTGGTG TGATGGAAAA ATGCTCGTTC TGCGTCCAGC GCATCAACGA AGCGCGCATC GAGGCGCGCA AGGAGAATCG ACGCATCACC GACGGCGAGG TTGTGACGGC ATGCCAGCAG GTCTGCCCGA CACAGGCGAT CGTCTTTGGC GACCTGAATG ATCCGCAGGC GCGGGTTGTG ACCTTGAAGG ATCAACCGCT GAAGTACACA TCGCTCGATA AACTCAATAC AAAACCGCGT GTGAGTTATC TGGCAAAGAT CAAGAATCTG AACCCCGACC TCGCAGAGGA AAAGAAGGCA TAG
|
Protein sequence | MTMNTKSQDV NALRARLANA EGREFWRSLD ELADTPEFNE LLKREFPHGA AEWRDPVSRR NFLKLMGASL ALAGLSGCQF ALKQPQEKIV PYVRQPEEII HGKPLFFATA ATFAGFGTGV LVESHEGRPT KIEGNPDHPA SLGATDLITQ AMILTMYDPD RSQVPTNAGQ EATWESFVAA ATAAIQAQAA KQGAGLRILS GAITSPTLIA QKQQLLTQFP QAKWYQYEPV GRENVNAGAR LAFGEDVQTI YRLDAAKVIV GFDSDFTAPS PTGVRMARQL ADGRRIRKGT KEVNRLYLAE STPSITGLLA DHRLPVRSSQ IEHLVRALAI LVGVPDVAAG AALNEAEKKW IEAVAKDVQA HRGACVVLVG ENQPPVVHAL GHAINAQLGN VGSTVVYTDP VESDPSGGIA ALGALTQEMN AGTVEMLVML DSNPVYNAPA DIPFAEALAK VPLSVHVGLY RDETAQQSTW HINGTHFLEA WGDVRAFDGT VTIVQPLIAP LYNGKSAIET LNVLLGKPQE TGYQTLTAYW QTQDSSGNFR VFWNQALHDG IIPGTQAQAR QVTLQRGFAS AAPSAPAQGL EIVFRPDPSI WDGAFANNAW LQEVPKPYTK LTWDNVAMMS ARTANALRLK NGDVVRLTYQ GRSVDAPVWV QPGHADDSVT VHLGFGRTAA GRVGNNVGFN AYRLRTSATP WFGVGLEVAK VGENYKLAST QGHFLMEGRK KDLVRYGTLA EYVENEKFLQ VEKKEPISLI GEYEYNGYKW GMSIDLNVCN SCNACVVACQ SENNIPVVGK DEVWLGREMH WIRIDQYYVG DEHTPNVYNM VMLCQQCEHA PCEIVCPVAA TVHDAEGLNN MVYNRCVGTK YCSNNCPYKV RRFNFLQYQD VPYRSPIDAS TENDSIPVLK MMRNPDVTVR ARGVMEKCSF CVQRINEARI EARKENRRIT DGEVVTACQQ VCPTQAIVFG DLNDPQARVV TLKDQPLKYT SLDKLNTKPR VSYLAKIKNL NPDLAEEKKA
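The relationship between the gene sequence and the protein sequence can be illustrated by translating the first few codons. The sketch below is a simplified stand-in for translation table 11, not a full implementation: it builds the standard codon table (table 11 assigns the same amino acids to codons as the standard code) and assumes that the first codon of the CDS is a valid start codon, so it is always read as Met. That assumption is what turns the initial GTG into M here. The names `CODON_TABLE` and `translate_cds` are hypothetical.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (first, second, third base each cycling through TCAG).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(dna: str) -> str:
    """Translate a CDS, stopping at the first stop codon.

    Whitespace (such as the 10-base blocks used in the record) is ignored.
    Assumption: the first codon is a start codon and is translated as 'M'.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append("M" if pos == 0 else amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in the record.
print(translate_cds("GTGACTATGA ACACCAAGTC CCAGGATGTG"))  # MTMNTKSQDV
```

The output matches the first block of the protein sequence. The final GTG in the fragment is read as Val because only the initiating codon is treated as Met.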
|
| |