Gene Information |
Locus tag | Rcas_1463 |
Symbol | |
ID | 5538937 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 1865472 |
End bp | 1868504 |
Gene Length | 3033 bp |
Protein Length | 1010 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640893601 |
Product | Fe-S-cluster-containing hydrogenase components 1-like protein |
Protein accession | YP_001431576 |
Protein GI | 156741447 |
COG category | [C] Energy production and conversion |
COG ID | [COG0437] Fe-S-cluster-containing hydrogenase components 1 |
TIGRFAM ID | |
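
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length counts the terminal stop codon; the function names are hypothetical and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length(1865472, 1868504)
print(length)                  # 3033
print(protein_length(length))  # 1010
```

Both values match the Gene Length (3033 bp) and Protein Length (1010 aa) rows of the record.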
| |
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.210969 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACTATGA GCACCACTTC CCAGGATTTG AACGCGCTGC GCGCGCGTCT GGCACAAGCC GAAGGGCGAG AATTCTGGCG TAGCCTCGAT GAGTTGGCTG ATACGCCTGC GTTCAATGAA CTGCTGAAGC GCGAATTTCC GCGCGGCGCC GCTGAATGGC GCGACCCGGC GAGCCGGCGC AATTTCCTCA AACTTATGGG CGCCTCGCTG GCGCTCGCCG GTCTGTCGGG GTGTCAGTTC GCGCTTAAGC AGCCACAGGA AAAGATCGTT CCTTATGTGC GTCAGCCGGA GGAGATTATC CACGGTAGAC CCCTTTTCTT CGCTACTGCC GTCACATTCG CCGGCTTCGG CGTCGGTCTG CTTGTTGAAA GCCACGAGGG GCGCCCGACG AAAATCGAGG GAAACCCCGA TCATCCGGCG TCACTCGGTT CGACCGACCT GATCACGCAG GCGATGATTC TGACGATGTA CGACCCGGAT CGCTCGCAGG CGCCGACCAA CGCCGGACAG GAGACGACAT GGGATGCCTT CGTCGCCGCT GCAACGGCTG CGATGCAGGC GCAGACGGCA AAACAGGGCG CCGGGTTGCG CGTCCTCTCC GGGTCGCTTA CCTCGCCAAC GCTGATTGCG CAAAAGCAAC AATTGCTGAC GCAGTTTCCG CAGGCGAAGT GGTATGAGTA TGAACCGGTC GGGCGCGACA ATGCCAATGC CGGCGCACGG CTGGCGTTTG GCGCGGATGT GCATACGATC TATCGCCTCG ATACGGCGAA GGTGATCGTC GGGTTCGATG CCGATTTTAC CGCCCCGTCG CCGACGGGGG TGCGCATGGC GCGCCAGCTT GCCGATGGCC GCCGCATTCG CAAAGGGACG AAAGAGGTCA ATCGGTTGTA CCTCGCCGAG AGCACGCCGT CGATCACCGG TCTGCTCGCC GATCATCGCC TGCCGGTGCG CTCGTCGCAG ATTGAACATC TGGTGCGCGC GCTGGCGACC CTCGTCGGTG TGCCGAATGT GGCGGCCGGC GCTCCTCTGA GCGATACGGA GAAGAAATGG GTTGAGGCAG CGGCAAAGGA CCTTCAGGCG AATCGCGGGG CGTGCGTGGT GCTGGTTGGC GAAAGTCAGC CGCCGGTCGT CCACGCGCTC GGTCACGCGA TCAATGCGCA ACTCGGCAAT GTCGGCAGCA CAGTGGTGTA CACCGAGCCG GTTGAGGACG ATCCATCTGG CGGTATTGCC GCCCTGAGCG CCTTGACGCA GGAAATGAAC GCCGGGACGG TCGAGGTGTT GCTGATGATC GAGAGCAACC CGGTGTACAA TGCGCCTGCC GACATTCCGT TTGCTGAGGC GCTGGCGAAA GTGCCGCTCA GCATGCACGT CGGTCTCTAC CGTGATGAAA CCGCGCAGCA GAGCGTTTGG CACATCAATG GCGCGCACTT CCTGGAAGCC TGGGGCGATG TGCGCGCTTT CGATGGGACG ACGACGATTG TGCAACCGCT GATAGCTCCG CTGTACAACG GCAAGTCGGC AATCGAAGTG CTCAATGTGC TGCTCGGCAA GCCGCAGGAG ACCGGTTATC AGACGCTGAC CGCCTACTGG CAGACGCAGG ATGCGAGCGG CAATTTCCGC GTCTTCTGGA ATACGGCGTT GCACGATGGT GTGATTACTG CTACACAGGC GCGCAGTCGC CAGGTGACGC TCCAGCAGGG TTTTGCCGAT GCTGCGCCGC CGGCGCCGAC GCAGGGATTG GAAATTGTGT TTCGCCCCGA TCCGTCGCTG TGGGACGGTG CGTTCGCCAA TAATGCCTGG 
CTCCAGGAGA CCCCTAAGCC GTATACCAAA TTGACGTGGG ATAATGTCGC GCTGATGAGC GTTCGCACCG CAAACGCGCT TGGGCTTAAG AATGGTGATG TGGTGCGGTT GACGTACCAG GGGCGCTCGG TGGATGCACC GGTTTGGGTG CAGCCGGGGC ACGCCGACGA TTCGGTGACG GTGCATTTCG GATTTGGGCG CACGGCTGCC GGAAGAGTTG GCAACAATGT TGGGTTCAAC GCTTATCGCC TGCGCACCAG CGCAACGCCG TGGTTCGGTG TTGGGTTGGA GGTGGCGAAA GTCGGCGAGA ACTATAAACT GGCAAGCACC CAGGGGCACT TCCTGATGGA AGGGCGCAAG AAGGACCTGG TGCGCTATGG CACGCTCGCC GAGTATGTCG AGGACGAGAA GTTCCTTCAG GTCGAAAAGG AAGAGCCAAT CTCGCTGATC GGCGAGTATG AGTACAACGG CTATAAGTGG GGCATGTCGA TCGACCTGAA TGTGTGTAAC TCGTGCAACG CCTGTGTGGT CGCATGCCAG TCGGAGAACA ACATTCCGGT GGTCGGCAAA GACGAAGTCT GGCTTGGGCG CGAAATGCAC TGGATCCGTA TCGACCAGTA TTACGTCGGT GATGAGCATA CTCCGAACGT CTATAACATG GTGATGCTCT GCCAGCAGTG CGAGCACGCG CCGTGCGAAA TTGTCTGCCC GGTCGCTGCG ACCGTCCACG ACGCGGAAGG GTTGAACAAT ATGGTGTATA ACCGCTGCGT CGGCACCAAG TACTGCTCGA ACAACTGCCC GTACAAAGTG CGTCGGTTCA ATTTCCTTCA GTATCAGGAC GTGCCATACC GTTCGCCGAT CGACGCCTCG ACCGAGAATG ACAGCATCCC GGTGCTCAAA ATGATGCGCA ACCCGGATGT GACGGTGCGC GCGCGCGGTG TGATGGAAAA ATGCACGTTC TGCGTCCAGC GCATCAATGA GGCGCGCATC CAGGCGCGCA CAGAGAATCG ACGCATCGCC GACGGCGAGA TTATGACTGC GTGCCAGCAG GTGTGCCCGA CGCAGGCAAT TGTCTTCGGC GACCTGAACG ATCCGCAGGC GCGGGTTGTG GACCTGAAGG AACAACCGCT GAAGTATACC TCGCTCGATA AACTGAACAC CAAACCACGG GTCAGTTATC TGGCGAAGAT CAAGAATCTG AACCCCGATC TCGCAGAGGA GAAAACGGCA TAA
|
Protein sequence | MTMSTTSQDL NALRARLAQA EGREFWRSLD ELADTPAFNE LLKREFPRGA AEWRDPASRR NFLKLMGASL ALAGLSGCQF ALKQPQEKIV PYVRQPEEII HGRPLFFATA VTFAGFGVGL LVESHEGRPT KIEGNPDHPA SLGSTDLITQ AMILTMYDPD RSQAPTNAGQ ETTWDAFVAA ATAAMQAQTA KQGAGLRVLS GSLTSPTLIA QKQQLLTQFP QAKWYEYEPV GRDNANAGAR LAFGADVHTI YRLDTAKVIV GFDADFTAPS PTGVRMARQL ADGRRIRKGT KEVNRLYLAE STPSITGLLA DHRLPVRSSQ IEHLVRALAT LVGVPNVAAG APLSDTEKKW VEAAAKDLQA NRGACVVLVG ESQPPVVHAL GHAINAQLGN VGSTVVYTEP VEDDPSGGIA ALSALTQEMN AGTVEVLLMI ESNPVYNAPA DIPFAEALAK VPLSMHVGLY RDETAQQSVW HINGAHFLEA WGDVRAFDGT TTIVQPLIAP LYNGKSAIEV LNVLLGKPQE TGYQTLTAYW QTQDASGNFR VFWNTALHDG VITATQARSR QVTLQQGFAD AAPPAPTQGL EIVFRPDPSL WDGAFANNAW LQETPKPYTK LTWDNVALMS VRTANALGLK NGDVVRLTYQ GRSVDAPVWV QPGHADDSVT VHFGFGRTAA GRVGNNVGFN AYRLRTSATP WFGVGLEVAK VGENYKLAST QGHFLMEGRK KDLVRYGTLA EYVEDEKFLQ VEKEEPISLI GEYEYNGYKW GMSIDLNVCN SCNACVVACQ SENNIPVVGK DEVWLGREMH WIRIDQYYVG DEHTPNVYNM VMLCQQCEHA PCEIVCPVAA TVHDAEGLNN MVYNRCVGTK YCSNNCPYKV RRFNFLQYQD VPYRSPIDAS TENDSIPVLK MMRNPDVTVR ARGVMEKCTF CVQRINEARI QARTENRRIA DGEIMTACQQ VCPTQAIVFG DLNDPQARVV DLKEQPLKYT SLDKLNTKPR VSYLAKIKNL NPDLAEEKTA
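
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a sketch under stated assumptions, not the database's own method: it uses the amino-acid assignments of NCBI translation table 11 (the table listed in this record), reads the first codon as methionine when it is one of the common bacterial start codons ATG, GTG or TTG (a simplification of the full initiator set), and stops at the first stop codon. The names `CODON_TABLE`, `START_CODONS` and `translate` are hypothetical.

```python
from itertools import product

# Amino acids of NCBI translation table 11, in the conventional TCAG codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: only the most common alternative start codons are handled.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str) -> str:
    """Translate a CDS, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        # An initiator codon is read as methionine whatever it normally encodes.
        if i == 0 and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, spaced as in the record.
print(translate("GTGACTATGA GCACCACTTC CCAGGATTTG"))  # MTMSTTSQDL
```

The output reproduces the first ten residues of the protein sequence. The gene begins with GTG rather than ATG, which is why the start-codon rule is needed to obtain the leading M.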