Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RoseRS_1793 |
Symbol | |
ID | 5208752 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | + |
Start bp | 2215273 |
End bp | 2218317 |
Gene Length | 3045 bp |
Protein Length | 1014 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640595401 |
Product | formate dehydrogenase |
Protein accession | YP_001276133 |
Protein GI | 148655928 |
COG category | [C] Energy production and conversion |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.546724 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0478269 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTGACT CACTGGTGAA GAACATCCTG CGCACACTGG CGCAGCGGCA GCGTGCGGAA GCGCAACAGG AAGTCGTTGC GCGCCAACCG GCGACACAGC CCGGCGCCGT CGAACGGAGT GTGCCACAAC CGGCGCCAGC GCATGCCGTG CAAGGAGCGA CTGCCGAATT GAGCGGCTAC CCGCCGGTTG AGCGCTGGCA GCACTGGACG GAGTATGACC CGAAGGCATG GCCCCAGAAA ATCTCGCGTT CCTATACGCT GGTTCCGACG ATCTGCTTCA ACTGCGAAAG CGCCTGTGGA CTCCTGGCGT ATGTCGATAC ATCCACGCTC AAGATTCAGA AATTCGAGGG CAATCCATTA CATCCCGGCA GTCGTGGTCG CAATTGCGCA AAAGGACCGG CAACGCTCAA TCAGGTGTAT GACCCGGATC GCATTCTCTA CCCGCTCAAA CGAGTCGGGC GGCGTGGCGA AGGGAAATGG AAGCGTGTCA GTTGGGACGA GGCGCTCGAT GATATTGCGG CGCGCATCCG CTGTGCCATC GTCGAAAAAC GCCTGACCGA AATCATGTAC CACGTCGGTC GTCCGGGGCA CGATGGCATC ATGGAATGGG TACTGCCCGC CTGGGGGGTG GATGCCCACA ATTCGCACAC GAATGTCTGC TCGTCGAGTG CGCGTTGTGG ACAGGCGTTG TGGATGGGGT ATGATCGCCC TTCGCCCGAT CATGCGCACG CACGAGTCAT TCTGCTGATC AGTTCGCACC TGGAGACCGG ACACTACTTC AACCCGCACG CGCAGCGCAT TATGGAAGGG AAAATGGCAG GCGCGAAACT GATTGTCTTC GACACGCGCC TGTCGAACAC CGCATCGCTG GCTGATGAGT GGATCGCCCC CTGGCCCGGC AGTGAAACCG CGATTTTGCT GGCGATCGCC CGGCATCTGA TCGTCACCCG GAAATATGAC CGCACCTTTG TGCGTCGCTG GGTCAACTGG GAAGAATACC TGCGCCACGA GCATCCCGAT CTGCCGCTGC GTTTCGAGAC GTTTGAGGCG AAACTGGAAG AACTCTATGC CCGCTACACG TTCGAGTTTG CGGCGCAGGA GAGCGGCGTG AGCGTCGAGC AGATCGCGCG GGTAGCGGAT TACATTGCGC AGTGCGATGG TCGCCTTGCG ACCCATACCT GGCGGAGCGC CACCAGCGCC AATCTGGGTG GATGGATGGT CGCTCGCTGT CTCTGGTTCC TGAATGTGCT GACCGGATCG ATAGGCAAGG AAGGCGGCAC ATCGGCGAAC GTGTGGGACA AATGGGTTCC GCGCCATCCG AATATGGCGC CGCACGTCAA AGTGTGGAAC GAACTGACCT GGCCCCAGGA ATACCCGCTC AGCTTCTACG AACTGAGTTA TCTTCTCCCG CACTTTCTCA AAGAACAGCG CGGCAAGATC GACGTTTACT TCACCCGTGT GTACAACCCG CTCTGGACCA ATCCCGACGG CATGAGCTGG ATGGAAGTGT TGACCGACGA GTCGAAGATC GGGTTGCATG TACACCTTTC GCCGGGCTGG AGCGAGACGG GTCTGTTCGC CGATTATATT CTGCCGATGG GTCATGGCGC CGAGCGCCAC GATATGATGA GTCAGGAAAC CCATGGCGGT TGCTGGATCG CATTTCGCCA GCCAGTGATC CGCGAGGCGC TGCGCCGTCT GGGCAGACCG GTCGATGACA CGCGCAAGGC GAATCCCGGC GAAGTATGGG AAGAGACCGA GTTCTGGATC GAACTGTCGT GGCGGATTGA CCCGGACGGC AGTCTGGGAA TTCGCAAAAC ATTCGAAAGC CCGTATCGAC CCGGCGAGAA GATCACCGTG GATGAACTCT ACGGCTGGAT GTTCGAGAAC CATGTGCCCG GATTGCCCGA AGCGGCGGCG AAGGAAGGGT TGACGCCGCT GGAATATATG CGTCGCTATG GTGCGTTTGA AATACGCAAA GGCGTTCAGC CGACCTATGA TCAACCCCTG GCTGAATCGG AAGTGGAAGG TGTAACGGTC GATCCGGAAA CACGCGTGGT CTATACAAAG AAGCCGGCTG CTCCCTCATC GAACATCACG CCGGTGCCAT ACTTCCAGCC GGACCCGGAG CGTGGCCGTC CGATTGGGGT GCAACTCGAA GATGGCTCAC TCGTGTCTGG CTTCCCTACG CCGTCACGCA AACTGGAGTT TTATTCCACC ACCATGCGCC ATTGGGGATG GCCCGAATAT GCAATCCCGA CATACGTTCA CAGCCACGTG CATCCCAGCA AGATCGACCG CGAGCGAAAC GAAGCGATTC TGCTCTCGAC GTTCCGTCTA CCGACGCTGA TCCACACTCG CAGCGGTAAC GCGAAGTGGC TCTATGAGAT CAGTCACAAG AATCCGGTCT GGATCCACCC CTCCGATGCG CAGCGACTTG GCGTGCAGAC CGGTGACCTG ATCAAAGTCG TCACCTCGAT AGGGTACTTT GTCGATCGGG TCTGGGTGAC GGAAGGCATT CGTCCGGGGG TCATCGCCTG TTCCCACCAT CTTGGGCGCT GGCGACTCCG GGAGGATGAA GGGGGGAAAC TTTCGACGGC GCTGGTCGAA CTTGCGCCTG CGGGTGAGTC GCGGTGGCAG ATGCGCCAGG TGCACGGAAT CAAACCGTAT GCGAGCGCTG ACCCCGATAC CGAACACATC TGGTGGGAAG ATGCTGGTGT TCATCAGAAT CTGACCTTCG CCGTTCAGCC GGATCCGGTC AGCGGCATGC ACTGCTGGCA TCAGAAAGTG CGTCTGGAGC GGGCAGGACC GGACGACCGG TACGGTGACA TTTTTGTCGA TACCCGGCGC GCGCACGAGG TGTACCGGGA ATGGCTGGCA ATGACCCGCC CGGCGAGTCA GGTATCACCG AACGGTCTCC GGCGACCGCA CTGGTGGCTC CGTCCATTCC GCCCCGATCT GGAAGCCTAT TATCTTCCCG GTCGAGACCC CGGCAACGGG CATATCGCCG TCCATGCGGT ATCAGCTTCC GATCATAAGA AATGA
|
Protein sequence | MADSLVKNIL RTLAQRQRAE AQQEVVARQP ATQPGAVERS VPQPAPAHAV QGATAELSGY PPVERWQHWT EYDPKAWPQK ISRSYTLVPT ICFNCESACG LLAYVDTSTL KIQKFEGNPL HPGSRGRNCA KGPATLNQVY DPDRILYPLK RVGRRGEGKW KRVSWDEALD DIAARIRCAI VEKRLTEIMY HVGRPGHDGI MEWVLPAWGV DAHNSHTNVC SSSARCGQAL WMGYDRPSPD HAHARVILLI SSHLETGHYF NPHAQRIMEG KMAGAKLIVF DTRLSNTASL ADEWIAPWPG SETAILLAIA RHLIVTRKYD RTFVRRWVNW EEYLRHEHPD LPLRFETFEA KLEELYARYT FEFAAQESGV SVEQIARVAD YIAQCDGRLA THTWRSATSA NLGGWMVARC LWFLNVLTGS IGKEGGTSAN VWDKWVPRHP NMAPHVKVWN ELTWPQEYPL SFYELSYLLP HFLKEQRGKI DVYFTRVYNP LWTNPDGMSW MEVLTDESKI GLHVHLSPGW SETGLFADYI LPMGHGAERH DMMSQETHGG CWIAFRQPVI REALRRLGRP VDDTRKANPG EVWEETEFWI ELSWRIDPDG SLGIRKTFES PYRPGEKITV DELYGWMFEN HVPGLPEAAA KEGLTPLEYM RRYGAFEIRK GVQPTYDQPL AESEVEGVTV DPETRVVYTK KPAAPSSNIT PVPYFQPDPE RGRPIGVQLE DGSLVSGFPT PSRKLEFYST TMRHWGWPEY AIPTYVHSHV HPSKIDRERN EAILLSTFRL PTLIHTRSGN AKWLYEISHK NPVWIHPSDA QRLGVQTGDL IKVVTSIGYF VDRVWVTEGI RPGVIACSHH LGRWRLREDE GGKLSTALVE LAPAGESRWQ MRQVHGIKPY ASADPDTEHI WWEDAGVHQN LTFAVQPDPV SGMHCWHQKV RLERAGPDDR YGDIFVDTRR AHEVYREWLA MTRPASQVSP NGLRRPHWWL RPFRPDLEAY YLPGRDPGNG HIAVHAVSAS DHKK
|
| |