Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17029_2741 |
Symbol | |
ID | 4897707 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009049 |
Strand | + |
Start bp | 2883545 |
End bp | 2886427 |
Gene Length | 2883 bp |
Protein Length | 960 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640113343 |
Product | formate dehydrogenase, alpha subunit |
Protein accession | YP_001044615 |
Protein GI | 126463501 |
COG category | [R] General function prediction only |
COG ID | [COG3383] Uncharacterized anaerobic dehydrogenase |
TIGRFAM ID | [TIGR01591] formate dehydrogenase, alpha subunit, archaeal-type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.313061 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGGACT TCATCCTTCC CGACGACCGC GACTTCGGCA CCCCCCGCTC GCGCGCGACC GAGACGGTGA CGCTCGAGAT CGATGGCTTC CCGGTGACGG TGCCTGCGGG CACGTCGGTG ATGCGCGCCG CCGCCGAAGC GGGCATTTCG GTGCCGAAGC TCTGCGCGAG CGACACTCTC GACGCCTTCG GCTCCTGCCG GCTCTGCCTC GTCGAGATCG AGGGCCGCGC GGGCACGCCC GCCTCCTGCA CCACGCCGGT GACGCCCGGC ATGAAGGTGC GCACCCAGAC GCCGAAGCTG AAGCAGCTGC GCCGCGGGGT GATGGAGCTC TATATTTCGG ATCACCCGCT CGACTGCCTG ACCTGCGCCG CCAACGGCGA TTGCGAGCTG CAGGACATGG CGGGCGCGGT GGGCCTGCGC GATGTGCGCT ACGAGGCCGT AGAAAATCAT TTCACGCCCC GCAATGCCGG CGGCGATCTC AATCCGCAAT GGATGGTCAA GGACGAGTCG AACCCCTATT TCACCTACGA CCCGTCGAAA TGCATCGTCT GCTCGCGCTG CGTGCGGGCC TGCGAGGAGG TGCAGGGCAC CTTCGCGCTG ACCATCGAGG GCCGGGGCTT CGACAGCCGC GTCTCGGCCG GGATGGCCAG CGACAGTTTC CTCACCTCCG ACTGCGTGAG CTGCGGCGCC TGCGTGCAGG CCTGCCCGAC CGCCACGCTG CAGGAGAAGT CGGTGATCGA GATCGGCACG CCTGAGCGTG CGGTCGTGAC CACCTGCGCC TATTGCGGCG TCGGCTGCTC GTTCAAGGCC GAAATGCGCG GCGACGAGCT GGTGCGGATG GTCCCCTACA AGGGCGGCAA GGCCAACCAC GGCCATTCCT GCGTCAAGGG GCGCTTCGCC TATGGCTATG CGGCCCACAA GGACCGGATC CTGAAGCCCA TGGTGCGCGA GTCGATCCAC GATCCCTGGC AGGAGGTGAG CTGGGACGAG GCCTTGGGCT TCGCCGCGCG CCGCCTGACG GCGATCCAGG AGAAGCACGG CCGCCAATCC GTGGGCGTCA TCACCTCGTC TCGCTGCACG AACGAGGAGA CCTACCTCGT CCAGAAGCTG ACCCGCGCCG TCTTCCGCAA CAACAACACC GACACCTGCG CCCGGGTCTG CCACTCGCCC ACCGGCTACG GCCTGGGCCA GACCTTCGGC ACCTCGGCCG GGACGCAGGA TTTCGATTCG GTCGAGGCTG CGGACGTGGT GATGGTGATC GGCGCGAACC CGACCGACGG CCATCCGGTC TTCGCAAGCC GGCTGAAGAA GCGGCTGCGC AAGGGGGCGA AACTGATCGT GGTCGATCCG CGGCGCATCG ATCTGGTGAA GAGCCCCCAT ATCGCGGCGG CCCACCATCT GGCGCTCAGG CCCGGCACCA ACGTGGCCGT GGTGACGGCC ATGGCCCATG TCATCGTGAC CGAGGGGCTG GCGGATGAAA AATTCATCCG AGAACGCTGC GACTGGGACG AGTTCCAGGA CTTCGCCGAA TTCGCCGCCG ATCCGCGTCA CGCGCCCGAG GCGATCGAGA GCCTGACCGG CGTGCCCGCG GCCGAGCTGC GTGCGGCGGC CCGCCTCTAT GCCACCGGCG GGAATGCCGC GATCTATTAC GGGCTGGGCG TGACCGAGCA CAGCCAGGGC TCGACCACCG TCATCGGCAT CGCGAACCTC GCCATGCTCA CCGGCAACAT CGGCCGGCCC GGCGTGGGCG TGAACCCGCT GCGGGGCCAG AACAATGTGC AGGGCTCCTG CGACATGGGC TCGTTCCCGC ACGAGCTGCC GGGCTACCGT CATGTGAAGA GCGATGCGGC GCGCGCGGTG TTCGAGCGGC TCTGGGGCGT CGAGATCGAT CCCGAGCCGG GACTGCGGAT CCCGAACATG CTCGATGCGG CGGTCGAGGG CACCTTCAAG GGGCTTTATT GCCAGGGGGA GGACATCCTG CAATCGGACC CCGACACGCG CCATGTCGCG GCGGGCCTTG CGGCGATGGA GTGCGTGATC GTCCACGACC TCTTCCTGAA CGAGACCGCC AACTACGCCC ATGTCTTCCT TCCGGGCTCC TCTTTCCTCG AGAAGGACGG CACCTTCACC AACGCCGAGC GCCGCATCAA CCGCGTGCGC AAGGTCATGG CGCCGAAAAA TGGCTTCGCC GACTGGGAAG TGACGCAGAT GCTGGCCAAT GCGCTGGGCG CGGGCTGGGG CTACACCCAT CCGAGCCAGA TCATGGATGA GATCGCGGCC ACCACGCCCT CCTTCGCCGG CGTCTCCTAC GAGCGGCTGG AAGAGGCGGG CTCGATCCAG TGGCCCTGCA ACGAGGAGCA TCCGCTGGGC ACGCCGCTCA TGCATGTCGA GGGCTTCGTG CGCGGCCGCG GAAAACTCAT CCGCACGGAA TATGTGGCGA CGGACGAGAA GACGGGCCCG CGTTTCCCGC TGCTACTCAC CACCGGGCGG ATCCTCTCGC AGTACAACGT GGGCGCACAG ACGCGGCGGA CGGCGAACAG CGTCTGGCAT CCCGAGGACG TGCTCGAGAT CCATCCGCAC GATGCCGAGG TGCGCGGCGT GGCCGAAGGC GACTGGGTGC GCCTCGCCTC GCGGGCGGGC GAGACGACGC TCCGGGCGCG GCTGACGGAT CGCGTATCGC CGGGCGTGGT CTATACGACC TTCCACCATC CTGCGACCCA AGCGAATGTC ATCACCACCG ACTTCTCGGA CTGGGCGACG AACTGCCCGG AATACAAGGT GACGGCGGTG CAGGTTGCGC CGTCGAACGG GCCGTCGGAC TGGCAGGAGG ATTACCGCGC CCAGGCGGAC CTCGCGCGGC GCATCCTGCC GGCTGCCGAA TGA
|
Protein sequence | MKDFILPDDR DFGTPRSRAT ETVTLEIDGF PVTVPAGTSV MRAAAEAGIS VPKLCASDTL DAFGSCRLCL VEIEGRAGTP ASCTTPVTPG MKVRTQTPKL KQLRRGVMEL YISDHPLDCL TCAANGDCEL QDMAGAVGLR DVRYEAVENH FTPRNAGGDL NPQWMVKDES NPYFTYDPSK CIVCSRCVRA CEEVQGTFAL TIEGRGFDSR VSAGMASDSF LTSDCVSCGA CVQACPTATL QEKSVIEIGT PERAVVTTCA YCGVGCSFKA EMRGDELVRM VPYKGGKANH GHSCVKGRFA YGYAAHKDRI LKPMVRESIH DPWQEVSWDE ALGFAARRLT AIQEKHGRQS VGVITSSRCT NEETYLVQKL TRAVFRNNNT DTCARVCHSP TGYGLGQTFG TSAGTQDFDS VEAADVVMVI GANPTDGHPV FASRLKKRLR KGAKLIVVDP RRIDLVKSPH IAAAHHLALR PGTNVAVVTA MAHVIVTEGL ADEKFIRERC DWDEFQDFAE FAADPRHAPE AIESLTGVPA AELRAAARLY ATGGNAAIYY GLGVTEHSQG STTVIGIANL AMLTGNIGRP GVGVNPLRGQ NNVQGSCDMG SFPHELPGYR HVKSDAARAV FERLWGVEID PEPGLRIPNM LDAAVEGTFK GLYCQGEDIL QSDPDTRHVA AGLAAMECVI VHDLFLNETA NYAHVFLPGS SFLEKDGTFT NAERRINRVR KVMAPKNGFA DWEVTQMLAN ALGAGWGYTH PSQIMDEIAA TTPSFAGVSY ERLEEAGSIQ WPCNEEHPLG TPLMHVEGFV RGRGKLIRTE YVATDEKTGP RFPLLLTTGR ILSQYNVGAQ TRRTANSVWH PEDVLEIHPH DAEVRGVAEG DWVRLASRAG ETTLRARLTD RVSPGVVYTT FHHPATQANV ITTDFSDWAT NCPEYKVTAV QVAPSNGPSD WQEDYRAQAD LARRILPAAE
|
| |