Gene Rsph17029_2741 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_2741 
Symbol 
ID4897707 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp2883545 
End bp2886427 
Gene Length2883 bp 
Protein Length960 aa 
Translation table11 
GC content68% 
IMG OID640113343 
Productformate dehydrogenase, alpha subunit 
Protein accessionYP_001044615 
Protein GI126463501 
COG category[R] General function prediction only 
COG ID[COG3383] Uncharacterized anaerobic dehydrogenase 
TIGRFAM ID[TIGR01591] formate dehydrogenase, alpha subunit, archaeal-type 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.313061 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGGACT TCATCCTTCC CGACGACCGC GACTTCGGCA CCCCCCGCTC GCGCGCGACC 
GAGACGGTGA CGCTCGAGAT CGATGGCTTC CCGGTGACGG TGCCTGCGGG CACGTCGGTG
ATGCGCGCCG CCGCCGAAGC GGGCATTTCG GTGCCGAAGC TCTGCGCGAG CGACACTCTC
GACGCCTTCG GCTCCTGCCG GCTCTGCCTC GTCGAGATCG AGGGCCGCGC GGGCACGCCC
GCCTCCTGCA CCACGCCGGT GACGCCCGGC ATGAAGGTGC GCACCCAGAC GCCGAAGCTG
AAGCAGCTGC GCCGCGGGGT GATGGAGCTC TATATTTCGG ATCACCCGCT CGACTGCCTG
ACCTGCGCCG CCAACGGCGA TTGCGAGCTG CAGGACATGG CGGGCGCGGT GGGCCTGCGC
GATGTGCGCT ACGAGGCCGT AGAAAATCAT TTCACGCCCC GCAATGCCGG CGGCGATCTC
AATCCGCAAT GGATGGTCAA GGACGAGTCG AACCCCTATT TCACCTACGA CCCGTCGAAA
TGCATCGTCT GCTCGCGCTG CGTGCGGGCC TGCGAGGAGG TGCAGGGCAC CTTCGCGCTG
ACCATCGAGG GCCGGGGCTT CGACAGCCGC GTCTCGGCCG GGATGGCCAG CGACAGTTTC
CTCACCTCCG ACTGCGTGAG CTGCGGCGCC TGCGTGCAGG CCTGCCCGAC CGCCACGCTG
CAGGAGAAGT CGGTGATCGA GATCGGCACG CCTGAGCGTG CGGTCGTGAC CACCTGCGCC
TATTGCGGCG TCGGCTGCTC GTTCAAGGCC GAAATGCGCG GCGACGAGCT GGTGCGGATG
GTCCCCTACA AGGGCGGCAA GGCCAACCAC GGCCATTCCT GCGTCAAGGG GCGCTTCGCC
TATGGCTATG CGGCCCACAA GGACCGGATC CTGAAGCCCA TGGTGCGCGA GTCGATCCAC
GATCCCTGGC AGGAGGTGAG CTGGGACGAG GCCTTGGGCT TCGCCGCGCG CCGCCTGACG
GCGATCCAGG AGAAGCACGG CCGCCAATCC GTGGGCGTCA TCACCTCGTC TCGCTGCACG
AACGAGGAGA CCTACCTCGT CCAGAAGCTG ACCCGCGCCG TCTTCCGCAA CAACAACACC
GACACCTGCG CCCGGGTCTG CCACTCGCCC ACCGGCTACG GCCTGGGCCA GACCTTCGGC
ACCTCGGCCG GGACGCAGGA TTTCGATTCG GTCGAGGCTG CGGACGTGGT GATGGTGATC
GGCGCGAACC CGACCGACGG CCATCCGGTC TTCGCAAGCC GGCTGAAGAA GCGGCTGCGC
AAGGGGGCGA AACTGATCGT GGTCGATCCG CGGCGCATCG ATCTGGTGAA GAGCCCCCAT
ATCGCGGCGG CCCACCATCT GGCGCTCAGG CCCGGCACCA ACGTGGCCGT GGTGACGGCC
ATGGCCCATG TCATCGTGAC CGAGGGGCTG GCGGATGAAA AATTCATCCG AGAACGCTGC
GACTGGGACG AGTTCCAGGA CTTCGCCGAA TTCGCCGCCG ATCCGCGTCA CGCGCCCGAG
GCGATCGAGA GCCTGACCGG CGTGCCCGCG GCCGAGCTGC GTGCGGCGGC CCGCCTCTAT
GCCACCGGCG GGAATGCCGC GATCTATTAC GGGCTGGGCG TGACCGAGCA CAGCCAGGGC
TCGACCACCG TCATCGGCAT CGCGAACCTC GCCATGCTCA CCGGCAACAT CGGCCGGCCC
GGCGTGGGCG TGAACCCGCT GCGGGGCCAG AACAATGTGC AGGGCTCCTG CGACATGGGC
TCGTTCCCGC ACGAGCTGCC GGGCTACCGT CATGTGAAGA GCGATGCGGC GCGCGCGGTG
TTCGAGCGGC TCTGGGGCGT CGAGATCGAT CCCGAGCCGG GACTGCGGAT CCCGAACATG
CTCGATGCGG CGGTCGAGGG CACCTTCAAG GGGCTTTATT GCCAGGGGGA GGACATCCTG
CAATCGGACC CCGACACGCG CCATGTCGCG GCGGGCCTTG CGGCGATGGA GTGCGTGATC
GTCCACGACC TCTTCCTGAA CGAGACCGCC AACTACGCCC ATGTCTTCCT TCCGGGCTCC
TCTTTCCTCG AGAAGGACGG CACCTTCACC AACGCCGAGC GCCGCATCAA CCGCGTGCGC
AAGGTCATGG CGCCGAAAAA TGGCTTCGCC GACTGGGAAG TGACGCAGAT GCTGGCCAAT
GCGCTGGGCG CGGGCTGGGG CTACACCCAT CCGAGCCAGA TCATGGATGA GATCGCGGCC
ACCACGCCCT CCTTCGCCGG CGTCTCCTAC GAGCGGCTGG AAGAGGCGGG CTCGATCCAG
TGGCCCTGCA ACGAGGAGCA TCCGCTGGGC ACGCCGCTCA TGCATGTCGA GGGCTTCGTG
CGCGGCCGCG GAAAACTCAT CCGCACGGAA TATGTGGCGA CGGACGAGAA GACGGGCCCG
CGTTTCCCGC TGCTACTCAC CACCGGGCGG ATCCTCTCGC AGTACAACGT GGGCGCACAG
ACGCGGCGGA CGGCGAACAG CGTCTGGCAT CCCGAGGACG TGCTCGAGAT CCATCCGCAC
GATGCCGAGG TGCGCGGCGT GGCCGAAGGC GACTGGGTGC GCCTCGCCTC GCGGGCGGGC
GAGACGACGC TCCGGGCGCG GCTGACGGAT CGCGTATCGC CGGGCGTGGT CTATACGACC
TTCCACCATC CTGCGACCCA AGCGAATGTC ATCACCACCG ACTTCTCGGA CTGGGCGACG
AACTGCCCGG AATACAAGGT GACGGCGGTG CAGGTTGCGC CGTCGAACGG GCCGTCGGAC
TGGCAGGAGG ATTACCGCGC CCAGGCGGAC CTCGCGCGGC GCATCCTGCC GGCTGCCGAA
TGA
 
Protein sequence
MKDFILPDDR DFGTPRSRAT ETVTLEIDGF PVTVPAGTSV MRAAAEAGIS VPKLCASDTL 
DAFGSCRLCL VEIEGRAGTP ASCTTPVTPG MKVRTQTPKL KQLRRGVMEL YISDHPLDCL
TCAANGDCEL QDMAGAVGLR DVRYEAVENH FTPRNAGGDL NPQWMVKDES NPYFTYDPSK
CIVCSRCVRA CEEVQGTFAL TIEGRGFDSR VSAGMASDSF LTSDCVSCGA CVQACPTATL
QEKSVIEIGT PERAVVTTCA YCGVGCSFKA EMRGDELVRM VPYKGGKANH GHSCVKGRFA
YGYAAHKDRI LKPMVRESIH DPWQEVSWDE ALGFAARRLT AIQEKHGRQS VGVITSSRCT
NEETYLVQKL TRAVFRNNNT DTCARVCHSP TGYGLGQTFG TSAGTQDFDS VEAADVVMVI
GANPTDGHPV FASRLKKRLR KGAKLIVVDP RRIDLVKSPH IAAAHHLALR PGTNVAVVTA
MAHVIVTEGL ADEKFIRERC DWDEFQDFAE FAADPRHAPE AIESLTGVPA AELRAAARLY
ATGGNAAIYY GLGVTEHSQG STTVIGIANL AMLTGNIGRP GVGVNPLRGQ NNVQGSCDMG
SFPHELPGYR HVKSDAARAV FERLWGVEID PEPGLRIPNM LDAAVEGTFK GLYCQGEDIL
QSDPDTRHVA AGLAAMECVI VHDLFLNETA NYAHVFLPGS SFLEKDGTFT NAERRINRVR
KVMAPKNGFA DWEVTQMLAN ALGAGWGYTH PSQIMDEIAA TTPSFAGVSY ERLEEAGSIQ
WPCNEEHPLG TPLMHVEGFV RGRGKLIRTE YVATDEKTGP RFPLLLTTGR ILSQYNVGAQ
TRRTANSVWH PEDVLEIHPH DAEVRGVAEG DWVRLASRAG ETTLRARLTD RVSPGVVYTT
FHHPATQANV ITTDFSDWAT NCPEYKVTAV QVAPSNGPSD WQEDYRAQAD LARRILPAAE