Gene Rsph17025_2853 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_2853 
Symbol 
ID5084231 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009428 
Strand
Start bp2908405 
End bp2911287 
Gene Length2883 bp 
Protein Length960 aa 
Translation table11 
GC content68% 
IMG OID640484423 
Productformate dehydrogenase, alpha subunit 
Protein accessionYP_001169044 
Protein GI146278885 
COG category[R] General function prediction only 
COG ID[COG3383] Uncharacterized anaerobic dehydrogenase 
TIGRFAM ID[TIGR01591] formate dehydrogenase, alpha subunit, archaeal-type 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.771386 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGACT TCATCCTTCC CGACGACCGC GACTTCGGCA CACCCGCCTC GCGCGCGACC 
GAGACCGTGA CGCTCGAGAT CGACGGCTTT CCGGTGACGG TGCCTGCGGG CACCTCGGTG
ATGCGGGCCG CGGCCGAGGC GGGCATCTCG GTGCCGAAGC TCTGCGCGAC CGACAGCCTC
GACGCCTTCG GCTCCTGCCG GCTGTGTCTC GTCGAGATCG AGGGCCGCGC GGGCACGCCC
GCCTCCTGCA CCACGCCCGT CACGCCGGGC ATGAAGGTGC GCACCCAGAC CCCCAAGCTG
AAGCAGCTCC GCCGCGGCGT GATGGAGCTT TACATCTCCG ACCATCCGCT CGACTGCCTG
ACCTGCTCCG CCAACGGCGA TTGCGAGCTT CAGGACATGG CCGGGGCCGT GGGTCTGCGC
GACGTGCGCT ACGAGGCGGT GGAAAACCAC TTCACCCCGC GCAACGCCGG CGGCGACCTC
AACCCGCAAT GGATGGCCAA GGACGAGTCG AACCCCTATT TCACCTACGA CCCGTCGAAG
TGCATCGTCT GCTCGCGCTG CGTGCGGGCC TGCGAGGAGG TGCAGGGCAC CTTCGCCCTG
ACGATCGAGG GCCGGGGCTT CGACAGCCGC GTCTCGGCCG GCATGGCGAG CGACAGTTTC
CTCACCTCCG ACTGCGTGAG CTGCGGCGCC TGCGTGCAGG CCTGCCCGAC CGCCACCCTG
CAGGAGAAGT CGGTGATCGA GATCGGCACG CCCGAGCGTG CGGTCGTGAC CACCTGCGCC
TATTGCGGTG TCGGCTGCTC GTTCAAGGCC GAGATGCGCG GGGACGAGCT GGTTCGGATG
GTCCCCTACA AGGGCGGCAA GGCCAACCAC GGCCACTCCT GCGTCAAGGG CCGTTTCGCC
TACGGCTATG CGGCGCACAA GGACCGGATC CTGAAGCCGA TGGTGCGCGA GTCGATCCAC
GACCCGTGGC AGGAGGTGAG CTGGGACGAG GCGCTGGGCT TCGCCGCACG CCGTCTGACC
GCGATCCAGC AGAAGCACGG CCGCCAGTCG GTCGGCGTCA TCACCTCGTC GCGCTGCACG
AACGAGGAGA CCTACCTCGT CCAGAAGCTG ACCCGTGCGG TCTTCCGCAA CAACAATACC
GACACCTGCG CCCGCGTCTG CCATTCGCCC ACCGGCTACG GGCTTGGCCA GACCTTCGGC
ACCTCGGCCG GCACGCAGGA CTTCGATTCC GTCGAGGCCT CGGATGTGGT GATGGTGATC
GGGGCAAACC CGACCGACGG CCACCCGGTC TTCGCAAGCC GACTGAAGAA GCGGCTGCGT
CAGGGCGCAA AGCTGATCGT CATCGACCCG CGGCGCATTG ATCTCGTCAG GAGCCCGCAT
GTCGCCGCGG CCCATCACCT CGCGCTGAGA CCGGGCACCA ACGTGGCCGT GGTCACGGCG
CTGGCCCATG TGATCGTGAC CGAGGGGCTG GCGGATGACA CATTCATCCG CGAACGCTGC
GACTGGGACG AGTTCCAGGA TTTTGCCGAG TTCGCCGCCG ATCCCCGCCA TTCGCCCGAA
GCGGTCGAGG CGCTGACGGG CGTGCCCGCG GCCGAGCTGC GCGCCGCCGC CCGCCTCTAT
GCCACCGGCG GCAACGCCGC GATCTACTAC GGGCTGGGCG TGACCGAGCA CAGCCAGGGC
TCGACCACCG TCATCGGCAT CGCGAACCTT GCCATGCTCA CCGGCAACAT CGGACGGCCG
GGCGTGGGGG TGAACCCGCT GCGGGGCCAG AACAACGTGC AGGGCTCGTG CGACATGGGT
TCGTTCCCGC ACGAACTGCC GGGCTACCGC CATGTGAAGA GCGACGCCGC GCGCGAGGTG
TTCGAGCGGC TCTGGGGCGT CGGGATCGAC CCCGAGCCGG GCCTGCGCAT CCCCAACATG
CTCGATGCCG CGGTCGAGGG CACCTTCAAG GGTCTCTATT GCCAGGGCGA GGACATCCTG
CAATCGGACC CCGACACCCG CCATGTCGCG GCGGGTCTTG CGGCGATGGA ATGCGTGATC
GTCCATGACC TCTTCCTGAA CGAGACCGCC AACTACGCCC ATGTCTTCCT GCCGGGTTCC
TCCTTCCTGG AAAAGGACGG CACCTTCACC AACGCCGAGC GGCGCATCAA CCGCGTGCGC
AAGGTCATGG CGCCGAAGAA CGGCTTCGCC GACTGGGAAG TGACGCAGAT GCTGGCCAAC
GCGCTCGGCG CGGGCTGGGG CTACACGCAC CCGAGCCAGA TCATGGACGA GATCGCGGCC
ACCACGCCCT CCTTCGCCGG GGTCTCCTAC GAGCGGCTGG AAGAGGCGGG CTCGATCCAA
TGGCCCTGCG ACGAAGAGCA CCCGCTGGGC ACGCCTCTCA TGCATGTCGA AGGTTTCGTG
CGCGGGCGCG GACGGTTCAT CCGCACCGAA TATGTGGCGA CGGATGAACG GACCGGCCCG
CGTTTCCCGC TGCTGCTGAC CACCGGGCGG ATCCTCTCGC AATACAACGT GGGGGCCCAG
ACGCGGCGGA CGGCGAACAG CATCTGGCAC CCCGAGGACG TGCTGGAGAT CCATCCCCAC
GACGCCGAGG TGCGGGGGGT CAAGGACGAG GACTGGGTGC GGCTTGCCTC GCGCGCGGGC
GAGACGACGC TGCGCGCGAA ACTCACCGAT CGGGTCTCTC CGGGCGTGGT CTATACGACC
TTCCACCACC CGGCAACGCA GGCCAACGTC ATCACGACCG ACTTTTCGGA CTGGGCGACC
AACTGCCCCG AATACAAGGT GACGGCGGTG CAGGTCTCGC CCTCGAACGG TCCGTCGGAC
TGGCAGGAGG ATTACCGCCT GCAGGCCGAA CTGGCCCGCC GCATCCTGCC GGCGGCCGAA
TGA
 
Protein sequence
MKDFILPDDR DFGTPASRAT ETVTLEIDGF PVTVPAGTSV MRAAAEAGIS VPKLCATDSL 
DAFGSCRLCL VEIEGRAGTP ASCTTPVTPG MKVRTQTPKL KQLRRGVMEL YISDHPLDCL
TCSANGDCEL QDMAGAVGLR DVRYEAVENH FTPRNAGGDL NPQWMAKDES NPYFTYDPSK
CIVCSRCVRA CEEVQGTFAL TIEGRGFDSR VSAGMASDSF LTSDCVSCGA CVQACPTATL
QEKSVIEIGT PERAVVTTCA YCGVGCSFKA EMRGDELVRM VPYKGGKANH GHSCVKGRFA
YGYAAHKDRI LKPMVRESIH DPWQEVSWDE ALGFAARRLT AIQQKHGRQS VGVITSSRCT
NEETYLVQKL TRAVFRNNNT DTCARVCHSP TGYGLGQTFG TSAGTQDFDS VEASDVVMVI
GANPTDGHPV FASRLKKRLR QGAKLIVIDP RRIDLVRSPH VAAAHHLALR PGTNVAVVTA
LAHVIVTEGL ADDTFIRERC DWDEFQDFAE FAADPRHSPE AVEALTGVPA AELRAAARLY
ATGGNAAIYY GLGVTEHSQG STTVIGIANL AMLTGNIGRP GVGVNPLRGQ NNVQGSCDMG
SFPHELPGYR HVKSDAAREV FERLWGVGID PEPGLRIPNM LDAAVEGTFK GLYCQGEDIL
QSDPDTRHVA AGLAAMECVI VHDLFLNETA NYAHVFLPGS SFLEKDGTFT NAERRINRVR
KVMAPKNGFA DWEVTQMLAN ALGAGWGYTH PSQIMDEIAA TTPSFAGVSY ERLEEAGSIQ
WPCDEEHPLG TPLMHVEGFV RGRGRFIRTE YVATDERTGP RFPLLLTTGR ILSQYNVGAQ
TRRTANSIWH PEDVLEIHPH DAEVRGVKDE DWVRLASRAG ETTLRAKLTD RVSPGVVYTT
FHHPATQANV ITTDFSDWAT NCPEYKVTAV QVSPSNGPSD WQEDYRLQAE LARRILPAAE