Gene YpsIP31758_4119 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpsIP31758_4119 
SymbolfdoG 
ID5386955 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis IP 31758 
KingdomBacteria 
Replicon accessionNC_009708 
Strand
Start bp4647595 
End bp4650642 
Gene Length3048 bp 
Protein Length1015 aa 
Translation table11 
GC content52% 
IMG OID640867148 
Productaerobic formate dehydrogenase subunit alpha 
Protein accessionYP_001403062 
Protein GI153948475 
COG category[C] Energy production and conversion 
COG ID[COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
[TIGR01553] formate dehydrogenase, alpha subunit, proteobacterial-type 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGGTCA GCAGAAGGCA GTTCTTTAAG ATCTGCGCTG GCGGTATGGC AGGAACAACG 
GTCGCGGCAC TCGGCTTCGC GCCGTCAGTG GCGCTGGCGG AAACGCGCAA TTATAAATTG
CTGCGCGCTC GCGAGACACG TAACACCTGC ACATATTGCT CTGTCGGTTG TGGGCTTTTG
ATGTATAGCC TTGGCGACGG CGCGAAAAAT GCTAAAGAAA GTATTTTTCA CATTGAAGGG
GACCCGGATC ATCCGGTGAA CCGTGGCGCA CTGTGCCCGA AAGGGGCAGG ATTAGTCGAC
TTTATCCACA GTGAAAGCCG CTTGAAATAC CCAGAATACC GGGCGCCAGG CTCAGACAAG
TGGCAGAGAA TCACTTGGGA TGATGCGTTT ACCCGTATTG CCAAATTAAT GAAAGAAGAC
CGGGACGCTA ACTTCATTAA GACCAACGAC GCCGGTGTTA CCGTCAACCG TTGGTTAAGC
ACCGGTATGC TGTGTGCTTC GGCATCAAGC AATGAAACGG GTTATTTGAC CCAAAAATTT
AGTCGCGCTC TCGGCATGCT TGCCGTAGAC AACCAAGCAC GTGTCTGACA CGGACCAACG
GTAGCAAGTC TTGCTCCAAC ATTTGGTCGC GGTGCGATGA CCAACCACTG GGTTGACATT
AAGAACGCGG ATTTAATTAT CGTCATGGGC GGTAATGCGG CAGAAGCGCA TCCGGTGGGG
TTCCGCTGGG CGATGGAAGC CAAGATCCAC AACAATGCCA AGCTTCTGGT GATAGATCCG
CGCTTTACTC GTACGGCATC GGTGGCTGAT TTCTATACGC CAATCCGCTC CGGTACGGAT
ATTGCGTTTC TGTCCGGTGT CTTGTTGTAC CTGATCTCCA ACAATAAAAT TAACCGTGAA
TATGTCGAAG CCTATACCAA CGCCAGCTTG CTGGTGCGGG AAGACTATGC TTTCGATGAT
GGCCTGTTCA GTGGCTATGA CGCCGAAAAC CGTAAATACG ATAAAACCAG CTGGAACTAT
CAGTTGGATG AAGACGGTTT TGCTAAACGG GATGTCACGC TGCAACACCC GCGTTGTGTG
TGGAACCTGC TGAAAGAGCA CGTTAGCCGC TATACACCGG AAGTGGTCTC CAATATTTGC
GGTACGCCAA AAGACGATTT CCTGCAGGTT TGTGAATATC TCGCCGAAAC CAGTGTATCC
AATAAAACGG CGACGTTCCT GTATGCCTTG GGTTGGACGC AGCACTCTGT GGGTGCGCAG
AATATCCGTA CTATGGCGAT GATCCAGTTG CTGTTGGGCA ACATGGGGAT GGCAGGTGGC
GGTATTAACG CCCTACGCGG TCACTCCAAT ATCCAAGGGC TGACTGACCT TGGCCTGTTG
TCGCAAAGCC TGCCGGGTTA CTTGAACTTG CCGTCAGAAA AACAGCCGGA TATTGATACC
TACCTGAAGG CCAACACGCC GAAAACCCTG TTACCAGGCC AGGTTAACTA CTGGAGCAAT
TACCCGAAAT TCTTTGTCAG TTTGATGAAA AGTTTCTACG GTGATAACGC CCAAAAGGAA
AATGGCTGGG GCTACGACTG GTTGCCGAAG TGGGATAAAG GCTACGACGT ATTACAGTAT
TTCGAAATGA TGTCGCAGGG CAAGGTCAAC GGCTATCTGT GCCAAGGCTT TAACCCGATT
GCCTCGTTCC CGGATAAAAA CAAAGTGACA GCAGCGCTGT CGAAGCTGAA ATTCTTGGTG
ACCATTGATC CGCTCAATAC TGAAACCGCG AATTTCTGGC AAAACCACGG TGAATTTAAC
GATGTCGATC CATCGAAAAT TCAAACTGAG GTGTTCCGCT TGCCATCCAG TTGTTTTGCT
GAAGAGAACG GCTCGATCGT TAACTCCAGC CGCTGGCTGC AATGGCACTG GAAAGGCGCT
GATTCACCGG GAGAAGCACT GAACGATGGT GCGATTCTGG CGGGCATCTT TATGCGTATG
CGTGAGATGT ACCAGCGGGA AGGTGGTGCG GTGCCTGAAC AGGTACTCAA TATGACTTGG
GACTACCTGA CACCAGAAAA TCCAGAGCCG GAAGAAGTGG CAATGGAAAG TAATGGGCGA
GCGCTGGCGG ATCTCACCGA TGCCGATGGC AAAGTGCTGG TCAAAAAAGG CGAACAGCTC
AGTACCTTCG CTCAACTGCG TGATGACGGT ACCACTTCCA GTGGTTGCTG GATCTTTGCG
GGTAGCTGGA CACCAGCCGG TAACCAAATG GCGCGGCGTG ATAATGCTGA TCCATCTGGC
CTTGGCAATA CCTTGGGCTG GGCCTGGGCA TGGCCGCTTA ACCGTCGCAT TCTGTACAAC
CGTGCGTCTG CTGACCCGCA GGGTAAACCG TGGGATCCGA AACGCCAGCT GCTGGAGTGG
GATGGTGCTA AGTGGGCTGG CATTGATGTT GCTGACTACA GTGCAGCGGC ACCGGGCAGT
GATGTTGGGC CGTTTATCAT GCAGCCTGAA GGGATGGGCC GTTTGTTTGC AATCGATAAG
ATGGCTGAAG GGCCGTTCCC TGAGCATTAT GAGCCATTTG AAACGCCGCT GGGGACCAAC
CCGCTGCATC CGAATGTGAT ATCTAACCCA GCCGCTCGTG TATTTAAAGA CGATCTGGCC
GCAATGGGGT CGCACGAGCA ATTCCCTTAT GTCGGCACCA CTTATCGTCT GACCGAACAT
TTCCACTACT GGACCAAACA TGCGTTGCTC AATGCTATCG CTCAACCGGA ACAGTTTGTG
GAAATTGGCG AAAAACTGGC GGCGAAGAAA GGGATTAAGC AAGGCGATAC GGTGAAAGTC
AGCTCTAACC GTGGCTTTAT CAAAGCCAAG GCGGTGGTGA CTAAACGTAT TCGTACGCTG
AATGTTCATG GGCAAGAAGT TGACACCATT GGTATTCCGA TCCACTGGGG ATACGAAGGG
GTGGCGAAAA AAGGCTTCCT GGCGAATACC CTGACACCGT ATGTCGGTGA TGCTAATACG
CAAACGCCAG AGTTCAAGGC GTTTCTGGTC AATGTGGAAA AGGTGTAA
 
Protein sequence
MQVSRRQFFK ICAGGMAGTT VAALGFAPSV ALAETRNYKL LRARETRNTC TYCSVGCGLL 
MYSLGDGAKN AKESIFHIEG DPDHPVNRGA LCPKGAGLVD FIHSESRLKY PEYRAPGSDK
WQRITWDDAF TRIAKLMKED RDANFIKTND AGVTVNRWLS TGMLCASASS NETGYLTQKF
SRALGMLAVD NQARVUHGPT VASLAPTFGR GAMTNHWVDI KNADLIIVMG GNAAEAHPVG
FRWAMEAKIH NNAKLLVIDP RFTRTASVAD FYTPIRSGTD IAFLSGVLLY LISNNKINRE
YVEAYTNASL LVREDYAFDD GLFSGYDAEN RKYDKTSWNY QLDEDGFAKR DVTLQHPRCV
WNLLKEHVSR YTPEVVSNIC GTPKDDFLQV CEYLAETSVS NKTATFLYAL GWTQHSVGAQ
NIRTMAMIQL LLGNMGMAGG GINALRGHSN IQGLTDLGLL SQSLPGYLNL PSEKQPDIDT
YLKANTPKTL LPGQVNYWSN YPKFFVSLMK SFYGDNAQKE NGWGYDWLPK WDKGYDVLQY
FEMMSQGKVN GYLCQGFNPI ASFPDKNKVT AALSKLKFLV TIDPLNTETA NFWQNHGEFN
DVDPSKIQTE VFRLPSSCFA EENGSIVNSS RWLQWHWKGA DSPGEALNDG AILAGIFMRM
REMYQREGGA VPEQVLNMTW DYLTPENPEP EEVAMESNGR ALADLTDADG KVLVKKGEQL
STFAQLRDDG TTSSGCWIFA GSWTPAGNQM ARRDNADPSG LGNTLGWAWA WPLNRRILYN
RASADPQGKP WDPKRQLLEW DGAKWAGIDV ADYSAAAPGS DVGPFIMQPE GMGRLFAIDK
MAEGPFPEHY EPFETPLGTN PLHPNVISNP AARVFKDDLA AMGSHEQFPY VGTTYRLTEH
FHYWTKHALL NAIAQPEQFV EIGEKLAAKK GIKQGDTVKV SSNRGFIKAK AVVTKRIRTL
NVHGQEVDTI GIPIHWGYEG VAKKGFLANT LTPYVGDANT QTPEFKAFLV NVEKV