Gene YpAngola_A3770 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A3770 
SymbolfdoG 
ID5802247 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp3998049 
End bp4001096 
Gene Length3048 bp 
Protein Length1015 aa 
Translation table11 
GC content52% 
IMG OID641341569 
Productformate dehydrogenase, alpha subunit, selenocysteine-containing 
Protein accessionYP_001608081 
Protein GI162421766 
COG category[C] Energy production and conversion 
COG ID[COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
[TIGR01553] formate dehydrogenase, alpha subunit, proteobacterial-type 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones50 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGGTCA GCAGAAGGCA GTTCTTTAAG ATCTGCGCTG GCGGTATGGC AGGAACAACG 
GTCGCGGCAC TCGGCTTCGC GCCGTCAGTG GCGCTGGCGG AAACGCGCAA TTATAAATTG
CTGCGCGCTC GCGAGACACG TAACACCTGC ACATATTGCT CTGTCGGTTG TGGGCTTTTG
ATGTATAGCC TTGGCGACGG CGCGAAAAAT GCTAAAGAAA GTATTTTTCA CATTGAAGGG
GACCCGGATC ATCCGGTGAA CCGTGGCGCA CTGTGTCCGA AAGGGGCAGG GTTAGTCGAC
TTTATCCATA GCGAAAGCCG CTTGAAATAC CCAGAATACC GGGCGCCAGG CTCAGACAAG
TGGCAGAGAA TCACTTGGGA TGATGCGTTT ACCCGTATTG CCAAATTAAT GAAAGAAGAC
CGGGACGCTA ACTTCATTAA GACCAACGAC GCCGGTGTTA CCGTCAACCG TTGGTTAAGC
ACCGGTATGC TGTGTGCTTC GGCATCAAGC AATGAAACGG GTTATTTGAC CCAAAAATTT
AGTCGCGCTC TCGGCATGCT TGCCGTAGAC AACCAAGCAC GTGTCTGACA CGGACCAACG
GTAGCAAGTC TTGCTCCAAC ATTTGGTCGC GGTGCGATGA CCAACCACTG GGTTGACATT
AAGAACGCGG ATTTAATTAT CGTCATGGGC GGTAATGCGG CAGAAGCGCA TCCGGTGGGG
TTCCGCTGGG CGATGGAAGC CAAAATCCAC AACAATGCCA AGCTTCTGGT GATAGATCCG
CGCTTTACTC GTACGGCATC GGTGGCTGAT TTCTATACGC CAATCCGCTC CGGTACGGAT
ATTGCGTTTC TGTCCGGTGT CTTGTTGTAC CTGATCTCCA ACAATAAAAT TAACCGTGAA
TATGTCGAAG CCTATACCAA CGCCAGCTTG CTGGTGCGGG AAGACTATGC TTTCGATGAT
GGCCTGTTCA GTGGCTATGA CGCCGAAAAC CGTAAATACG ATAAAACCAG CTGGAACTAT
CAGTTGGATG AAGACGGTTT TGCTAAACGG GATGTCACGC TGCAACACCC GCGTTGTGTG
TGGAACCTGC TGAAAGAGCA CGTTAGCCGC TATACACCGG AAGTGGTCTC CAATATTTGC
GGTACGCCAA AAGACGATTT CCTGCAGGTT TGTGAATATC TCGCCGAAAC CAGTGTATCC
AATAAAACGG CGACGTTCCT GTATGCCTTG GGTTGGACGC AGCACTCTGT GGGTGCGCAG
AATATCCGTA CTATGGCGAT GATCCAGTTG CTGTTGGGCA ACATGGGGAT GGCAGGTGGC
GGTATTAACG CCCTACGCGG TCACTCCAAT ATCCAAGGGC TGACTGACCT TGGCCTGTTG
TCGCAAAGCC TGCCGGGTTA CTTGAACTTG CCGTCAGAAA AACAGCCGGA TATTGATACC
TACCTGAAGG CCAACACGCC GAAAACCCTG TTACCAGGCC AGGTTAACTA CTGGAGCAAT
TACCCGAAAT TCTTTGTCAG TTTGATGAAA AGTTTCTACG GTGATAACGC CCAAAAGGAA
AATGGCTGGG GCTATGACTG GCTGCCGAAG TGGGATAAAG GCTACGACGT CTTGCAGTAT
TTCGAAATGA TGTCGCAGGG CAAGGTCAAT GGCTATCTGT GCCAAGGCTT TAACCCGATT
GCCTCGTTCC CGGATAAAAA CAAAGTGACA GCGGCGCTGT CGAAGCTGAA ATTCTTGGTG
ACGATTGATC CACTCAATAC TGAAACCGCG AATTTCTGGC AAAACCACGG TGAATTTAAC
GATGTCGATC CATCGAAAAT TCAAACTGAG GTGTTCCGCT TGCCATCCAG TTGTTTTGCT
GAAGAGAACG GCTCGATCGT TAACTCCAGC CGCTGGCTGC AATGGCACTG GAAAGGCGCT
GATTCACCGG GAGAAGCACT GAACGATGGT GCAATTCTGG CGGGCATCTT TATGCGTATG
CGTGAGATGT ACCAGCGGGA AGGTGGCGCG GTGCCTGAGC AGGTGCTCAA TATGACCTGG
GACTACCTGA CACCAGAAAA TCCAGAGCCG GAAGAAGTGG CAATGGAAAG TAATGGGCGA
GCGCTGGCGG ATCTCACTGA TGCCGACGGC AAAGTGTTGG TCAAAAAAGG CGAACAGCTC
AGTACCTTCG CTCAACTGCG TGATGACGGT ACCACTTCCA GTGGTTGCTG GATCTTTGCG
GGTAGCTGGA CACCGGCCGG TAACCAAATG GCGCGGCGTG ATAATGCGGA TCCATCGGGT
CTCGGCAATA CCTTGGGCTG GGCCTGGGCA TGGCCGCTTA ACCGCCGCAT TCTGTACAAC
CGTGCGTCTG CTGACCCGCA GGGTAAACCG TGGGATCCGA AACGCCAGTT GCTGGAGTGG
GATGGTGCTA AGTGGGCCGG TATTGATGTT GCTGACTACA GTGCTGCGGC ACCGGGCAGT
GATGTTGGGC CGTTTATCAT GCAGCCTGAA GGGATGGGCC GTTTGTTTGC AATCGATAAG
ATGGCTGAAG GGCCGTTCCC TGAGCATTAT GAGCCATTTG AAACGCCGTT GGGGACCAAC
CCGCTGCATC CGAATGTGAT CTCTAACCCA GCCGCTCGTG TATTTAAAGA TGATCTGGCC
GCGATGGGGT CGCACGAGCA ATTCCCTTAT GTTGGCACCA CTTATCGTTT AACCGAACAT
TTCCACTACT GGACCAAACA TGCGTTGCTC AATGCCATCG CTCAACCGGA ACAGTTTGTG
GAAATTGGCG AAAAACTGGC GGCGAAGAAA GGGATTAAGC AAGGCGATAC GGTGAAAGTC
AGCTCTAACC GTGGCTTTAT CAAAGCCAAG GCGGTGGTGA CTAAACGTAT TCGTACTCTG
AATGTTCATG GGCAAGAAGT TGACACCATT GGTATTCCGA TCCATTGGGG ATATGAAGGG
GTGGCAAAAA AAGGCTTCCT GGCGAATACC CTGACACCGT ATGTCGGTGA TGCCAATACG
CAAACGCCAG AGTTCAAGGC GTTTCTGGTC AATGTGGAAA AGGTGTAA
 
Protein sequence
MQVSRRQFFK ICAGGMAGTT VAALGFAPSV ALAETRNYKL LRARETRNTC TYCSVGCGLL 
MYSLGDGAKN AKESIFHIEG DPDHPVNRGA LCPKGAGLVD FIHSESRLKY PEYRAPGSDK
WQRITWDDAF TRIAKLMKED RDANFIKTND AGVTVNRWLS TGMLCASASS NETGYLTQKF
SRALGMLAVD NQARVUHGPT VASLAPTFGR GAMTNHWVDI KNADLIIVMG GNAAEAHPVG
FRWAMEAKIH NNAKLLVIDP RFTRTASVAD FYTPIRSGTD IAFLSGVLLY LISNNKINRE
YVEAYTNASL LVREDYAFDD GLFSGYDAEN RKYDKTSWNY QLDEDGFAKR DVTLQHPRCV
WNLLKEHVSR YTPEVVSNIC GTPKDDFLQV CEYLAETSVS NKTATFLYAL GWTQHSVGAQ
NIRTMAMIQL LLGNMGMAGG GINALRGHSN IQGLTDLGLL SQSLPGYLNL PSEKQPDIDT
YLKANTPKTL LPGQVNYWSN YPKFFVSLMK SFYGDNAQKE NGWGYDWLPK WDKGYDVLQY
FEMMSQGKVN GYLCQGFNPI ASFPDKNKVT AALSKLKFLV TIDPLNTETA NFWQNHGEFN
DVDPSKIQTE VFRLPSSCFA EENGSIVNSS RWLQWHWKGA DSPGEALNDG AILAGIFMRM
REMYQREGGA VPEQVLNMTW DYLTPENPEP EEVAMESNGR ALADLTDADG KVLVKKGEQL
STFAQLRDDG TTSSGCWIFA GSWTPAGNQM ARRDNADPSG LGNTLGWAWA WPLNRRILYN
RASADPQGKP WDPKRQLLEW DGAKWAGIDV ADYSAAAPGS DVGPFIMQPE GMGRLFAIDK
MAEGPFPEHY EPFETPLGTN PLHPNVISNP AARVFKDDLA AMGSHEQFPY VGTTYRLTEH
FHYWTKHALL NAIAQPEQFV EIGEKLAAKK GIKQGDTVKV SSNRGFIKAK AVVTKRIRTL
NVHGQEVDTI GIPIHWGYEG VAKKGFLANT LTPYVGDANT QTPEFKAFLV NVEKV