Gene Information |
Locus tag | EcolC_2183 |
Symbol | |
ID | 6067466 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | - |
Start bp | 2393248 |
End bp | 2396295 |
Gene Length | 3048 bp |
Protein Length | 1015 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641601590 |
Product | formate dehydrogenase, alpha subunit |
Protein accession | YP_001725149 |
Protein GI | 170020195 |
COG category | [C] Energy production and conversion |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence; [TIGR01553] formate dehydrogenase, alpha subunit, proteobacterial-type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.668487 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACGTCA GTCGCAGACA ATTTTTTAAA ATCTGCGCGG GCGGTATGGC TGGAACAACA GTAGCGGCAT TGGGCTTTGC CCCGAAGCAA GCACTGGCTC AGGCGCGAAA CTACAAATTA TTACGCGCTA AAGAGATCCG TAACACCTGC ACATACTGTT CCGTAGGTTG CGGGCTATTG ATGTATAGCC TGGGTGATGG CGCGAAAAAC GCCAGAGAAG CGATTTATCA CATTGAAGGT GATCCGGATC ATCCGGTAAG CCGTGGTGCG CTGTGCCCGA AAGGGGCCGG TTTGCTGGAT TACGTCAACA GCGAAAACCG TCTGCGCTAC CCGGAATATC GTGCGCCAGG TTCTGACAAA TGGCAGCGCA TTAGCTGGGA AGAAGCATTC TCCCGTATTG CGAAGCTGAT GAAAGCTGAC CGTGACGCTA ACTTTATTGA AAAGAACGAG CAGGGCGTAA CGGTAAACCG TTGGCTTTCT ACCGGTATGC TGTGTGCTTC CGGTGCCAGC AACGAAACCG GGATGCTGAC CCAGAAATTT GCCCGCTCCC TCGGGATGCT GGCGGTAGAC AACCAGGCGC GCGTCTGACA CGGACCAACG GTAGCAAGTC TTGCTCCAAC ATTTGGTCGC GGTGCGATGA CCAACCACTG GGTGGATATC AAAAACGCTA ACGTCGTAAT GGTAATGGGC GGTAACGCTG CTGAAGCGCA TCCCGTCGGT TTCCGCTGGG CGATGGAAGC GAAAAACAAC AACGATGCAA CCTTGATCGT TGTCGATCCC CGTTTTACGC GTACCGCTTC TGTAGCGGAT ATTTACGCAC CTATTCGTTC CGGTACGGAC ATTACGTTCC TGTCTGGCGT TTTGCGCTAC CTGATCGAAA ACAACAAAAT CAACGCCGAA TACGTTAAGC ATTACACCAA CGCCAGCTTG CTGGTGCGTG ATGATTTTGC TTTCGAAGAC GGTCTGTTCA GCGGCTACGA CGCTGAAAAA CGGCAATACG ATAAATCGTC CTGGAACTAT CAGTTCGATG AAAACGGCTA TGCGAAACGC GATGAAACAC TGACTCATCC GCGCTGTGTG TGGAACCTGC TGAAAGCGCA CGTTTCCCGC TACACGCCGG ACGTTGTTGA AAACATCTGT GGTACGCCAA AAGCCGACTT CCTGAAAGTG TGTGAAGTGC TGGCCTCCAC CAGCGCACCG GATCGCACAA CCACCTTCCT GTACGCGCTG GGCTGGACGC AGCACACCGT GGGTGCGCAG AACATCCGTA CTATGGCGAT GATCCAGTTG CTGCTCGGTA ACATGGGTAT GGCCGGTGGC GGCGTGAACG CATTGCGTGG TCACTCCAAC ATTCAGGGCC TGACTGACTT AGGCCTGCTC TCTACCAGCC TGCCAGGTTA TCTGACGCTG CCGTCAGAAA AACAGGTTGA TTTGCAGTCG TATCTGGAAG CGAACACGCC GAAAGCGACG CGGCCTGACC AGGTGAACTA CTGGAGCAAC TATCCGAAGT TCTTCGTTAG CCTGATGAAA TCTTTCTATG GCGATGCCGC GCAGAAAGAG AACAACTGGG GCTATGACTG GCTGCCGAAG TGGGACCAGA CCTACGACGT CATCAAGTAT TTCAACATGA TGGACGAAGG CAAAGTCACC GGTTATTTCT GCCAGGGCTT TAACCCGGTT GCGTCCTTCC CGGACAAAAA CAAAGTGGTG AGCTGCCTGA GCAAGCTGAA GTACATGGTG GTTATCGATC CGCTGGTGAC TGAAACCTCT ACCTTCTGGC AGAACCACGG CGAGTCGAAC 
GATGTCGATC CGGCGTCTAT TCAGACTGAA GTATTCCGTC TCCCTTCGAC CTGCTTTGCT GAAGAAGATG GTTCTATCGC TAACTCCGGT CGCTGGTTGC AGTGGCACTG GAAAGGTCAG GACGCGCCGG GCGAAGCGCG TAACGACGGC GAAATTCTGG CGGGTATTTA CCATCATCTG CGCGAGCTGT ACCAGGCCGA AGGTGGTAAA GGCGTAGAAC CGCTGATGAA GATGAGCTGG AACTACAAGC AACCGCACGA ACCGCAATCT GACGAAGTGG CTAAAGAGAA CAACGGCTAT GCGCTGGAAG ATCTCTATGA CGCTAATGGC GTGCTGATTG CGAAGAAAGG TCAGTTGCTG AGTAGCTTTG CGCATCTGCG TGATGACGGT ACAACCGCAT CTTCTTGCTG GATCTACACC GGTAGCTGGA CAGAGCAGGG CAACCAGATG GCTAACCGCG ATAACTCCGA CCCATCCGGT CTGGGGAATA CGCTGGGATG GGCCTGGGCG TGGCCGCTCA ACCGTCGCGT GCTCTACAAC CGTGCTTCGG CGGATATCAA TGGTAAACCG TGGGATCCGA AACGGATGCT GATCCAGTGG AACGGCAGCA AGTGGACGGG TAACGATATT CCGGACTTCG GCAATGCCGC ACCGGGTACG CCAACCGGGC CGTTTATCAT GCAGCCGGAA GGGATGGGAC GCCTGTTTGC CATCAACAAA ATGGCGGAAG GTCCGTTCCC GGAACACTAC GAGCCGATTG AAACGCCGCT GGGCACTAAC CCGCTGCATC CGAACGTGGT GTCTAACCCG GTCGTTCGTC TGTATGAACA AGACGCGCTG CGGATGGGTA AAAAAGAGCA GTTCCCGTAT GTGGGTACGA CCTATCGTCT GACCGAGCAC TTCCACACCT GGACCAAGCA CGCATTGCTC AACGCAATTG CTCAGCCGGA ACAGTTTGTG GAAATCAGCG AAACGCTGGC GGCGGCGAAA GGCATTAATA ATGGCGATCG TGTCACTGTC TCAAGCAAAC GTGGCTTTAT CCGCGCGGTG GCAGTGGTAA CGCGCCGTCT GAAGCCGCTG AATGTAAACG GTCAGCAGGT TGAAACGGTG GGTATTCCAA TCCACTGGGG CTTTGAGGGT GTCGCGCGTA AAGGTTATAT CGCTAACACT CTGACGCCGA ATGTCGGTGA TGCAAACTCG CAAACGCCGG AATATAAAGC GTTCTTAGTC AACATCGAGA AGGCGTAA
|
Protein sequence | MDVSRRQFFK ICAGGMAGTT VAALGFAPKQ ALAQARNYKL LRAKEIRNTC TYCSVGCGLL MYSLGDGAKN AREAIYHIEG DPDHPVSRGA LCPKGAGLLD YVNSENRLRY PEYRAPGSDK WQRISWEEAF SRIAKLMKAD RDANFIEKNE QGVTVNRWLS TGMLCASGAS NETGMLTQKF ARSLGMLAVD NQARVUHGPT VASLAPTFGR GAMTNHWVDI KNANVVMVMG GNAAEAHPVG FRWAMEAKNN NDATLIVVDP RFTRTASVAD IYAPIRSGTD ITFLSGVLRY LIENNKINAE YVKHYTNASL LVRDDFAFED GLFSGYDAEK RQYDKSSWNY QFDENGYAKR DETLTHPRCV WNLLKAHVSR YTPDVVENIC GTPKADFLKV CEVLASTSAP DRTTTFLYAL GWTQHTVGAQ NIRTMAMIQL LLGNMGMAGG GVNALRGHSN IQGLTDLGLL STSLPGYLTL PSEKQVDLQS YLEANTPKAT RPDQVNYWSN YPKFFVSLMK SFYGDAAQKE NNWGYDWLPK WDQTYDVIKY FNMMDEGKVT GYFCQGFNPV ASFPDKNKVV SCLSKLKYMV VIDPLVTETS TFWQNHGESN DVDPASIQTE VFRLPSTCFA EEDGSIANSG RWLQWHWKGQ DAPGEARNDG EILAGIYHHL RELYQAEGGK GVEPLMKMSW NYKQPHEPQS DEVAKENNGY ALEDLYDANG VLIAKKGQLL SSFAHLRDDG TTASSCWIYT GSWTEQGNQM ANRDNSDPSG LGNTLGWAWA WPLNRRVLYN RASADINGKP WDPKRMLIQW NGSKWTGNDI PDFGNAAPGT PTGPFIMQPE GMGRLFAINK MAEGPFPEHY EPIETPLGTN PLHPNVVSNP VVRLYEQDAL RMGKKEQFPY VGTTYRLTEH FHTWTKHALL NAIAQPEQFV EISETLAAAK GINNGDRVTV SSKRGFIRAV AVVTRRLKPL NVNGQQVETV GIPIHWGFEG VARKGYIANT LTPNVGDANS QTPEYKAFLV NIEKA
|
| |
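The coordinates and lengths in the Gene Information table can be cross-checked with a little arithmetic. The sketch below is only an illustration and is not part of the database record. It assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `span_length` and `expected_protein_length` are hypothetical.

```python
# Values copied from the Gene Information table above.
START_BP = 2393248
END_BP = 2396295
GENE_LENGTH_BP = 3048
PROTEIN_LENGTH_AA = 1015


def span_length(start: int, end: int) -> int:
    """Length of a coordinate interval, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both comparisons should hold if the record is internally consistent.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the listed gene length matches the coordinate span, and the listed protein length matches the codon count once the stop codon is excluded.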