Gene EcolC_2183 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_2183 
Symbol 
ID6067466 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp2393248 
End bp2396295 
Gene Length3048 bp 
Protein Length1015 aa 
Translation table11 
GC content54% 
IMG OID641601590 
Productformate dehydrogenase, alpha subunit 
Protein accessionYP_001725149 
Protein GI170020195 
COG category[C] Energy production and conversion 
COG ID[COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
[TIGR01553] formate dehydrogenase, alpha subunit, proteobacterial-type 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.668487 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGTCA GTCGCAGACA ATTTTTTAAA ATCTGCGCGG GCGGTATGGC TGGAACAACA 
GTAGCGGCAT TGGGCTTTGC CCCGAAGCAA GCACTGGCTC AGGCGCGAAA CTACAAATTA
TTACGCGCTA AAGAGATCCG TAACACCTGC ACATACTGTT CCGTAGGTTG CGGGCTATTG
ATGTATAGCC TGGGTGATGG CGCGAAAAAC GCCAGAGAAG CGATTTATCA CATTGAAGGT
GATCCGGATC ATCCGGTAAG CCGTGGTGCG CTGTGCCCGA AAGGGGCCGG TTTGCTGGAT
TACGTCAACA GCGAAAACCG TCTGCGCTAC CCGGAATATC GTGCGCCAGG TTCTGACAAA
TGGCAGCGCA TTAGCTGGGA AGAAGCATTC TCCCGTATTG CGAAGCTGAT GAAAGCTGAC
CGTGACGCTA ACTTTATTGA AAAGAACGAG CAGGGCGTAA CGGTAAACCG TTGGCTTTCT
ACCGGTATGC TGTGTGCTTC CGGTGCCAGC AACGAAACCG GGATGCTGAC CCAGAAATTT
GCCCGCTCCC TCGGGATGCT GGCGGTAGAC AACCAGGCGC GCGTCTGACA CGGACCAACG
GTAGCAAGTC TTGCTCCAAC ATTTGGTCGC GGTGCGATGA CCAACCACTG GGTGGATATC
AAAAACGCTA ACGTCGTAAT GGTAATGGGC GGTAACGCTG CTGAAGCGCA TCCCGTCGGT
TTCCGCTGGG CGATGGAAGC GAAAAACAAC AACGATGCAA CCTTGATCGT TGTCGATCCC
CGTTTTACGC GTACCGCTTC TGTAGCGGAT ATTTACGCAC CTATTCGTTC CGGTACGGAC
ATTACGTTCC TGTCTGGCGT TTTGCGCTAC CTGATCGAAA ACAACAAAAT CAACGCCGAA
TACGTTAAGC ATTACACCAA CGCCAGCTTG CTGGTGCGTG ATGATTTTGC TTTCGAAGAC
GGTCTGTTCA GCGGCTACGA CGCTGAAAAA CGGCAATACG ATAAATCGTC CTGGAACTAT
CAGTTCGATG AAAACGGCTA TGCGAAACGC GATGAAACAC TGACTCATCC GCGCTGTGTG
TGGAACCTGC TGAAAGCGCA CGTTTCCCGC TACACGCCGG ACGTTGTTGA AAACATCTGT
GGTACGCCAA AAGCCGACTT CCTGAAAGTG TGTGAAGTGC TGGCCTCCAC CAGCGCACCG
GATCGCACAA CCACCTTCCT GTACGCGCTG GGCTGGACGC AGCACACCGT GGGTGCGCAG
AACATCCGTA CTATGGCGAT GATCCAGTTG CTGCTCGGTA ACATGGGTAT GGCCGGTGGC
GGCGTGAACG CATTGCGTGG TCACTCCAAC ATTCAGGGCC TGACTGACTT AGGCCTGCTC
TCTACCAGCC TGCCAGGTTA TCTGACGCTG CCGTCAGAAA AACAGGTTGA TTTGCAGTCG
TATCTGGAAG CGAACACGCC GAAAGCGACG CGGCCTGACC AGGTGAACTA CTGGAGCAAC
TATCCGAAGT TCTTCGTTAG CCTGATGAAA TCTTTCTATG GCGATGCCGC GCAGAAAGAG
AACAACTGGG GCTATGACTG GCTGCCGAAG TGGGACCAGA CCTACGACGT CATCAAGTAT
TTCAACATGA TGGACGAAGG CAAAGTCACC GGTTATTTCT GCCAGGGCTT TAACCCGGTT
GCGTCCTTCC CGGACAAAAA CAAAGTGGTG AGCTGCCTGA GCAAGCTGAA GTACATGGTG
GTTATCGATC CGCTGGTGAC TGAAACCTCT ACCTTCTGGC AGAACCACGG CGAGTCGAAC
GATGTCGATC CGGCGTCTAT TCAGACTGAA GTATTCCGTC TCCCTTCGAC CTGCTTTGCT
GAAGAAGATG GTTCTATCGC TAACTCCGGT CGCTGGTTGC AGTGGCACTG GAAAGGTCAG
GACGCGCCGG GCGAAGCGCG TAACGACGGC GAAATTCTGG CGGGTATTTA CCATCATCTG
CGCGAGCTGT ACCAGGCCGA AGGTGGTAAA GGCGTAGAAC CGCTGATGAA GATGAGCTGG
AACTACAAGC AACCGCACGA ACCGCAATCT GACGAAGTGG CTAAAGAGAA CAACGGCTAT
GCGCTGGAAG ATCTCTATGA CGCTAATGGC GTGCTGATTG CGAAGAAAGG TCAGTTGCTG
AGTAGCTTTG CGCATCTGCG TGATGACGGT ACAACCGCAT CTTCTTGCTG GATCTACACC
GGTAGCTGGA CAGAGCAGGG CAACCAGATG GCTAACCGCG ATAACTCCGA CCCATCCGGT
CTGGGGAATA CGCTGGGATG GGCCTGGGCG TGGCCGCTCA ACCGTCGCGT GCTCTACAAC
CGTGCTTCGG CGGATATCAA TGGTAAACCG TGGGATCCGA AACGGATGCT GATCCAGTGG
AACGGCAGCA AGTGGACGGG TAACGATATT CCGGACTTCG GCAATGCCGC ACCGGGTACG
CCAACCGGGC CGTTTATCAT GCAGCCGGAA GGGATGGGAC GCCTGTTTGC CATCAACAAA
ATGGCGGAAG GTCCGTTCCC GGAACACTAC GAGCCGATTG AAACGCCGCT GGGCACTAAC
CCGCTGCATC CGAACGTGGT GTCTAACCCG GTCGTTCGTC TGTATGAACA AGACGCGCTG
CGGATGGGTA AAAAAGAGCA GTTCCCGTAT GTGGGTACGA CCTATCGTCT GACCGAGCAC
TTCCACACCT GGACCAAGCA CGCATTGCTC AACGCAATTG CTCAGCCGGA ACAGTTTGTG
GAAATCAGCG AAACGCTGGC GGCGGCGAAA GGCATTAATA ATGGCGATCG TGTCACTGTC
TCAAGCAAAC GTGGCTTTAT CCGCGCGGTG GCAGTGGTAA CGCGCCGTCT GAAGCCGCTG
AATGTAAACG GTCAGCAGGT TGAAACGGTG GGTATTCCAA TCCACTGGGG CTTTGAGGGT
GTCGCGCGTA AAGGTTATAT CGCTAACACT CTGACGCCGA ATGTCGGTGA TGCAAACTCG
CAAACGCCGG AATATAAAGC GTTCTTAGTC AACATCGAGA AGGCGTAA
 
Protein sequence
MDVSRRQFFK ICAGGMAGTT VAALGFAPKQ ALAQARNYKL LRAKEIRNTC TYCSVGCGLL 
MYSLGDGAKN AREAIYHIEG DPDHPVSRGA LCPKGAGLLD YVNSENRLRY PEYRAPGSDK
WQRISWEEAF SRIAKLMKAD RDANFIEKNE QGVTVNRWLS TGMLCASGAS NETGMLTQKF
ARSLGMLAVD NQARVUHGPT VASLAPTFGR GAMTNHWVDI KNANVVMVMG GNAAEAHPVG
FRWAMEAKNN NDATLIVVDP RFTRTASVAD IYAPIRSGTD ITFLSGVLRY LIENNKINAE
YVKHYTNASL LVRDDFAFED GLFSGYDAEK RQYDKSSWNY QFDENGYAKR DETLTHPRCV
WNLLKAHVSR YTPDVVENIC GTPKADFLKV CEVLASTSAP DRTTTFLYAL GWTQHTVGAQ
NIRTMAMIQL LLGNMGMAGG GVNALRGHSN IQGLTDLGLL STSLPGYLTL PSEKQVDLQS
YLEANTPKAT RPDQVNYWSN YPKFFVSLMK SFYGDAAQKE NNWGYDWLPK WDQTYDVIKY
FNMMDEGKVT GYFCQGFNPV ASFPDKNKVV SCLSKLKYMV VIDPLVTETS TFWQNHGESN
DVDPASIQTE VFRLPSTCFA EEDGSIANSG RWLQWHWKGQ DAPGEARNDG EILAGIYHHL
RELYQAEGGK GVEPLMKMSW NYKQPHEPQS DEVAKENNGY ALEDLYDANG VLIAKKGQLL
SSFAHLRDDG TTASSCWIYT GSWTEQGNQM ANRDNSDPSG LGNTLGWAWA WPLNRRVLYN
RASADINGKP WDPKRMLIQW NGSKWTGNDI PDFGNAAPGT PTGPFIMQPE GMGRLFAINK
MAEGPFPEHY EPIETPLGTN PLHPNVVSNP VVRLYEQDAL RMGKKEQFPY VGTTYRLTEH
FHTWTKHALL NAIAQPEQFV EISETLAAAK GINNGDRVTV SSKRGFIRAV AVVTRRLKPL
NVNGQQVETV GIPIHWGFEG VARKGYIANT LTPNVGDANS QTPEYKAFLV NIEKA