Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ent638_1750 |
Symbol | |
ID | 5113271 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Enterobacter sp. 638 |
Kingdom | Bacteria |
Replicon accession | NC_009436 |
Strand | + |
Start bp | 1902805 |
End bp | 1905855 |
Gene Length | 3051 bp |
Protein Length | 1016 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 640491939 |
Product | formate dehydrogenase alpha subunit |
Protein accession | YP_001176480 |
Protein GI | 146311406 |
COG category | [C] Energy production and conversion |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence [TIGR01553] formate dehydrogenase, alpha subunit, proteobacterial-type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.00182445 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGAGTTTA GCCGGAGGCA GTTCTTTAAG ATCTGCGCGG GCGGTATGGC AGGAACAACT GTTGCATCTC TCGGATTTTT ATCATCTTTT TCCGCGATCG CGGAAACACG TCAATACAAA CTCTTAAAAG CAAAAGAGAC GCGTAATAAC TGTACATACT GCTCAGTGGG CTGCGGCATG ATCATGTATA GCCTCGGCGA CGATGCCAAA AACGTCAAAG AAAGTATCTA TCACGTCGAA GGCGACTCGG ATCACCCGGT CAGCCGTGGG TCGCTTTGCC CGAAAGGGGC AGGGGTACTG GATTACATTC ACAGTGATAC TCGCCTGCTG TATCCCGAAT ACCGCGCGCC GGGGTCTGAT AAATGGCAGC GTATTTCATG GGATGACGCC ATTGAGCGTA TCGCCCGCCT GATGAAAGCG GATCGTGATG CCAACTTTAT CGAAAAGAAC GCCCAGGGGC TGACGGTCAA CCGCTGGACG ACCACGGGCA TGCTCTGTTC GTCGGCGGCC AGCAATGAAA CCGGTATTCT CGACGGAAAG TTTGCCCGCG CGCTGGGCAT GGTGGCGATC GACTGTCAGG CGCGTTTGTG CCACGGTCCA ACCGTTGCTG CGCTGGCACC GACCTTCGGG CGCGGGGCGA TGACCAATAA CTGGGTTGAT ATCAAAAACG CCAACGTGGT GTTGATCATG GGCGGCAACG CGGCGGAAGC CCATCCGGTC GGCTTTAAAT GGGTTGTTGA AGCGCAGACC AAAAACGACG CCACGGTGGT GGTGGTCGAT CCGCGCTTTA ACCGCAGCGC GGCGGTGGCC GATCTGTATG CGCCGATTCG CGCCGGGTCT GACACTGCGT TTCTGCTGGG GGCCATTCGC TATTTGATCG CGCATGACGC CATTCAGCAC GAATACGTCC GCGCCTATAC TAACGCCAGC CTGATTATCC GCGACGACTA TGCGTTCGAT GACGGTCTGT TCAGCGGCTA TGACGCCGAA AAGCGTCAGT ACGATAAATC GAGCTGGTTC TATCAACTGG ACGAGCAGGG CCACGTGCAG CGTGACGATA CGCTCAGCCA CCCGCGCTGC GTCTGGAATC TGCTTAAAGC GCACGTCGAT CGCTATACGC CGGAGATGGT GAACCGCTTG TGCGGCACGT CGATTGATGA TTTCAACCGC ATTTGCGCGA TCCTTGCCAG CACCAGCGTA CCGGACCGCA CCGCGACAAT CTTGTACGCA CTGGGCTGGA CGCACCATTC GGCGGGCGCA CAGATCATCC GTGCGGCGGG AATGTTGCAG TTGCTGCTGG GCAATATCGG TATGGCAGGC GGCGGCGTCA ACGCCCTTCG CGGTCACTCC AATATTCAGG GCTACACCGA TCTGGGGTTG CTCTCAACCA ATCTGCCGGG CTACATGCCT CTGCCGTCCG AAAAACAGCC GGATTATCAG ACCTATATCT CGCAGATCAC GCCGCCGTCG CTGGGGCTGA ACGAAGTGAA CTACTGGCAA AACACGCCGA AGTTCTTTAT CAGCATGATG AAAAGCTTCT GGGGCGAGCA TGCCACGGCG GACAACAACT GGGGCTACGA CTGGCTGCCG AAATGGGATC GTTTGTATGA CGTGATGACC CAGGCCAAGC TGATGCTCGA CGGCAAAATC AATGGTTACA TCGTTCAGGG CTTTAACCCG CTGGCGGCGT TCCCGGATAA AAACAAATCG TCCCGCGCGC TCTCGAAGCT CAAATACATG GTGGTTATCG ATCCGCTGGT CACCGAGTCG TCGACGTTCT GGCAGCATCA CGGTGAGATG AACGACGTGA ATCCGGCAGA TATTCAGACC GAAGTCTTCC GCCTGCCATC GTCCTGTTTT GCGGAAGAAG ACGGCTCGAT TGCTAACTCT GGCCGCTGGC TGCAATGGCA CTGGGCGGCC GCTGAGCCAC CGGGTGAAGC GATGCACGAT GGCAAAATCC TTGGCCGTCT GTTTACGCGC CTGCGCGAAC TCTATCAGGC CGAAGGCGGG GCAAACCCGG CGCCGGTGCT GAACATGTCC TGGGATTACA AAAATCCCCG CGATCCGCAT CCGGAAGAGA TTGCCCGCGA AGCCAACGGC ATGGCGCTGG TGGATTTGTA TGATGACAAA GGCCAACTGG TGGCGAAAAA AGGCCAGCAG CTCAGCAGTT TTGCGCAGTT ACGCGATGAC GGCACCACCA GCAGCTTCTG CTGGGTGTAC TGCGGAAGCT GGACCGAGCA GGGCAATCAG ATGGCGAACC GCGATAACAG CGATCCTTAT GGGCTGGGCT GTACGCCAGG CTGGGCGTGG TCGTGGCCGG CGAACCGTCG CATTCTGTAC AACCGCGCCT CTGCCGATGT GGCCGGGAAA CCCTGGGACG CCAAACGTGC CTTGCTGCAC TGGGATGGCA AAAAATGGAC CGGTCAGGAC GTGGCGGATT ACAACGCCTC GGCACCGGGT AGCAACGTCG GGCCGTTTAT CATGAATCCA GAAGGGGTGG CGCGCCTGTT CTCCATCGAC AAGATGAACG ACGGCCCGTT CCCGGAACAT TACGAACCGA TTGAATCACC GATTGGCACC AACCCGCTGC ATCCGAATGT CATCTCCAGC CCCGTTGCGC GGATCTTCAA AGAAGACCTG CCGAATATGG GCAAAGCGGA TGACTTCCCG TATGTCGCCA CGACCTATTC GATCACCGAG CTGTTCCGTC ACTGGACTAA GCATGCGCGG CTCAACGCGA TTGCACAGCC GGATCAGTTT GTCGAAATTG GCGAAGCGCT GGCGCAAGAG AAGGGCATTG TTGCCGGGGA TGAAGTGAAA GTGATGTCGA AACGAGGCTT TATCAAAGCA AAAGCGGTGG TCACTAAACG CCTGCAAACC CTGACCATTG ACGGTCGCAA GGTCAACACC GTGGGCATTC CGTGTCACTG GGGCTTTGAG GGGGCAACGC GTAAAGGGTT CCTGGCCAAT ACGTTGACGC CATCCGTGGG CGACGCCAAC TCGCAGACGC CGGAGTACAA GGCGTTTTTA GTCGACATCG AGAAGGCGTA A
|
Protein sequence | MEFSRRQFFK ICAGGMAGTT VASLGFLSSF SAIAETRQYK LLKAKETRNN CTYCSVGCGM IMYSLGDDAK NVKESIYHVE GDSDHPVSRG SLCPKGAGVL DYIHSDTRLL YPEYRAPGSD KWQRISWDDA IERIARLMKA DRDANFIEKN AQGLTVNRWT TTGMLCSSAA SNETGILDGK FARALGMVAI DCQARLCHGP TVAALAPTFG RGAMTNNWVD IKNANVVLIM GGNAAEAHPV GFKWVVEAQT KNDATVVVVD PRFNRSAAVA DLYAPIRAGS DTAFLLGAIR YLIAHDAIQH EYVRAYTNAS LIIRDDYAFD DGLFSGYDAE KRQYDKSSWF YQLDEQGHVQ RDDTLSHPRC VWNLLKAHVD RYTPEMVNRL CGTSIDDFNR ICAILASTSV PDRTATILYA LGWTHHSAGA QIIRAAGMLQ LLLGNIGMAG GGVNALRGHS NIQGYTDLGL LSTNLPGYMP LPSEKQPDYQ TYISQITPPS LGLNEVNYWQ NTPKFFISMM KSFWGEHATA DNNWGYDWLP KWDRLYDVMT QAKLMLDGKI NGYIVQGFNP LAAFPDKNKS SRALSKLKYM VVIDPLVTES STFWQHHGEM NDVNPADIQT EVFRLPSSCF AEEDGSIANS GRWLQWHWAA AEPPGEAMHD GKILGRLFTR LRELYQAEGG ANPAPVLNMS WDYKNPRDPH PEEIAREANG MALVDLYDDK GQLVAKKGQQ LSSFAQLRDD GTTSSFCWVY CGSWTEQGNQ MANRDNSDPY GLGCTPGWAW SWPANRRILY NRASADVAGK PWDAKRALLH WDGKKWTGQD VADYNASAPG SNVGPFIMNP EGVARLFSID KMNDGPFPEH YEPIESPIGT NPLHPNVISS PVARIFKEDL PNMGKADDFP YVATTYSITE LFRHWTKHAR LNAIAQPDQF VEIGEALAQE KGIVAGDEVK VMSKRGFIKA KAVVTKRLQT LTIDGRKVNT VGIPCHWGFE GATRKGFLAN TLTPSVGDAN SQTPEYKAFL VDIEKA
|
| |