Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gura_3388 |
Symbol | |
ID | 5166783 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter uraniireducens Rf4 |
Kingdom | Bacteria |
Replicon accession | NC_009483 |
Strand | - |
Start bp | 3978096 |
End bp | 3981128 |
Gene Length | 3033 bp |
Protein Length | 1010 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640550873 |
Product | formate dehydrogenase, alpha subunit |
Protein accession | YP_001232117 |
Protein GI | 148265411 |
COG category | [C] Energy production and conversion |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence [TIGR01553] formate dehydrogenase, alpha subunit, proteobacterial-type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTTTTT CACGAAGACA ATTTTTGCAG GGGGGAGTGC TGGCTGCTGC CGGCGTGGCT CTTTCCGGCA AGCCGGGCGA GGCGAGCGTC GATTCCCCGG AAATGCGCAC CAAGGGTTTG AAGGCTTCGA CCACAATCTG TCCATTCTGC GCGGTTGGAT GCGGTCTTAT CGTTCACAGC AAAAACGGCA AAATCATCAA CATCGAAGGT GATCCGCAGC ATCCCATCAA CCAGGGGGCG CTCTGCTCCA AGGGGAGTTC CCTGTTCCAG GTGGCCAACA ACGAACGGCG CCTGCAAAAG GTCATGTACC GCGCCCCCGG ATCCGACAAG TGGGAAGAGA AGTCCTGGGA CTGGGCCCTC GACCGGATCG CGGCGAAGAT GAAGGAGACC CGCGACCGGA CCTTCAAGGC AAAAGAGATC AACAAGAAGG ACAACAAGGA ATACGTGGTC AACCGCAACG AGGGGATGGC TTTCCTCGGC GGCGCCGGAC TCGACAACGA GGAGTGCTAC CTCTGGTCGA AATTTGCCCG CGCCATGGGG GTTGCCAACC TGGAACATCA GGCCCGAATA TGACACTCAG CTACAGTCGC CGGTCTGGCG GCTTCGTTTG GCCGTGGTGC CATGACAAAC CATTGGATAG ACCTGAAGAA CAGTGATTGC ATCCTCGCCA TCGGCTGTAA CCCGGCCGAG AACCACCCCA TTTCCTTCAA ATGGATTGAA ACAGCCATGG ATAGCGGCGG CAAGCTGATA GCCGTCGATC CCCGTTTTAC CCGGACAGCA AGTAAGGCGG ACATCTATGC CCAGATCCGT CCCGGCACCG ACATCGCCTT CCTCGGCGGG ATGATCAACT ACGCATTGCA GAACAACCTG ACCCATGCGG AGTACGTGCG GGAGTACACC AATGCCGCGT TCATCGTCTC GGAAACGTAT GACTTCGAGG ACGGCCTCTT CTGCGCCTTC GACGACCAGG AAAAGTCCTA CGACCTGAAG TCCTGGGCTT ATCAGACCGA CGGGGCGGGG AACCCGAAAC AGGACAAGAC CCTCCAGAAT CCGCGCTGCG TCTATCAGCT GATGAAGAAG CACTTCTCCC GCTATGACGT GGACAAGGTC TGCGCCATCA CCGGGACCAG GAAGGAAGAC TACCTTGCCG TGGCCAGGGC GTTCTGCGCC ACCGGCCGTC CCGACAAGGC CGGCACCATC ATGTACGCCA TGGGGATCAC CCAGTCCACC CACGGCACCC AGAATGTCCG CGCCGTGGCC ATGCTCCAGA TGCTCCTGGG GAACATCGGC ATTGCCGGCG GCGGGGTAAA TGCGCTGCGT GGTGAGTCCA ACGTCCAGGG CTCTTCCGAC TACGGCCTCC TCTTCCACCT CCTCCCCGGC TACCTGAAGT CGCCGGAGTT CGACAACACC GATTTGAAGG CGTACCTGGA GAAGTGGACG CCGAAATCCA AGGACAAGAA GAGCGCCAAC TGGATGGGCA ACACCCCGAA ATACACGGTG AGCCTCCTCA AGGCCTGGTA CGGCGACAAC GCCAAAAAGG AAAACGATTT CTGCTACGAC TACCTCCCCA AGCGGAGCGG CAACTACTCC TTCATGAAGC TGATGGAAAA GATGGGGCAG GGTGGGCTGG ACGGTCTCGT CTGCATGGGG CAGAACCCGG CAGTCGGCGG CCCTGATTCC ACCAAGACCC GAGAAGCGCT AGGCAAGCTC AAGTGGCTCG TCACCGTCGA CCTGTGGGAG ACGGAAACCT CCATCTTCTG GAAGCGCCCC GGCGTGAAGC CTGCGGATAT CCAGACCGAG GTCTTCATGC TCCCGGCCGC ATCAAGCGTG GAGAAGGAAG GCTCGATCTC CAACTCCGGC CGCTGGGCCC AGTGGCGTTA CAAGGCTGTG GAGCCGGTCG GGGAGGCCAG GAGCGACCTC TGGATCATCG ACCAGTTCTA CAAGCGGGTC AAAACCCTCT ATACAAAGGG GGGGGCCTTT CCCGAGCCGC TCACCAGGCT TTCCTGGAAC TACGGCAGCG GCCACGAGCC TGAAGTGCAG CTGGTGGCGA AGGAGATCAA CGGCTACTTC ACCAGGGACA TGAGCATCAA GGACAAGGAC AAGACCCTTG AGTTCAAGGC GGGCGACCAG GTCCCCATGT TCAAATACCT ACAGGACGAC GGTTCCACCG TCTCCGGCTG CTGGATCTAC TGCGGCTCCT TCACCAAGGA CGGCAACCAG ATGGCCCGCC GCGACCTGAC CGACGCTCCA AACAATCTGG GGCTCTTTCC CAAGTGGGCG TGGTGCTGGC CGGTCAACCG CCGCATCATC TACAACCGCG CCTCGGTCAA CCCCGAAGGT ATCCCGTTTA ACCCGAAACG GCCGGTCATC GCCTGGGACC CGCTGGAGAA GAAGTGGAAG GGTGATGTCC CGGACGGCCC CTGGCCGCCC ATGAAGGACG ACAAGGAAGG AAAGTATCCC TTCATCATGG TGCCGGAGGG GCTCGGACGG CTCTACGCCC TGGACATGAA GGACGGCCCG TTCCCCGAGC ATTACGAGCC GGTGGAAAGC CCGGCAAAGA ACCAGCTTTC CAGCGTTCAG AACAACCCGG CGGTCAAGCT GCCGAAAAAC GTTTCCAGCG ACACAGCCAA GTTTCCCTAC ATAGGCACCA CCTACCGGAT GACGGAGCAC TGGCAGGCAG GGGCCATGAC GCGGAACCTT CCCTGGCTGG TGGAACTGGT TCCCGACATG TTCATCGAGA TCAGCGAGAC GCTGGCCAGG AAGAAGGGGC TTGCAAACGG CGACAAGGTG CGCATTACCA CCGAGCGCGG CTCCATCGAG GCCGTGACCC TCGTCACTGC CAGGCTCAAG CCGTTCAATG TGGAAGGCAA GATGATCGAA CAGGTGGGAC TGCCGTGGCA TTTCGGCTAC GCCGGTCTTG CCAAGGGAGA CAGCGGCAAC GTCCTGACGC CATCGGTCGG CTGCGCAAAC ACGAGCATCC CCGAATTCAA GGCATTCCTC TGCAATATCG AGAAAGGGGG TAAGCGCTCA TGA
|
Protein sequence | MGFSRRQFLQ GGVLAAAGVA LSGKPGEASV DSPEMRTKGL KASTTICPFC AVGCGLIVHS KNGKIINIEG DPQHPINQGA LCSKGSSLFQ VANNERRLQK VMYRAPGSDK WEEKSWDWAL DRIAAKMKET RDRTFKAKEI NKKDNKEYVV NRNEGMAFLG GAGLDNEECY LWSKFARAMG VANLEHQARI UHSATVAGLA ASFGRGAMTN HWIDLKNSDC ILAIGCNPAE NHPISFKWIE TAMDSGGKLI AVDPRFTRTA SKADIYAQIR PGTDIAFLGG MINYALQNNL THAEYVREYT NAAFIVSETY DFEDGLFCAF DDQEKSYDLK SWAYQTDGAG NPKQDKTLQN PRCVYQLMKK HFSRYDVDKV CAITGTRKED YLAVARAFCA TGRPDKAGTI MYAMGITQST HGTQNVRAVA MLQMLLGNIG IAGGGVNALR GESNVQGSSD YGLLFHLLPG YLKSPEFDNT DLKAYLEKWT PKSKDKKSAN WMGNTPKYTV SLLKAWYGDN AKKENDFCYD YLPKRSGNYS FMKLMEKMGQ GGLDGLVCMG QNPAVGGPDS TKTREALGKL KWLVTVDLWE TETSIFWKRP GVKPADIQTE VFMLPAASSV EKEGSISNSG RWAQWRYKAV EPVGEARSDL WIIDQFYKRV KTLYTKGGAF PEPLTRLSWN YGSGHEPEVQ LVAKEINGYF TRDMSIKDKD KTLEFKAGDQ VPMFKYLQDD GSTVSGCWIY CGSFTKDGNQ MARRDLTDAP NNLGLFPKWA WCWPVNRRII YNRASVNPEG IPFNPKRPVI AWDPLEKKWK GDVPDGPWPP MKDDKEGKYP FIMVPEGLGR LYALDMKDGP FPEHYEPVES PAKNQLSSVQ NNPAVKLPKN VSSDTAKFPY IGTTYRMTEH WQAGAMTRNL PWLVELVPDM FIEISETLAR KKGLANGDKV RITTERGSIE AVTLVTARLK PFNVEGKMIE QVGLPWHFGY AGLAKGDSGN VLTPSVGCAN TSIPEFKAFL CNIEKGGKRS
|
| |