Gene Gura_3388 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGura_3388 
Symbol 
ID5166783 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter uraniireducens Rf4 
KingdomBacteria 
Replicon accessionNC_009483 
Strand
Start bp3978096 
End bp3981128 
Gene Length3033 bp 
Protein Length1010 aa 
Translation table11 
GC content60% 
IMG OID640550873 
Productformate dehydrogenase, alpha subunit 
Protein accessionYP_001232117 
Protein GI148265411 
COG category[C] Energy production and conversion 
COG ID[COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
[TIGR01553] formate dehydrogenase, alpha subunit, proteobacterial-type 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGTTTTT CACGAAGACA ATTTTTGCAG GGGGGAGTGC TGGCTGCTGC CGGCGTGGCT 
CTTTCCGGCA AGCCGGGCGA GGCGAGCGTC GATTCCCCGG AAATGCGCAC CAAGGGTTTG
AAGGCTTCGA CCACAATCTG TCCATTCTGC GCGGTTGGAT GCGGTCTTAT CGTTCACAGC
AAAAACGGCA AAATCATCAA CATCGAAGGT GATCCGCAGC ATCCCATCAA CCAGGGGGCG
CTCTGCTCCA AGGGGAGTTC CCTGTTCCAG GTGGCCAACA ACGAACGGCG CCTGCAAAAG
GTCATGTACC GCGCCCCCGG ATCCGACAAG TGGGAAGAGA AGTCCTGGGA CTGGGCCCTC
GACCGGATCG CGGCGAAGAT GAAGGAGACC CGCGACCGGA CCTTCAAGGC AAAAGAGATC
AACAAGAAGG ACAACAAGGA ATACGTGGTC AACCGCAACG AGGGGATGGC TTTCCTCGGC
GGCGCCGGAC TCGACAACGA GGAGTGCTAC CTCTGGTCGA AATTTGCCCG CGCCATGGGG
GTTGCCAACC TGGAACATCA GGCCCGAATA TGACACTCAG CTACAGTCGC CGGTCTGGCG
GCTTCGTTTG GCCGTGGTGC CATGACAAAC CATTGGATAG ACCTGAAGAA CAGTGATTGC
ATCCTCGCCA TCGGCTGTAA CCCGGCCGAG AACCACCCCA TTTCCTTCAA ATGGATTGAA
ACAGCCATGG ATAGCGGCGG CAAGCTGATA GCCGTCGATC CCCGTTTTAC CCGGACAGCA
AGTAAGGCGG ACATCTATGC CCAGATCCGT CCCGGCACCG ACATCGCCTT CCTCGGCGGG
ATGATCAACT ACGCATTGCA GAACAACCTG ACCCATGCGG AGTACGTGCG GGAGTACACC
AATGCCGCGT TCATCGTCTC GGAAACGTAT GACTTCGAGG ACGGCCTCTT CTGCGCCTTC
GACGACCAGG AAAAGTCCTA CGACCTGAAG TCCTGGGCTT ATCAGACCGA CGGGGCGGGG
AACCCGAAAC AGGACAAGAC CCTCCAGAAT CCGCGCTGCG TCTATCAGCT GATGAAGAAG
CACTTCTCCC GCTATGACGT GGACAAGGTC TGCGCCATCA CCGGGACCAG GAAGGAAGAC
TACCTTGCCG TGGCCAGGGC GTTCTGCGCC ACCGGCCGTC CCGACAAGGC CGGCACCATC
ATGTACGCCA TGGGGATCAC CCAGTCCACC CACGGCACCC AGAATGTCCG CGCCGTGGCC
ATGCTCCAGA TGCTCCTGGG GAACATCGGC ATTGCCGGCG GCGGGGTAAA TGCGCTGCGT
GGTGAGTCCA ACGTCCAGGG CTCTTCCGAC TACGGCCTCC TCTTCCACCT CCTCCCCGGC
TACCTGAAGT CGCCGGAGTT CGACAACACC GATTTGAAGG CGTACCTGGA GAAGTGGACG
CCGAAATCCA AGGACAAGAA GAGCGCCAAC TGGATGGGCA ACACCCCGAA ATACACGGTG
AGCCTCCTCA AGGCCTGGTA CGGCGACAAC GCCAAAAAGG AAAACGATTT CTGCTACGAC
TACCTCCCCA AGCGGAGCGG CAACTACTCC TTCATGAAGC TGATGGAAAA GATGGGGCAG
GGTGGGCTGG ACGGTCTCGT CTGCATGGGG CAGAACCCGG CAGTCGGCGG CCCTGATTCC
ACCAAGACCC GAGAAGCGCT AGGCAAGCTC AAGTGGCTCG TCACCGTCGA CCTGTGGGAG
ACGGAAACCT CCATCTTCTG GAAGCGCCCC GGCGTGAAGC CTGCGGATAT CCAGACCGAG
GTCTTCATGC TCCCGGCCGC ATCAAGCGTG GAGAAGGAAG GCTCGATCTC CAACTCCGGC
CGCTGGGCCC AGTGGCGTTA CAAGGCTGTG GAGCCGGTCG GGGAGGCCAG GAGCGACCTC
TGGATCATCG ACCAGTTCTA CAAGCGGGTC AAAACCCTCT ATACAAAGGG GGGGGCCTTT
CCCGAGCCGC TCACCAGGCT TTCCTGGAAC TACGGCAGCG GCCACGAGCC TGAAGTGCAG
CTGGTGGCGA AGGAGATCAA CGGCTACTTC ACCAGGGACA TGAGCATCAA GGACAAGGAC
AAGACCCTTG AGTTCAAGGC GGGCGACCAG GTCCCCATGT TCAAATACCT ACAGGACGAC
GGTTCCACCG TCTCCGGCTG CTGGATCTAC TGCGGCTCCT TCACCAAGGA CGGCAACCAG
ATGGCCCGCC GCGACCTGAC CGACGCTCCA AACAATCTGG GGCTCTTTCC CAAGTGGGCG
TGGTGCTGGC CGGTCAACCG CCGCATCATC TACAACCGCG CCTCGGTCAA CCCCGAAGGT
ATCCCGTTTA ACCCGAAACG GCCGGTCATC GCCTGGGACC CGCTGGAGAA GAAGTGGAAG
GGTGATGTCC CGGACGGCCC CTGGCCGCCC ATGAAGGACG ACAAGGAAGG AAAGTATCCC
TTCATCATGG TGCCGGAGGG GCTCGGACGG CTCTACGCCC TGGACATGAA GGACGGCCCG
TTCCCCGAGC ATTACGAGCC GGTGGAAAGC CCGGCAAAGA ACCAGCTTTC CAGCGTTCAG
AACAACCCGG CGGTCAAGCT GCCGAAAAAC GTTTCCAGCG ACACAGCCAA GTTTCCCTAC
ATAGGCACCA CCTACCGGAT GACGGAGCAC TGGCAGGCAG GGGCCATGAC GCGGAACCTT
CCCTGGCTGG TGGAACTGGT TCCCGACATG TTCATCGAGA TCAGCGAGAC GCTGGCCAGG
AAGAAGGGGC TTGCAAACGG CGACAAGGTG CGCATTACCA CCGAGCGCGG CTCCATCGAG
GCCGTGACCC TCGTCACTGC CAGGCTCAAG CCGTTCAATG TGGAAGGCAA GATGATCGAA
CAGGTGGGAC TGCCGTGGCA TTTCGGCTAC GCCGGTCTTG CCAAGGGAGA CAGCGGCAAC
GTCCTGACGC CATCGGTCGG CTGCGCAAAC ACGAGCATCC CCGAATTCAA GGCATTCCTC
TGCAATATCG AGAAAGGGGG TAAGCGCTCA TGA
 
Protein sequence
MGFSRRQFLQ GGVLAAAGVA LSGKPGEASV DSPEMRTKGL KASTTICPFC AVGCGLIVHS 
KNGKIINIEG DPQHPINQGA LCSKGSSLFQ VANNERRLQK VMYRAPGSDK WEEKSWDWAL
DRIAAKMKET RDRTFKAKEI NKKDNKEYVV NRNEGMAFLG GAGLDNEECY LWSKFARAMG
VANLEHQARI UHSATVAGLA ASFGRGAMTN HWIDLKNSDC ILAIGCNPAE NHPISFKWIE
TAMDSGGKLI AVDPRFTRTA SKADIYAQIR PGTDIAFLGG MINYALQNNL THAEYVREYT
NAAFIVSETY DFEDGLFCAF DDQEKSYDLK SWAYQTDGAG NPKQDKTLQN PRCVYQLMKK
HFSRYDVDKV CAITGTRKED YLAVARAFCA TGRPDKAGTI MYAMGITQST HGTQNVRAVA
MLQMLLGNIG IAGGGVNALR GESNVQGSSD YGLLFHLLPG YLKSPEFDNT DLKAYLEKWT
PKSKDKKSAN WMGNTPKYTV SLLKAWYGDN AKKENDFCYD YLPKRSGNYS FMKLMEKMGQ
GGLDGLVCMG QNPAVGGPDS TKTREALGKL KWLVTVDLWE TETSIFWKRP GVKPADIQTE
VFMLPAASSV EKEGSISNSG RWAQWRYKAV EPVGEARSDL WIIDQFYKRV KTLYTKGGAF
PEPLTRLSWN YGSGHEPEVQ LVAKEINGYF TRDMSIKDKD KTLEFKAGDQ VPMFKYLQDD
GSTVSGCWIY CGSFTKDGNQ MARRDLTDAP NNLGLFPKWA WCWPVNRRII YNRASVNPEG
IPFNPKRPVI AWDPLEKKWK GDVPDGPWPP MKDDKEGKYP FIMVPEGLGR LYALDMKDGP
FPEHYEPVES PAKNQLSSVQ NNPAVKLPKN VSSDTAKFPY IGTTYRMTEH WQAGAMTRNL
PWLVELVPDM FIEISETLAR KKGLANGDKV RITTERGSIE AVTLVTARLK PFNVEGKMIE
QVGLPWHFGY AGLAKGDSGN VLTPSVGCAN TSIPEFKAFL CNIEKGGKRS