Gene GSU3019 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU3019 
Symbol 
ID2686819 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp3312678 
End bp3314636 
Gene Length1959 bp 
Protein Length652 aa 
Translation table11 
GC content49% 
IMG OID637127712 
Productdehydrogenase, E1 component, alpha and beta subunits 
Protein accessionNP_954061 
Protein GI39998110 
COG category[C] Energy production and conversion 
COG ID[COG0022] Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, beta subunit
[COG1071] Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, alpha subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACAAGC GCAAGATCGA TATGCTCATG AAAGACTGCT ATGAGCTGAC TGTCACTGCC 
CTCACAATCA GAAAAGTTGA AGAACGTCTA TTGGAGCTCT TTTCTGAAGG AGTACTCAAC
GGAACAATCC ACACCTGTAT CGGACAGGAG TGGACCGGAG TGGCGGTTGC AAACGCACTG
CAAGCGGGAG ATACAGTTTT CTCCAATCAC CGCGGTCATG GACATTATAT TGCCCTGACT
GGTGATGTTT ATGGACTTAT CGCAGAAATC ATGGGCAAAG ACGACGGTGT CTGCGGAGGC
GTTGGAGGTA GCCAGCATCT ACATACAGAA AACTTCTTCT CTAACGGGAT TCAAGGCGGA
ATGGTGCCGG TCGCGGCTGG AAGAGCCTTG GCAAACGCGC TTCAAGGCAA CAATGCAATT
TCAGTGGTTT TTATCGGAGA TGGAACTCTT GGAGAAGGTG TTATCTATGA AACGTTTAAC
ATCGCATCCA AATGGCAGTT GCCACTGTTA GTTGTCCTGG AAAACAATCA GTACGCTCAG
TCCACACCCA CATCCCTGAC ATTGGCAGGC AATATACGGG ACCGCGTACG TGGCTTCGGC
ATAGAATATA TCAAGTGTGA CACATGGGAC ATCGCCGGGC TGCTTGACTC TGCAAAAGAG
GCCGTTGATT GCGTACGCAA GAACCAAAAG CCTGTATTGC TCGAAATTGA TACATACCGG
CTGAAAGCCC ACTCGAAGGG AGATGATCTT AGAGACCCTG TAGAGATCAG CCGCTACGCT
GGGCAGGACA GCATAAATGC TCTACTGGAA TCTGACGTTC CAAGAGTCGC TGAAACCGTC
AATCAAATCG ACAGCAATAT CCAACAGGCA ATCACCAAAG CCAGGGAAGC GACATTATGT
TCTTTCGCTC CGGCATCGAA CAGTGTTCGA CAGTATCAAT CAGTTACGTG GAGAACTGAA
TCCTTTGCCA GACAAAGAAT AATCACCTCA ATCAATTTAT CATTGCAATC GCTTCTGGAA
AACAACTCTA AGGCAGTAAT TATCGGGGAG GACATTGAAG CCCCCTACGG AGGAGCTTTC
AAAGCCACAA AGGATCTGAG TACGTTGTTC CCGGGAAGAG TCAAGAACAC CCCCATCAGC
GAGGGAGCGA TTACAGGCGT AGGCATCGGT CTGGCACTGA GCGGATTTCT TCCCGTTGTG
GAGATAATGT TCGGCGACTT CATGACTCTA ACCTTTGACC AACTGCTTCA ACACGCCGGT
AAATTCTGTG AGATGTATGG CAAGGATCTC GACGTACCCT TGATCATTCG CACTCCCATG
GGCGGACGTC GGGGTTATGG CCCAACACAC AGCCAATCTT TGGAGAAATT TTTTCTCGGC
ATACCGAATC TGGAGGTCAT TGCGTACAAC CATAGAGTTT CTCCCGCCCT GATCTTTGGA
AATCTGTGCA AGACGATCCG CAGGCCAACT CTCATCATCG AAAACAAGGT CCTCTATACT
CAGCACGTAG ACAGTACTCC CATGCCTGGA TTCCGCATCA ATATTTCCGA CGAACTATTC
CCAACCGTCA GAATATCGCC AAGTACCGGA GATCCGCAAG TGACGCTTGT CTGTTACGGG
GGAATGCTTG CGGAGGTTGA GATAGCCGCA GCCGCGGCTT TCGACGAAAA CGAAATACTC
TGCGAGATCA TATGTCCGAG CATTATCAAT CCTCTCAATG CATACCCCAT TCTTGAGTCG
GCACGGAAAA CCAGGAGGCT GATAACCGTA GAAGAAGGAC CGAGCATAGC CGCCCTGGGC
AGCGAGGTAG CCGCCCGAAT ACTTGAACAT TCGCTTCCCA TTGCCCATTA CAGTCGTATC
GGTTACGACT CTACCATTCC ATCATCAGCC AGCCGTGAAT CCAGATTGAT AACAAATGCC
GAATCAATCT TTGAACGTAT TGTGGAGATT TTCAAATGA
 
Protein sequence
MNKRKIDMLM KDCYELTVTA LTIRKVEERL LELFSEGVLN GTIHTCIGQE WTGVAVANAL 
QAGDTVFSNH RGHGHYIALT GDVYGLIAEI MGKDDGVCGG VGGSQHLHTE NFFSNGIQGG
MVPVAAGRAL ANALQGNNAI SVVFIGDGTL GEGVIYETFN IASKWQLPLL VVLENNQYAQ
STPTSLTLAG NIRDRVRGFG IEYIKCDTWD IAGLLDSAKE AVDCVRKNQK PVLLEIDTYR
LKAHSKGDDL RDPVEISRYA GQDSINALLE SDVPRVAETV NQIDSNIQQA ITKAREATLC
SFAPASNSVR QYQSVTWRTE SFARQRIITS INLSLQSLLE NNSKAVIIGE DIEAPYGGAF
KATKDLSTLF PGRVKNTPIS EGAITGVGIG LALSGFLPVV EIMFGDFMTL TFDQLLQHAG
KFCEMYGKDL DVPLIIRTPM GGRRGYGPTH SQSLEKFFLG IPNLEVIAYN HRVSPALIFG
NLCKTIRRPT LIIENKVLYT QHVDSTPMPG FRINISDELF PTVRISPSTG DPQVTLVCYG
GMLAEVEIAA AAAFDENEIL CEIICPSIIN PLNAYPILES ARKTRRLITV EEGPSIAALG
SEVAARILEH SLPIAHYSRI GYDSTIPSSA SRESRLITNA ESIFERIVEI FK