Gene GSU1108 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU1108 
Symbol 
ID2688555 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp1194322 
End bp1195749 
Gene Length1428 bp 
Protein Length475 aa 
Translation table11 
GC content57% 
IMG OID637125777 
Productaldehyde dehydrogenase family protein 
Protein accessionNP_952161 
Protein GI39996210 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00189493 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAAAGC GCTATAAGGT TCTTGTTGGT GGTGAGTGGA CAGGGGACGA CCGACCGGGT 
ATCGAGGTCG TAAACCCTTA CGACGATTCG GTCATAGGGG TTGTGCCCGA GGCAACGAAC
GAGGATGTTG ACCACGCCAT ACGTGCAGCA CAGGCGGGCT TTGCCGAAAT GTCCGCTCTC
CCGGCGTATC GACGTTCCGA CATACTTGAT CGTACTTCGG AGCTGATCAA GCGAGACCGG
GAGGAGATCG CCGAAATAAT TGCCCGCGAA GCGGGCAAGT CGTGGAAATT CGCCCTTGCG
GAAGCGGATA GAAGTGCAGA GACCTTCCGT TTCGCCTCGC TGGAGGCTCG TAACGCCCAC
GGCGAAATCG TACCCATGGA TGCTTCGCCT GTGTCAGCTG GTCGTTTCGG TTTCTACCTC
AGAACCCCGA TCGGCGTAAT CGGTGCCATC GCACCCTTTA ACTTTCCTCT TAACCTGGTT
GCACACAAGG TGGCACCCGC CATAGCCGCC GGTAACGCGA TAGTGCTGAA GCCTGCCACA
AAGACTCCCC TCTCGTCCAT TAAGCTTGCG GAGCTTATGG TGGAGGCGGG GCTCCCTGCC
GGTGCGCTCA ATCTGGTTAT CGGGAGCGGT CGGACTGTCG GTAACCGTTT GGTAGAGGAT
GATCGGCTGG CAATGGTGAC ATTCACCGGA AGCCCGCCGG TTGGCGTTCA AATCAAGGAG
CGGAGCGGAC TCAAGAGAGT TACGCTGGAG CTTGGGTCCA ATTCACCCAC CATCATTGAG
GATGATGGCG ATGTGGATGC GGCAGTCGCC CGCTGTGTAG TGGGCAGTTT CGCCAACTCG
GGGCAGGTCT GTATCTCTGT TCAGCGAATT TTTGTACACC AGCGGCGTTA TCGCGAATTT
GTTGACAAGT TTGTGGCCGC GACCCAAAAG CTCAAGGTTG GGGATCCTAT GGACCGTGAC
TGCGACATCG GACCGATGAT TTCCCGCGAA GAGCTGCAGC GCGCCGTCGA GTGGCTGGGT
GAGGCCACGT CTCTGGGGGC GAGACTTGAA ACCGGGGGTA CGGTTGCCGG CAACTGTCTC
ACTCCGGCAA TTCTGAGCGG CGTAACTCCC GACATGAAGG TGGTCTGCTC CGAGGTGTTT
GCGCCGATTG TTTCCGTCAT CCCTTATGAG ACCTTCGATC AGGCCCTCGA TATGGCTGAC
GACTCAATCT ATGGCCTTCA GGCCGGGGTT TACACCAGCG ACATCAATAA GGCGTTCAAG
GCCATCCGCC GACTCGATGT GGGAGGAGTA ATCATTAACG ATATTCCGAC GTTCAGGGTC
GATCATATGC CCTATGGCGG TAACAAGCAG AGTGGACTCG GGCGGGAAGG TATCCGCTAC
GCCATGGAAG AGATGACGAA CATAAAATTT GTGTGCTTGA ATCTATGA
 
Protein sequence
MAKRYKVLVG GEWTGDDRPG IEVVNPYDDS VIGVVPEATN EDVDHAIRAA QAGFAEMSAL 
PAYRRSDILD RTSELIKRDR EEIAEIIARE AGKSWKFALA EADRSAETFR FASLEARNAH
GEIVPMDASP VSAGRFGFYL RTPIGVIGAI APFNFPLNLV AHKVAPAIAA GNAIVLKPAT
KTPLSSIKLA ELMVEAGLPA GALNLVIGSG RTVGNRLVED DRLAMVTFTG SPPVGVQIKE
RSGLKRVTLE LGSNSPTIIE DDGDVDAAVA RCVVGSFANS GQVCISVQRI FVHQRRYREF
VDKFVAATQK LKVGDPMDRD CDIGPMISRE ELQRAVEWLG EATSLGARLE TGGTVAGNCL
TPAILSGVTP DMKVVCSEVF APIVSVIPYE TFDQALDMAD DSIYGLQAGV YTSDINKAFK
AIRRLDVGGV IINDIPTFRV DHMPYGGNKQ SGLGREGIRY AMEEMTNIKF VCLNL