Gene GSU1693 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU1693 
Symbolhom 
ID2687052 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp1851187 
End bp1852497 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content59% 
IMG OID637126374 
Producthomoserine dehydrogenase 
Protein accessionNP_952744 
Protein GI39996793 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0460] Homoserine dehydrogenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.240016 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGAGA TCAAGATCGG ACTCATCGGT TTCGGCACCA TCGGCACGGG TGTCGCCAAG 
CTCCTCCAGG CCAACGCCGG CCTCATTGCC GACAAGGTCG GCGCTATGGT TACTCTTAAA
AAAATTGCCG ATCTGGACGT AACCACCGAC AGGGGCATCG AGCTCCCTCC GGGAACGCTC
ACCAGCAATG TGGCCGACGT GCTCGATGAT CCGGAGATCA GCGTGGTGAT TGAGCTGATC
GGCGGGTATG AGCCGGCCAA GAGTTTCGTG CTGCGTGCCA TCAACAACGG CAAGCACGTT
GTTACCGCCA ACAAGGCCCT GCTCGCCCTG CACGGGGAGG AGATCTACCC GGCGGCCGCT
GCCAAGGGGG TTCAGGTACT TTTCGAGGCG GCGGTAGGCG GCGGCATCCC GGTCATCTCG
GCCATACTGG GTAACATGGC GGCAAACAAC TTCACCACGG TGCTCGGCAT CCTCAACGGA
ACCTGCAACT ATATCCTCAC CCGCATGACT CAGGAAGGGG CCGATTTTGG CGATGTCCTC
AAGACCGCCC AGGAACTGGG CTATGCCGAG GCGGATCCGA CCTTCGACAT CGAGGGGGTC
GATACTGCCC ACAAACTGGC GCTGCTGGTT TCCCTCTGTT TCGGGACAAA GGTTGATTTC
AACGCCATCC ACACCGAAGG GATCAGTTCC ATCTCGTCAG CGGATATTGG TTTTGCCCGG
GATTTCGGGT ACAAGATCAA GCTGCTCGCC ATTGGCAAGC GCACCGGCGA TACCGTGGAA
GCCCGTGTCC ACCCGACCAT GATCCCTGTC AACTACCCAC TTGCCGATGT GGACGGGGTT
TTCAATGCCA TCCGCTTCAC CGGCGATTTT ATCGGTCCAG TGATGTTCTA TGGCCGCGGC
GCCGGCATGG ATCCCACCGC CAGTGCGGTA GTGGGCGATG TCATTGAAAT CGCCCGGAAT
ATCATTGCCG GCGTAAGCCG CCGGTGCGCG CCCCTCGGCT ATCGGGACGA GGCAGTCACG
ACGCTTGCCC TCAAGCCCAT GGGTGAGATC GAGGGCAAGT ACTATCTTCG CTTCAGTGCC
GTCGACAAGC CCGGAGTGCT GGCAAAAATC TCGGGGGCCC TCGGCAAGTA CGATATCAGC
ATTGAATCGA TGATTCAGAA GGGGAGGAGC GCCGGTGAAT CGGTGCCCAT CGTGATCATG
ACCCATGAGG CCCGTGAAAA GGACATTCGC GCTGCTCTTG AGGAAATCGA CACCTTCGAG
CTCATCAGCG AGAAGAGCAG GTTCATCAGG ATTGAGGACA ACTTGGAATA A
 
Protein sequence
MKEIKIGLIG FGTIGTGVAK LLQANAGLIA DKVGAMVTLK KIADLDVTTD RGIELPPGTL 
TSNVADVLDD PEISVVIELI GGYEPAKSFV LRAINNGKHV VTANKALLAL HGEEIYPAAA
AKGVQVLFEA AVGGGIPVIS AILGNMAANN FTTVLGILNG TCNYILTRMT QEGADFGDVL
KTAQELGYAE ADPTFDIEGV DTAHKLALLV SLCFGTKVDF NAIHTEGISS ISSADIGFAR
DFGYKIKLLA IGKRTGDTVE ARVHPTMIPV NYPLADVDGV FNAIRFTGDF IGPVMFYGRG
AGMDPTASAV VGDVIEIARN IIAGVSRRCA PLGYRDEAVT TLALKPMGEI EGKYYLRFSA
VDKPGVLAKI SGALGKYDIS IESMIQKGRS AGESVPIVIM THEAREKDIR AALEEIDTFE
LISEKSRFIR IEDNLE