Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GSU1693 |
Symbol | hom |
ID | 2687052 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sulfurreducens PCA |
Kingdom | Bacteria |
Replicon accession | NC_002939 |
Strand | + |
Start bp | 1851187 |
End bp | 1852497 |
Gene Length | 1311 bp |
Protein Length | 436 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637126374 |
Product | homoserine dehydrogenase |
Protein accession | NP_952744 |
Protein GI | 39996793 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0460] Homoserine dehydrogenase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.240016 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGAGA TCAAGATCGG ACTCATCGGT TTCGGCACCA TCGGCACGGG TGTCGCCAAG CTCCTCCAGG CCAACGCCGG CCTCATTGCC GACAAGGTCG GCGCTATGGT TACTCTTAAA AAAATTGCCG ATCTGGACGT AACCACCGAC AGGGGCATCG AGCTCCCTCC GGGAACGCTC ACCAGCAATG TGGCCGACGT GCTCGATGAT CCGGAGATCA GCGTGGTGAT TGAGCTGATC GGCGGGTATG AGCCGGCCAA GAGTTTCGTG CTGCGTGCCA TCAACAACGG CAAGCACGTT GTTACCGCCA ACAAGGCCCT GCTCGCCCTG CACGGGGAGG AGATCTACCC GGCGGCCGCT GCCAAGGGGG TTCAGGTACT TTTCGAGGCG GCGGTAGGCG GCGGCATCCC GGTCATCTCG GCCATACTGG GTAACATGGC GGCAAACAAC TTCACCACGG TGCTCGGCAT CCTCAACGGA ACCTGCAACT ATATCCTCAC CCGCATGACT CAGGAAGGGG CCGATTTTGG CGATGTCCTC AAGACCGCCC AGGAACTGGG CTATGCCGAG GCGGATCCGA CCTTCGACAT CGAGGGGGTC GATACTGCCC ACAAACTGGC GCTGCTGGTT TCCCTCTGTT TCGGGACAAA GGTTGATTTC AACGCCATCC ACACCGAAGG GATCAGTTCC ATCTCGTCAG CGGATATTGG TTTTGCCCGG GATTTCGGGT ACAAGATCAA GCTGCTCGCC ATTGGCAAGC GCACCGGCGA TACCGTGGAA GCCCGTGTCC ACCCGACCAT GATCCCTGTC AACTACCCAC TTGCCGATGT GGACGGGGTT TTCAATGCCA TCCGCTTCAC CGGCGATTTT ATCGGTCCAG TGATGTTCTA TGGCCGCGGC GCCGGCATGG ATCCCACCGC CAGTGCGGTA GTGGGCGATG TCATTGAAAT CGCCCGGAAT ATCATTGCCG GCGTAAGCCG CCGGTGCGCG CCCCTCGGCT ATCGGGACGA GGCAGTCACG ACGCTTGCCC TCAAGCCCAT GGGTGAGATC GAGGGCAAGT ACTATCTTCG CTTCAGTGCC GTCGACAAGC CCGGAGTGCT GGCAAAAATC TCGGGGGCCC TCGGCAAGTA CGATATCAGC ATTGAATCGA TGATTCAGAA GGGGAGGAGC GCCGGTGAAT CGGTGCCCAT CGTGATCATG ACCCATGAGG CCCGTGAAAA GGACATTCGC GCTGCTCTTG AGGAAATCGA CACCTTCGAG CTCATCAGCG AGAAGAGCAG GTTCATCAGG ATTGAGGACA ACTTGGAATA A
|
Protein sequence | MKEIKIGLIG FGTIGTGVAK LLQANAGLIA DKVGAMVTLK KIADLDVTTD RGIELPPGTL TSNVADVLDD PEISVVIELI GGYEPAKSFV LRAINNGKHV VTANKALLAL HGEEIYPAAA AKGVQVLFEA AVGGGIPVIS AILGNMAANN FTTVLGILNG TCNYILTRMT QEGADFGDVL KTAQELGYAE ADPTFDIEGV DTAHKLALLV SLCFGTKVDF NAIHTEGISS ISSADIGFAR DFGYKIKLLA IGKRTGDTVE ARVHPTMIPV NYPLADVDGV FNAIRFTGDF IGPVMFYGRG AGMDPTASAV VGDVIEIARN IIAGVSRRCA PLGYRDEAVT TLALKPMGEI EGKYYLRFSA VDKPGVLAKI SGALGKYDIS IESMIQKGRS AGESVPIVIM THEAREKDIR AALEEIDTFE LISEKSRFIR IEDNLE
|
| |