Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_0960 |
Symbol | |
ID | 8136281 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 1135858 |
End bp | 1137141 |
Gene Length | 1284 bp |
Protein Length | 427 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644868574 |
Product | O-acetylhomoserine/O-acetylserine sulfhydrylase |
Protein accession | YP_003020783 |
Protein GI | 253699594 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2873] O-acetylhomoserine sulfhydrylase |
TIGRFAM ID | [TIGR01326] OAH/OAS sulfhydrylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 71 |
Fosmid unclonability p-value | 0.353784 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAGG ATTGGCGCAT CGAAACCCAG GCGATTCAGG AAGGTTATGC CCCCAAGGAC GGAGACCCCC GGATCCTGCC GATCTACCAG AGCACGACCT TCAAGTTTTC CAGCGCGGAG CACGTGGCGA AACTGTTCGA CCTCGAGGTG GGAGGACACT TCTACACCCG GCTCAGCAAC CCGACCGCCG AGGGTTTCGA GACCAAGATA GCGGCCATGG AAGGGGGCGT CGCCGCCATG GCCACCTCCA GCGGCCAGGC CGCGACCAGC ATGGCCATCA TGAACATCTG CCGCGCCGGG CAGCACGTGG TCGCCGCCAG CACCTTGTAC GGCGGCACCT ACTCGCTCTT CGCCAACACC TTCCCGAAGA TGGGTATCGA GGTGACCTTC GTCGATCCCG AGGCAGGCGA GGCCGCCATC GAGGCCGCCT TCCGCCCCGA GACCCGCGCA CTTTTCGGCG AGACCATAGG GAACCCCGGC TTGAACGTGC TCGACTTCGA GAAGTTCTCC CGGATCGCCA AGAAGATGCA GGTGCCGCTC ATCATCGACA ACACCTTCCC GACGCCGTAC CTGTGCCGCC CCTTCGAGCA CGGCGCGGAC ATAGTTATCC ACTCGGCCAC CAAGTACATC GACGGGCACG CCACCAGCGT GGGCGGGGTC ATCATCGACA GCGGCAACTT CGACTGGGGT AACGGCAAGT ACCCCGAGAT GACGGAGCCC GACGCGAGCT ACCACGGGCT CAAGTACCTG GAGACCTTCG GCAAGCTTGC CTACATCGTC AAGGCGCGCG TGCAGCTCAT GCGCGACCTG GGCTCCTGTC CTGCGCCGAT GAACGCCTTC CTGTTCAACC TGGGGCTGGA GACCCTGCCG CTGCGCATGC AGCGCCACAG CGAGAACGCC CTCGCCATGG CGAAATACCT GGAAAAGCAC CCCGCGGTAA GCTGGGTGAC CTACCCCGGG CTGGAGAGCC ACAAAAGCCA CGCCCGCTGC AAGAAGTACC TCCCCAAGGG GGCGAGCGGC GTCCTCACCT TCGGCATCAA GGGAGGTGCC GCGGCAGGAA AGAAATTCAT GGAGAGCTGC CAGCTGGTGG CGCTGGTGGT GCACGTGGGC GACGCCAGGA GCTGCGTCCT GCACCCGGCG AGCACCACCC ACCGGCAGTT GAACGAGGAG CAGCAGATCG CCTCCGGCGT CTCCCCCGAC CTGATCAGGC TCTCGGTCGG CATCGAGCAC ATCGACGACC TGATTGAGGA CGTGAACCAG GCGCTTTTGG CCAGCCAGAA GTAA
|
Protein sequence | MKKDWRIETQ AIQEGYAPKD GDPRILPIYQ STTFKFSSAE HVAKLFDLEV GGHFYTRLSN PTAEGFETKI AAMEGGVAAM ATSSGQAATS MAIMNICRAG QHVVAASTLY GGTYSLFANT FPKMGIEVTF VDPEAGEAAI EAAFRPETRA LFGETIGNPG LNVLDFEKFS RIAKKMQVPL IIDNTFPTPY LCRPFEHGAD IVIHSATKYI DGHATSVGGV IIDSGNFDWG NGKYPEMTEP DASYHGLKYL ETFGKLAYIV KARVQLMRDL GSCPAPMNAF LFNLGLETLP LRMQRHSENA LAMAKYLEKH PAVSWVTYPG LESHKSHARC KKYLPKGASG VLTFGIKGGA AAGKKFMESC QLVALVVHVG DARSCVLHPA STTHRQLNEE QQIASGVSPD LIRLSVGIEH IDDLIEDVNQ ALLASQK
|
| |