Gene GM21_3245 details


Gene Information

Locus tag: GM21_3245
Symbol:
ID: 8138602
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 3774676
End bp: 3775959
Gene Length: 1284 bp
Protein Length: 427 aa
Translation table: 11
GC content: 57%
IMG OID: 644870854
Product: O-acetylhomoserine/O-acetylserine sulfhydrylase
Protein accession: YP_003023029
Protein GI: 253701840
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2873] O-acetylhomoserine sulfhydrylase
TIGRFAM ID: [TIGR01326] OAH/OAS sulfhydrylase
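
The coordinates and lengths listed above are mutually consistent, which the following minimal Python sketch illustrates. It assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; the function name cds_lengths is hypothetical and not part of any database tool.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a gene length that
    includes the stop codon (both are assumptions about this record).
    """
    gene_len = end_bp - start_bp + 1
    if gene_len <= 0 or gene_len % 3 != 0:
        raise ValueError("coordinates do not describe a whole number of codons")
    # One codon is the stop codon, which does not encode an amino acid.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


gene_len, protein_len = cds_lengths(3774676, 3775959)
print(gene_len, protein_len)  # 1284 427
```

Under those assumptions the result matches the listed gene length of 1284 bp and protein length of 427 aa.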


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 123
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCAGAAA ATTCGTTCGG ATTCGATACC CTTGCACTTC ATGCCGGCCA GACTGTAGAT 
CCTGCCACCC TGTCCCGTGC CGTTCCGATC TACCAGACAT CATCCTATGT CTTCAAGAAC
TCCGAACACG CCGCCAACCT CTTCGGGCTC AAAGAGTTCG GCAATATCTA CACCCGTTTG
ATGAACCCCA CCACCGACGT CCTTGAGAAG AGGGTGGCCG AACTGGATGG TGGTGTCGCC
GCGCTCGCAG TAGCTTCCGG CCAGGCTGCC ACTACCTATG CGGTGTTGAA CATCGCCAGT
GCAGGGCAGA ACATCATCTC CACCAGCTAT CTCTATGGTG GTACCTACAA CCTGTTCCAC
TACACCCTGC CGAAACTCGG CATCACGGTG AAATTCGTTG ACTCCTCCGA CCCGGAGAAC
ATCCGCAAAG CCATCGATGA GAACACCCGT TTGGTGTACA GCGAGGCCAT AGGCAACCCC
AAGAACAACG TTGACGACTT CGAGGCCATT GCCAAGGTCG CGCACGATGC GGGCATCCCC
TATATCGTGG ACAACACCGC GGCAACCCCT TTCGTATTCC AGCCGCTCAA GCATGGCGCA
GACATCGTCG TCTATTCGCT GACCAAATTC TTGGGAGGTC ACGGAACCAG CATTGGCGGC
TGCGTAGTCG ATGGCGGAAC CTTTCCGTGG AACAACGGCA AGTTCCCCGA GTTCACCGAG
CCGGATCCCT CCTACCACGG GTTGAAGTTC TGGGACGCGT TAGGCAACAT TTCCTACATC
ATCAAGATGA GGGTAGAGCT TCTGCGCGAC ATGGGCGCCT GCATTTCACC CTTCAACGCT
TTCCAGATCA TCCAAGGCAT AGAGACCCTG CATGTCAGGA TGCAGCGTCA CGTGGAGAAC
GCGCAGAAGG TCGCCGAATG GTTGGAGCAG AATCCACTGG TGAGCTGGGT CAACTATCCC
GGTCTGCCGA GCCATAAGGA CCACGCCAAC GCCAAGAAGT ACCTGAACGG CGCAGGTGCC
ATCATCGGCT TCGGCATCAA GGGAGGCCTT GAGGCAGGCA TGAAGTTCAT CGACAACGTC
AAGCTGCTGT CGCACCTGGC CAACATCGGC GATGCCAAGA GCCTCGTGAT CCATCCGGCG
TCCACCACTC ACCAGCAACT CTCCGCAGAA GAGCAATTGG CCACCGGCGT GAGCCCCGAC
TTCATCAGGC TCTCTATAGG TATAGAAGAC GTCAAAGACA TCATAGCCGA CATAGAGCAG
GCCCTGAAAG CGGCACAAGC CTAG
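
The GC content reported in the gene information can, in principle, be recomputed from the gene sequence above. The sketch below shows one way to do that in Python; the function name gc_percent is hypothetical, and the example call uses only the first ten-base block of the sequence, so its result is not the whole-gene value.

```python
def gc_percent(seq):
    """Return the percentage of G and C among the A/C/G/T bases in seq.

    Spaces, line breaks, and any other characters are ignored, so the
    blocked layout of the sequence listing can be passed in unchanged.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First block of the gene sequence only, for illustration.
first_block = "ATGTCAGAAA"
print(gc_percent(first_block))  # 30.0
```

Passing the full gene sequence to the same function would give the whole-gene figure to compare against the listed GC content.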
 
Protein sequence
MSENSFGFDT LALHAGQTVD PATLSRAVPI YQTSSYVFKN SEHAANLFGL KEFGNIYTRL 
MNPTTDVLEK RVAELDGGVA ALAVASGQAA TTYAVLNIAS AGQNIISTSY LYGGTYNLFH
YTLPKLGITV KFVDSSDPEN IRKAIDENTR LVYSEAIGNP KNNVDDFEAI AKVAHDAGIP
YIVDNTAATP FVFQPLKHGA DIVVYSLTKF LGGHGTSIGG CVVDGGTFPW NNGKFPEFTE
PDPSYHGLKF WDALGNISYI IKMRVELLRD MGACISPFNA FQIIQGIETL HVRMQRHVEN
AQKVAEWLEQ NPLVSWVNYP GLPSHKDHAN AKKYLNGAGA IIGFGIKGGL EAGMKFIDNV
KLLSHLANIG DAKSLVIHPA STTHQQLSAE EQLATGVSPD FIRLSIGIED VKDIIADIEQ
ALKAAQA