Gene GSU1183 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU1183 
Symbol 
ID2688369 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp1290337 
End bp1291623 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content61% 
IMG OID637125857 
ProductO-acetyl-L-homoserine sulfhydrylase 
Protein accessionNP_952236 
Protein GI39996285 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2873] O-acetylhomoserine sulfhydrylase 
TIGRFAM ID[TIGR01326] OAH/OAS sulfhydrylase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.112948 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCGACA TCCAAAAAGG CTTCGATACC CTCGCACTCC ACGCCGGCCA GGATCCGGAT 
CCCACGACGC TGTCGCGTGC GGTTCCCATT TACCAGACCT CTTCCTATGC GTTCCGCAGC
TCCGAACACG CGGCCAATCT CTTTGGTCTC CGGGAGTTCG GCAATATCTA CACGCGGATC
ATGAATCCGA CCTGCGATGT ACTGGAAAAG CGCCTGGCGG AACTTGACGG CGGAGTGGGC
GCGCTTGCTC TCGCCTCGGG GCAGGCGGCA ATCACGTATG CGGTGCTTAA CATCGCCGGC
GCCGGGCAGA ATATCGTCTC CACCAGCTAT CTCTATGGCG GAACCTACAA CCTCTTCCAC
TATACTCTGC CAAGATTGGG GATATCGGTC CGGTTCGTCG ACACTTCTGA CCCTGAAAAC
GTCCGTCGGG CCATGGATGA AAACACTCGC CTGGTCTACA CCGAATCGGT AGGGAACCCG
AAAAACAATG TGGACGACTT CGAGTCCATT GCCCGGATCG CCCATGAGGC GGGAATCCCG
TTCATAGTGG ACAACACCGT TACCACTCCG TACCTGTTCA GGCCTTTTGA CCATGGGGCC
GACATCGCCG TCTATTCCCT CACCAAATTC ATCGGTGGCC ACGGTACGAG CATCGGGGGG
GCGGTGGTAG ACAGCGGACG TTTTCCCTGG AACAACGGCC GGTTCCCCGA GTTCACGGAA
CCGGATCCCT CCTACCATGG TTTGCGCTAC TGGGAGGCCC TGGGGAACCT CTCCTACATC
CTCAAGATGC GGATCACGCT CCTGCGCGAT ATGGGGGCCT GCCTCGCGCC GTTCAACGCA
TTCCTCTTCC TCCAGGGGCT GGAGACCTTG CCGGTGCGCA TGGCACGCCA CGTTGACAAC
GCGCGTACTG TTGCCGAGTG GCTGGAGCGG CATCCACTGG TCACCTGGGT CAACTATCCG
GGCCTGCCCA GCCACCGGGA CCACGACAAT GCCGGCAAGT ACCTCCCCAA GGGCGCCGGT
GCCATCATCG GCTTCGGAGT CAAGGGAGGG CTCGAGGCGG GCAAGAAGTT CATCGACAGC
GTGAAGCTCC TGTCGCATCT TGCCAACATC GGCGACGCCA AGTCCCTCGT CATCCACCCG
GCATCCACCA CGCACGAGCA GCTCACCGAT GAAGAGCGTC TCTCGGCCGG GGTAACGCCG
GATTTCATCC GCCTTTCCGT CGGCATCGAG GATGTGGCCG ACATCATTGC CGACATCGAC
CAGGCCCTGC ATGCCTCCCA ATCCTGA
 
Protein sequence
MSDIQKGFDT LALHAGQDPD PTTLSRAVPI YQTSSYAFRS SEHAANLFGL REFGNIYTRI 
MNPTCDVLEK RLAELDGGVG ALALASGQAA ITYAVLNIAG AGQNIVSTSY LYGGTYNLFH
YTLPRLGISV RFVDTSDPEN VRRAMDENTR LVYTESVGNP KNNVDDFESI ARIAHEAGIP
FIVDNTVTTP YLFRPFDHGA DIAVYSLTKF IGGHGTSIGG AVVDSGRFPW NNGRFPEFTE
PDPSYHGLRY WEALGNLSYI LKMRITLLRD MGACLAPFNA FLFLQGLETL PVRMARHVDN
ARTVAEWLER HPLVTWVNYP GLPSHRDHDN AGKYLPKGAG AIIGFGVKGG LEAGKKFIDS
VKLLSHLANI GDAKSLVIHP ASTTHEQLTD EERLSAGVTP DFIRLSVGIE DVADIIADID
QALHASQS