Gene P9303_13191 details

Gene Information

Locus tag: P9303_13191
Symbol: met17
ID: 4776489
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9303
Kingdom: Bacteria
Replicon accession: NC_008820
Strand:
Start bp: 1126436
End bp: 1127764
Gene Length: 1329 bp
Protein Length: 442 aa
Translation table: 11
GC content: 50%
IMG OID: 640086827
Product: putative O-Acetyl homoserine sulfhydrylase
Protein accession: YP_001017331
Protein GI: 124023024
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2873] O-acetylhomoserine sulfhydrylase
TIGRFAM ID: [TIGR01326] OAH/OAS sulfhydrylase
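
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon. Both assumptions are consistent with the values in this record but are not stated by it. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length(1126436, 1127764)
print(length, "bp")                  # gene length from the coordinates
print(protein_length(length), "aa")  # protein length implied by that gene length
```

With the values from this record, the computed lengths agree with the listed Gene Length and Protein Length.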


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.288572
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGACTTCAC ATCGCTTCGA AACCCTCCAG CTTCATGCCG GCCAGGTAGC TGATCCTGTC 
ACGAACTCAC GAGCTGTACC CATTTATCAA ACCAGTTCGT ATGTCTTTAA TGACGCTGAA
CATGGTGCCA ACCTGTTTGG TCTTAAAGAA TTCGGCAACA TTTACACCCG TCTGATGAAC
CCCACGACGG ATGTGTTTGA AAAGCGTGTT GCGGCTCTTG AGGGAGGGAT TGCTGCTGTA
GCGACAGCTT CCGGACAATC CGCCCAGTTC CTGGCAATTA CAAACTGCAT GCAGGCTGGT
GACAATCTTG TCTCAACATC CTTCTTGTAT GGAGGAACCT ACAATCAATT CAAGGTGCAG
TTTCCCCGCC TTGGCATTGA TGTGAAGTTT GCGGATGGTG ATGATGTTGA TAGCTTTGCC
ACACAAATCG ATGCCAACAC AAAAGCAATT TATGTAGAGT CGATGGGGAA CCCTCGCTTC
AATATCCCTG ACTTTAAAGG CCTTTCAGGC TTAGCAAAAG ACATGGGAAT CCCCTTAATT
GTTGACAACA CCCTTGGAGC AGCTGGTGCT TTGATTCGGC CCATAGAGCA TGGCGCTGAT
GTGGTGGTGG AAAGTGCGAC CAAATGGATT GGAGGCCATG GCACCAGCCT TGGTGGAGTG
CTTGTGGATG CTGGCACCTT TAACTGGGGC AATGGCAAGT TCCCACTAAT GAGTGAGCCA
AGCGCGGCAT ATCACGGTCT GGTGCACTGG GATGCTTTCG GGTTCGGCAG CGACATCTGC
TCCATGCTCG GCGTTCCAAG CAATCGCAAT GTGGCCTTTG CACTGCGAGC TCGAGTTGAG
GGCCTGCGCG ACTGGGGAGC AGCTCTTAGT CCATTCAATT CATTCCTACT TTTGCAAGGG
CTCGAAACAC TGAGTTTGCG AGTGGAGCGC CATGCATCCA ATGCCATGGC TCTTGCTACA
TGGCTTCAGG ATCATCCCAA GGTTGCCAGT GTCAACTATC CCGGCCTGAA AAATGACCCA
TACCACGCAC AAGCCAAAAC ATACCTAACG AATCGAGGCA TGGGATGCAT GCTGATGTTC
TCCCTTAAAG GAGGCTTTGA TGATGCCGTG AGCTTCATCA ATGGTCTTGA ATTAGCTAGC
CATCTTGCCA ATGTTGGTGA TGCCAAGACC TTGGTGATTC ATCCTGCATC CACCACGCAT
CAACAACTTT CTGCTCAAGA GCAGGAATCT GCAGGAGTCA CTCCCACAAT GGTTAGGGTA
TCTGTTGGCC TTGAACACAT CGAAGACATC AAGGCTGATT TTGAACAGGC CCTAGCGGTC
ATCAGCTGA
 
Protein sequence
MTSHRFETLQ LHAGQVADPV TNSRAVPIYQ TSSYVFNDAE HGANLFGLKE FGNIYTRLMN 
PTTDVFEKRV AALEGGIAAV ATASGQSAQF LAITNCMQAG DNLVSTSFLY GGTYNQFKVQ
FPRLGIDVKF ADGDDVDSFA TQIDANTKAI YVESMGNPRF NIPDFKGLSG LAKDMGIPLI
VDNTLGAAGA LIRPIEHGAD VVVESATKWI GGHGTSLGGV LVDAGTFNWG NGKFPLMSEP
SAAYHGLVHW DAFGFGSDIC SMLGVPSNRN VAFALRARVE GLRDWGAALS PFNSFLLLQG
LETLSLRVER HASNAMALAT WLQDHPKVAS VNYPGLKNDP YHAQAKTYLT NRGMGCMLMF
SLKGGFDDAV SFINGLELAS HLANVGDAKT LVIHPASTTH QQLSAQEQES AGVTPTMVRV
SVGLEHIEDI KADFEQALAV IS