Gene NATL1_07071 details

Gene Information

Locus tag: NATL1_07071
Symbol: met17
ID: 4780803
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL1A
Kingdom: Bacteria
Replicon accession: NC_008819
Strand:
Start bp: 649332
End bp: 650690
Gene Length: 1359 bp
Protein Length: 452 aa
Translation table: 11
GC content: 37%
IMG OID: 640083981
Product: putative O-Acetyl homoserine sulfhydrylase
Protein accession: YP_001014530
Protein GI: 124025414
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2873] O-acetylhomoserine sulfhydrylase
TIGRFAM ID: [TIGR01326] OAH/OAS sulfhydrylase
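
The start, end, and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given inclusive, 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(649332, 650690)
print(length, protein_length(length))  # 1359 452
```

Both values agree with the Gene Length and Protein Length fields of the record.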


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCTAAAA AGAAAATAGA GATAATTAAT TCTTTGACTT CCCAACGTTT TGAAACTCTT 
CAATTGCATG CAGGTCAAGT ACCAGATCCT GTTACCAATT CTAGAGCGGT TCCTATTTAT
CAGACAAGTT CATATGTTTT TAACGATGTA GATCATGGAG CAAATTTATT TGGTTTGAAG
GAATTTGGAA ATATTTATAC GCGCTTGATG AATCCTACAA CGGATGTTTT CGAAAAAAGA
ATTGCAGCAC TTGAAGGAGG AGTAGCTGCT TTAGCAACAG CATCAGGACA GTCTGCACAA
TTCATTGCGA TTACAAACTT CCTTACTGCA GGAGACAGCT TTGTATCAAC ATCGTTTTTG
TATGGTGGCA GCTATAACCA ATTCAAAGTT CAATTTCCAC GATTAGGAAT CAATGTCAAG
TTTGCTGATG GTGATGATGC AGAAAGCTTT GAAGAACAAA TTGATTCATC TACAAAAGCA
ATTTATGTAG AGTCAATGGG TAATCCTAGG TTTAACATCC CTGATTTTGA TGGACTATCG
AAATTAGCTA AATCGAAAAA TATCCCCTTA ATTGTAGATA ACACTCTTGG TGCTGCTGGA
GCTTTAATTC GTCCTATTGA ACATGGCGCA GATGTGGTTG TTCAAAGTGC TACAAAATGG
ATAGGAGGAC ATGGAACAAG TCTTGGAGGG GTAATTGTTG ATGCAGGAAC TTTTGATTGG
GGAAATGGAA AATATCCTTT AATGAGTCAA CCAAGCGCGG CTTATCATGG TTTAGTTCAT
TGGGATGCTT TTGGATTTGG GAGTGATATT TGTGGAATGC TTGGAGTCCC TACAGATCGA
AACATTGCTT TTGCTTTGAG AGCTAGGTTA GAGGGGTTAC GAGATTGGGG CCCTGCGATT
AGTCCTTTTA ATTCTTTTTT ATTACTTCAA GGATTAGAAA CTTTAAGTTT AAGAATAGAA
AGACATTGCT CTAATGCACT TGCATTGGCT AAGTGGTTAG ACGATCATTC GAAAGTTGAT
AATGTTAGTT ATCCAGGATT ACCATCGGAC AAATACCATT CAAGAGCCTC TAATTATATG
ACCAATAGAG GAAAAGGTTC TATGTTGATT TTCTCTCTGA AAGGTGGCTT TGATGATGCT
GTGAAGTTTA TAAACTCTTT AAAACTTTCT AGTCATCTAG CAAATGTAGG CGATGCAAAA
ACATTAGTAA TTCATCCTGC TTCAACAACT CATCAACAAC TATCCCCAGA AGAACAGTTG
TCCGCAGGTG TTACTCCAAC TATGGTAAGA GTGTCTGTTG GTATAGAACA TATTGATGAT
ATCTTAGAGG ATTTCGAGCA GGCTCTAAAT TTAATTTAA
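
The gene sequence is printed in blocks of ten bases, so it has to be joined before its length or GC content can be compared with the fields in the record. A minimal sketch follows, assuming the sequence has been pasted into a string; `clean_sequence` and `gc_percent` are hypothetical helper names.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already-cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Usage: paste the full gene sequence in place of this first row.
first_row = "GTGGCTAAAA AGAAAATAGA GATAATTAAT TCTTTGACTT CCCAACGTTT TGAAACTCTT"
seq = clean_sequence(first_row)
print(len(seq), round(gc_percent(seq)))
```

Run on the full sequence, the length should match the Gene Length field, and the rounded percentage can be compared with the GC content field.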
 
Protein sequence
MAKKKIEIIN SLTSQRFETL QLHAGQVPDP VTNSRAVPIY QTSSYVFNDV DHGANLFGLK 
EFGNIYTRLM NPTTDVFEKR IAALEGGVAA LATASGQSAQ FIAITNFLTA GDSFVSTSFL
YGGSYNQFKV QFPRLGINVK FADGDDAESF EEQIDSSTKA IYVESMGNPR FNIPDFDGLS
KLAKSKNIPL IVDNTLGAAG ALIRPIEHGA DVVVQSATKW IGGHGTSLGG VIVDAGTFDW
GNGKYPLMSQ PSAAYHGLVH WDAFGFGSDI CGMLGVPTDR NIAFALRARL EGLRDWGPAI
SPFNSFLLLQ GLETLSLRIE RHCSNALALA KWLDDHSKVD NVSYPGLPSD KYHSRASNYM
TNRGKGSMLI FSLKGGFDDA VKFINSLKLS SHLANVGDAK TLVIHPASTT HQQLSPEEQL
SAGVTPTMVR VSVGIEHIDD ILEDFEQALN LI
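
The protein sequence can be reproduced from the gene sequence using translation table 11 (the bacterial, archaeal and plant plastid code). The sketch below is a plain-Python illustration, not the database's own method. It assumes the NCBI layout of the codon table (bases ordered T, C, A, G) and treats an alternative start codon such as GTG as methionine when it is the first codon.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons listed for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(text: str) -> str:
    """Translate a CDS, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # the initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("GTGGCTAAAA AGAAAATAGA GATAATTAAT"))  # MAKKKIEIIN
```

The first thirty bases of the gene give MAKKKIEIIN, which matches the start of the protein sequence listed above.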