Gene P9211_04841 details


Gene Information

Locus tag: P9211_04841
Symbol: hemL
ID: 5730581
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9211
Kingdom: Bacteria
Replicon accession: NC_009976
Strand:
Start bp: 452725
End bp: 454023
Gene Length: 1299 bp
Protein Length: 432 aa
Translation table: 11
GC content: 45%
IMG OID: 641284843
Product: glutamate-1-semialdehyde aminotransferase
Protein accession: YP_001550369
Protein GI: 159903025
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0001] Glutamate-1-semialdehyde aminotransferase
TIGRFAM ID: [TIGR00713] glutamate-1-semialdehyde-2,1-aminomutase
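
The Start bp, End bp, Gene Length and Protein Length fields can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon is excluded."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(452725, 454023)
print(length, "bp")                      # 1299 bp
print(protein_length_aa(length), "aa")   # 432 aa
```

Both results agree with the Gene Length and Protein Length values listed in the record.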


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00934887
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.931792
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACAAACG CGTTTAACAC AAATCTCTCT CAAGCAGTTT TTAATGCTGC ACAGGATCTA 
ATGCCTGGTG GAGTGAGTTC TCCAGTTCGA GCTTTCAAAT CGGTCAATGG AGATCCAATT
GTATTTGACC GAGTTAAAGG ACCATATGCA TGGGATCTTG ATGGCAATCG ATTTATTGAC
TATGTAGGGA GCTGGGGGCC GGCCATATGC GGCCATTCTC ATCCAGAAGT AATTGCTGCA
CTTCAAGAAG CCCTTGAGAA AGGAACAAGC TTTGGTGCCC CTTGCGAATT AGAAAACAAA
CTTGCAGGGA TGGTCATAGA GGCTGTACCA AGCGTGGAGA TGGTCCGTTT TGTTAATAGC
GGTACAGAAG CTTGCATGGC AGTCTTAAGG CTAATGAGGG CCTTTACAGG CAGGGACAAG
TTAATAAAGT TCGAAGGTTG TTATCACGGA CACGCAGATA TGTTCTTAGT AAAGGCAGGG
TCTGGAGTCG CCACACTTGG TCTACCTGAC TCACCTGGTG TTCCAAGAAG CACAACTTCA
AACACTCTTA CAGCTCCATA CAACGACTTA GAAGCTGTTA AAGCATTATT TGCAGAAAAT
CCTGATGCAA TTTCTGGAGT AATCCTTGAG CCAATAGTTG GGAACGCTGG ATTTATACCC
CCAGAACCAG GTTTCTTGGA GGGGCTGAGA GAACTTACCA AAGAGAATGG GTCTCTCCTT
GTTTTTGATG AGGTGATGAC AGGCTTCAGA ATTAGCTACG GTGGCGCCCA AGAAAGATTT
GGGGTAACAC CAGATCTAAC CACAATGGGC AAAGTTATTG GCGGAGGTCT ACCTGTAGGT
GCATATGGTG GTCGCAAAGA AATTATGTCA ATGGTTGCTC CGGCAGGGCC TATGTATCAG
GCTGGCACTC TAAGCGGGAA CCCCCTTGCA ATGACTGCAG GAATAAAAAC GCTAGAACTC
CTTAAGCAAG AAGGCACCTA TGAAAGATTA GAGAGCCTTT CTCAACGATT AATCAATGGA
ATTTGTGAAT CTGCCAAGAA AGCAGGTATC CCTATTACAG GAAGCTTTAT TAGTGGAATG
TTTGGTTTTT ACCTATGCGA AGGCCCTGTG AGAAATTTCC AAGAAGCTAA GCAGACAAAT
GCAGAGCTAT TTGGCAAACT TCACAGGGCC ATGCTTGAGA AAGGAATTTA TTTAGCTCCA
AGCGCCTTTG AGGCTGGGTT CACATCATTA GCCCACTCTA ATGATGATAT TGAAACAACC
ATAAAAGCTT TTGAAGCTAG CTTTTCAGAA ATTGTCTGA
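
A GC content figure like the one in the record can be recomputed from the gene sequence. This is a minimal sketch, assuming GC content means the fraction of G and C bases over all bases once the spaces and line breaks of the 10-base block layout are removed. The function name is hypothetical, and the page does not state how its own value was derived or rounded.

```python
def gc_content(sequence: str) -> float:
    """Fraction of G/C bases in a nucleotide string; whitespace is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First two 10-base blocks of the gene sequence above, as a small check.
sample = "GTGACAAACG CGTTTAACAC"
print(f"{gc_content(sample):.0%}")
```

Passing the full gene sequence, with its line breaks and block spacing left in place, works the same way.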
 
Protein sequence
MTNAFNTNLS QAVFNAAQDL MPGGVSSPVR AFKSVNGDPI VFDRVKGPYA WDLDGNRFID 
YVGSWGPAIC GHSHPEVIAA LQEALEKGTS FGAPCELENK LAGMVIEAVP SVEMVRFVNS
GTEACMAVLR LMRAFTGRDK LIKFEGCYHG HADMFLVKAG SGVATLGLPD SPGVPRSTTS
NTLTAPYNDL EAVKALFAEN PDAISGVILE PIVGNAGFIP PEPGFLEGLR ELTKENGSLL
VFDEVMTGFR ISYGGAQERF GVTPDLTTMG KVIGGGLPVG AYGGRKEIMS MVAPAGPMYQ
AGTLSGNPLA MTAGIKTLEL LKQEGTYERL ESLSQRLING ICESAKKAGI PITGSFISGM
FGFYLCEGPV RNFQEAKQTN AELFGKLHRA MLEKGIYLAP SAFEAGFTSL AHSNDDIETT
IKAFEASFSE IV
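
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is an illustration under stated assumptions: it uses the standard codon assignments (which translation table 11 shares), and it treats an initial ATG, GTG or TTG as methionine, a subset of the initiators that table 11 allows. All names are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumption: only these initiators are handled; table 11 permits more.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(sequence: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    dna = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # an initiator codon is read as methionine
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate("GTGACAAACG CGTTTAACAC AAATCTCTCT"))  # MTNAFNTNLS
```

The output matches the first ten residues of the protein sequence, including the GTG start codon being read as M.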