Gene Information |
Locus tag | P9211_04841 |
Symbol | hemL |
ID | 5730581 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | - |
Start bp | 452725 |
End bp | 454023 |
Gene Length | 1299 bp |
Protein Length | 432 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 641284843 |
Product | glutamate-1-semialdehyde aminotransferase |
Protein accession | YP_001550369 |
Protein GI | 159903025 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0001] Glutamate-1-semialdehyde aminotransferase |
TIGRFAM ID | [TIGR00713] glutamate-1-semialdehyde-2,1-aminomutase |
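The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length. Both are assumptions about the record's conventions, not something the record states. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 452725
end_bp = 454023
reported_gene_length = 1299      # bp
reported_protein_length = 432    # aa

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1299 0 432
print(gene_length == reported_gene_length)
print(protein_length == reported_protein_length)
```

Under these assumptions the computed values match the reported Gene Length and Protein Length.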
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00934887 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.931792 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACAAACG CGTTTAACAC AAATCTCTCT CAAGCAGTTT TTAATGCTGC ACAGGATCTA ATGCCTGGTG GAGTGAGTTC TCCAGTTCGA GCTTTCAAAT CGGTCAATGG AGATCCAATT GTATTTGACC GAGTTAAAGG ACCATATGCA TGGGATCTTG ATGGCAATCG ATTTATTGAC TATGTAGGGA GCTGGGGGCC GGCCATATGC GGCCATTCTC ATCCAGAAGT AATTGCTGCA CTTCAAGAAG CCCTTGAGAA AGGAACAAGC TTTGGTGCCC CTTGCGAATT AGAAAACAAA CTTGCAGGGA TGGTCATAGA GGCTGTACCA AGCGTGGAGA TGGTCCGTTT TGTTAATAGC GGTACAGAAG CTTGCATGGC AGTCTTAAGG CTAATGAGGG CCTTTACAGG CAGGGACAAG TTAATAAAGT TCGAAGGTTG TTATCACGGA CACGCAGATA TGTTCTTAGT AAAGGCAGGG TCTGGAGTCG CCACACTTGG TCTACCTGAC TCACCTGGTG TTCCAAGAAG CACAACTTCA AACACTCTTA CAGCTCCATA CAACGACTTA GAAGCTGTTA AAGCATTATT TGCAGAAAAT CCTGATGCAA TTTCTGGAGT AATCCTTGAG CCAATAGTTG GGAACGCTGG ATTTATACCC CCAGAACCAG GTTTCTTGGA GGGGCTGAGA GAACTTACCA AAGAGAATGG GTCTCTCCTT GTTTTTGATG AGGTGATGAC AGGCTTCAGA ATTAGCTACG GTGGCGCCCA AGAAAGATTT GGGGTAACAC CAGATCTAAC CACAATGGGC AAAGTTATTG GCGGAGGTCT ACCTGTAGGT GCATATGGTG GTCGCAAAGA AATTATGTCA ATGGTTGCTC CGGCAGGGCC TATGTATCAG GCTGGCACTC TAAGCGGGAA CCCCCTTGCA ATGACTGCAG GAATAAAAAC GCTAGAACTC CTTAAGCAAG AAGGCACCTA TGAAAGATTA GAGAGCCTTT CTCAACGATT AATCAATGGA ATTTGTGAAT CTGCCAAGAA AGCAGGTATC CCTATTACAG GAAGCTTTAT TAGTGGAATG TTTGGTTTTT ACCTATGCGA AGGCCCTGTG AGAAATTTCC AAGAAGCTAA GCAGACAAAT GCAGAGCTAT TTGGCAAACT TCACAGGGCC ATGCTTGAGA AAGGAATTTA TTTAGCTCCA AGCGCCTTTG AGGCTGGGTT CACATCATTA GCCCACTCTA ATGATGATAT TGAAACAACC ATAAAAGCTT TTGAAGCTAG CTTTTCAGAA ATTGTCTGA
|
Protein sequence | MTNAFNTNLS QAVFNAAQDL MPGGVSSPVR AFKSVNGDPI VFDRVKGPYA WDLDGNRFID YVGSWGPAIC GHSHPEVIAA LQEALEKGTS FGAPCELENK LAGMVIEAVP SVEMVRFVNS GTEACMAVLR LMRAFTGRDK LIKFEGCYHG HADMFLVKAG SGVATLGLPD SPGVPRSTTS NTLTAPYNDL EAVKALFAEN PDAISGVILE PIVGNAGFIP PEPGFLEGLR ELTKENGSLL VFDEVMTGFR ISYGGAQERF GVTPDLTTMG KVIGGGLPVG AYGGRKEIMS MVAPAGPMYQ AGTLSGNPLA MTAGIKTLEL LKQEGTYERL ESLSQRLING ICESAKKAGI PITGSFISGM FGFYLCEGPV RNFQEAKQTN AELFGKLHRA MLEKGIYLAP SAFEAGFTSL AHSNDDIETT IKAFEASFSE IV
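The gene sequence begins with GTG rather than ATG, yet the protein begins with M. A minimal sketch of how the two sequences relate is given below. It builds the standard codon table (translation table 11 uses the same amino acid assignments as the standard code) and reads an alternative start codon in the first position as methionine. The function name `translate`, the `at_start` parameter, and the reduced start codon set are illustrative choices, not part of the record; only the first three blocks and the last nine bases of the gene sequence are used.

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a reduced set of bacterial start codons, enough for this example.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str, at_start: bool = True) -> str:
    """Translate a DNA string, ignoring whitespace, until a stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and at_start and codon in START_CODONS:
            aa = "M"  # an initiator codon is read as methionine
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, as formatted in the record.
head = "GTGACAAACG CGTTTAACAC AAATCTCTCT"
# Last 9 bases of the gene sequence (an internal fragment, so at_start=False).
tail = "ATTGTCTGA"

print(translate(head))                  # MTNAFNTNLS
print(translate(tail, at_start=False))  # IV
```

The two outputs correspond to the first ten and last two residues of the protein sequence listed above.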