Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_01101 |
Symbol | spsE |
ID | 4778707 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 110482 |
End bp | 111489 |
Gene Length | 1008 bp |
Protein Length | 335 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 640085609 |
Product | sialic acid synthase |
Protein accession | YP_001016130 |
Protein GI | 124021823 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG2089] Sialic acid synthase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.252845 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGTTA AGATAATAGC AGAGGCAGGT GTCAACCATA ATGGCGAAAT ACAAAATGCA TATCGACTAA TAGATAGTGC GAAGTTTGCT GGTGCTGACG CAATAAAGTT CCAAAGCTTT TCGTCAATAG AATTGACCGC AGAGAATGCA CCAAAAGCAA AGTATCAGCA AATGTACGCT GACAGAAGCA CTCAACGTGA AATGTTGGCA CGGCTTGAAC TTAATGAAAT GGAGCATCAG CTGATAGCAG ACTATTGTGA GAAAATTGAG ATAGAGTTTC TATCAACAGC GTTTGGGCAA TCGCAACTAG ACTTATTAAT TAAACTAAAG GTTCAGGCGA TTAAAGTGGC CTCAGGAGAA ATAACACACT GGCCATTATT AAAGGAAATG GCAAAAAAAG CCTATGAAAA TAATCTAGAT GTCTATCTAT CTACAGGAAT GTCAGAAATA AAAGAAATCC AGGATGCACT AGATATTTTT ATCAAGGAAG ATATTTCATT AGGGAAAATT TTTATCCTAC ATTGTACTAG TCATTATCCT GCTCCTTACA ACGCTGTCAA TATGAAAGCA CTAAGGACAT TGAGAAATAC ATTTCAATGT AAAGTTGGAT ACTCAGACCA TACCACAGGG ATACTTACAT CAGTTGTTGC TGTAGCGCTT GGTGCTGAAA TTATTGAAAA GCACATAACA CTTGATCAAG CAATGAAAGG TCCAGATCAT AAAGCCAGTT TAAATCCACA AGAGTTTAGA AGTATGGTAA AGGAAATTAG AAATTGTGAG GAGATACTTG GACAAGAAGA GAAAACACTG CAAGATTGCG AACGAGATAC TAGGTATGTA GCCAGACGCT CTATCAGAGC ATCTGAAATC ATAAATAAAG GTGATGTCTT AAATGAGAAG AATCTGATAT GCAAGCGCCC TAATGATGGG ATAAGTCCTA TGAATTATCC GCAAATATTA GGTAAACGTG CAAAGCGCGA ATACGAAATT GGTGAATGTA TTGACTAG
|
Protein sequence | MAVKIIAEAG VNHNGEIQNA YRLIDSAKFA GADAIKFQSF SSIELTAENA PKAKYQQMYA DRSTQREMLA RLELNEMEHQ LIADYCEKIE IEFLSTAFGQ SQLDLLIKLK VQAIKVASGE ITHWPLLKEM AKKAYENNLD VYLSTGMSEI KEIQDALDIF IKEDISLGKI FILHCTSHYP APYNAVNMKA LRTLRNTFQC KVGYSDHTTG ILTSVVAVAL GAEIIEKHIT LDQAMKGPDH KASLNPQEFR SMVKEIRNCE EILGQEEKTL QDCERDTRYV ARRSIRASEI INKGDVLNEK NLICKRPNDG ISPMNYPQIL GKRAKREYEI GECID
|
| |