Gene P9303_01101 details


Gene Information

Locus tag: P9303_01101
Symbol: spsE
ID: 4778707
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9303
Kingdom: Bacteria
Replicon accession: NC_008820
Strand:
Start bp: 110482
End bp: 111489
Gene Length: 1008 bp
Protein Length: 335 aa
Translation table: 11
GC content: 38%
IMG OID: 640085609
Product: sialic acid synthase
Protein accession: YP_001016130
Protein GI: 124021823
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG2089] Sialic acid synthase
TIGRFAM ID:
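
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the reported gene length includes the terminal stop codon; both assumptions are consistent with the values in this record but are not stated by it. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(110482, 111489)
print(length)                     # 1008, matches "Gene Length"
print(protein_length_aa(length))  # 335, matches "Protein Length"
```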


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.252845
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCGTTA AGATAATAGC AGAGGCAGGT GTCAACCATA ATGGCGAAAT ACAAAATGCA 
TATCGACTAA TAGATAGTGC GAAGTTTGCT GGTGCTGACG CAATAAAGTT CCAAAGCTTT
TCGTCAATAG AATTGACCGC AGAGAATGCA CCAAAAGCAA AGTATCAGCA AATGTACGCT
GACAGAAGCA CTCAACGTGA AATGTTGGCA CGGCTTGAAC TTAATGAAAT GGAGCATCAG
CTGATAGCAG ACTATTGTGA GAAAATTGAG ATAGAGTTTC TATCAACAGC GTTTGGGCAA
TCGCAACTAG ACTTATTAAT TAAACTAAAG GTTCAGGCGA TTAAAGTGGC CTCAGGAGAA
ATAACACACT GGCCATTATT AAAGGAAATG GCAAAAAAAG CCTATGAAAA TAATCTAGAT
GTCTATCTAT CTACAGGAAT GTCAGAAATA AAAGAAATCC AGGATGCACT AGATATTTTT
ATCAAGGAAG ATATTTCATT AGGGAAAATT TTTATCCTAC ATTGTACTAG TCATTATCCT
GCTCCTTACA ACGCTGTCAA TATGAAAGCA CTAAGGACAT TGAGAAATAC ATTTCAATGT
AAAGTTGGAT ACTCAGACCA TACCACAGGG ATACTTACAT CAGTTGTTGC TGTAGCGCTT
GGTGCTGAAA TTATTGAAAA GCACATAACA CTTGATCAAG CAATGAAAGG TCCAGATCAT
AAAGCCAGTT TAAATCCACA AGAGTTTAGA AGTATGGTAA AGGAAATTAG AAATTGTGAG
GAGATACTTG GACAAGAAGA GAAAACACTG CAAGATTGCG AACGAGATAC TAGGTATGTA
GCCAGACGCT CTATCAGAGC ATCTGAAATC ATAAATAAAG GTGATGTCTT AAATGAGAAG
AATCTGATAT GCAAGCGCCC TAATGATGGG ATAAGTCCTA TGAATTATCC GCAAATATTA
GGTAAACGTG CAAAGCGCGA ATACGAAATT GGTGAATGTA TTGACTAG
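
The gene sequence is printed in blocks of 10 bases, so it has to be joined before any calculation. The sketch below shows one way to strip the spacing and compute a GC percentage that can be compared against the GC content field of the record. The helper names are hypothetical, and only the first row of the sequence is used as sample input.

```python
def clean_sequence(block_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block_text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence, copied as printed in the record.
first_row = "ATGGCCGTTA AGATAATAGC AGAGGCAGGT GTCAACCATA ATGGCGAAAT ACAAAATGCA"
seq = clean_sequence(first_row)
print(len(seq))
print(round(gc_percent(seq), 1))
```

Running the same two functions on all rows of the gene sequence gives a value to compare with the reported GC content.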
 
Protein sequence
MAVKIIAEAG VNHNGEIQNA YRLIDSAKFA GADAIKFQSF SSIELTAENA PKAKYQQMYA 
DRSTQREMLA RLELNEMEHQ LIADYCEKIE IEFLSTAFGQ SQLDLLIKLK VQAIKVASGE
ITHWPLLKEM AKKAYENNLD VYLSTGMSEI KEIQDALDIF IKEDISLGKI FILHCTSHYP
APYNAVNMKA LRTLRNTFQC KVGYSDHTTG ILTSVVAVAL GAEIIEKHIT LDQAMKGPDH
KASLNPQEFR SMVKEIRNCE EILGQEEKTL QDCERDTRYV ARRSIRASEI INKGDVLNEK
NLICKRPNDG ISPMNYPQIL GKRAKREYEI GECID
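
The protein sequence can be reproduced from the gene sequence using the translation table listed in the record (table 11). The sketch below builds the codon table from the compact amino acid string that NCBI publishes for that table and translates up to the first stop codon. It is a minimal illustration, not the database's own pipeline; the function and constant names are assumptions, and ambiguous bases are simply reported as `X`.

```python
from itertools import product

# Amino acids for NCBI translation table 11, codons ordered with bases T, C, A, G.
TABLE_11_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), TABLE_11_AAS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence in this record.
print(translate("ATGGCCGTTA AGATAATAGC AGAGGCAGGT"))  # MAVKIIAEAG
```

The output matches the first 10 residues of the protein sequence above; passing the full gene sequence should allow the same comparison over all 335 residues.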