Gene NATL1_08721 details

Gene Information

Locus tag: NATL1_08721
Symbol: spsE
ID: 4779694
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL1A
Kingdom: Bacteria
Replicon accession: NC_008819
Strand:
Start bp: 809477
End bp: 810505
Gene Length: 1029 bp
Protein Length: 342 aa
Translation table: 11
GC content: 33%
IMG OID: 640084147
Product: sialic acid synthase
Protein accession: YP_001014695
Protein GI: 124025579
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG2089] Sialic acid synthase
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.938537
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00000164183
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAATATCA ACGGTAGAGA AATTAATGAG AAAAACACTC CCTTAATAAT AGCGGAACTA 
GGAATTAATC ACGGTGGAGA TATAGATATA GCAAAAAATA TGGTAACTTT AGCAGCTTCA
TCAGGAGCAG AGTGCATTAA ACATCAAACA CATTTTGTGG AAGATGAAAT GACAGAAGAA
GCGAAAGATA TTTACCCTCC CAATGCTACT GAATCCATAT GGGATGTTAT AAAAAAATCA
TCTCTTAATA AAGATGAAGA AATAGAATTA AAAAAATATA CAGAGGATTT AGGATTAATA
TATATATCAA CTCCTTTCTC AAGAGCTGCT GCTGATTTTC TAAATGATAT TAATATACCT
GCCTTTAAAA TAGGATCGGG TGAGGCTAGT AATATCCCGC TTATAAGACA TATATCATCA
TTTGGAAAGC CTATTATTTT ATCAACAGGA ATGCATTCTT TAGAGCAAAT AATTGAACCA
GTTAATATTT TCAAGAAAAA CAAAATAGAT TTCGCCCTTC TAGAATGTAC GAATTCTTAC
CCTTCACCAC CTGAAATAGT ATCTCTCCAA GGAATAAAAG ATTTAAAAAA AGCTTTTCCT
GAGGCAATTG TTGGTTTTTC TGATCATTCG ATAGGGCCTT ACATATCATT GGGAGCAGTT
GCATTAGGGG CTTGTATTAT TGAAAGACAC TTTACTGACT CAAGGTATAG AGAAGGTCCT
GACATCTCTT GTTCAATGGA TCCCCTAGAA CTAAGACTTT TAGTAGATAG GTCTAAAGAA
ATTTTTACAG CGATAAATAA TCCAAAAGAA AGAACTCTTC AGGAAGAAGA TGTTTACAAA
TTTGCGAGAG GTACTATTGT AGCTGATAGA GAAATTAAAA AAGGAACAAT TATCGCAGAA
AAAGATATAT GGGCAAGACG TCCTGGTAAC GGTGAAATAG CAGCTGCATT CTATGATAAT
ATCTTGGGAA GAAAAACGAA AAAAGATATA AAATATAATC AACAATTAAG ATGGAAAGAT
CTTATTTAA
 
Protein sequence
MNINGREINE KNTPLIIAEL GINHGGDIDI AKNMVTLAAS SGAECIKHQT HFVEDEMTEE 
AKDIYPPNAT ESIWDVIKKS SLNKDEEIEL KKYTEDLGLI YISTPFSRAA ADFLNDINIP
AFKIGSGEAS NIPLIRHISS FGKPIILSTG MHSLEQIIEP VNIFKKNKID FALLECTNSY
PSPPEIVSLQ GIKDLKKAFP EAIVGFSDHS IGPYISLGAV ALGACIIERH FTDSRYREGP
DISCSMDPLE LRLLVDRSKE IFTAINNPKE RTLQEEDVYK FARGTIVADR EIKKGTIIAE
KDIWARRPGN GEIAAAFYDN ILGRKTKKDI KYNQQLRWKD LI
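
The fields in this record can be cross-checked against each other: the gene length follows from the start and end coordinates, the GC content from the gene sequence, and the protein sequence from translating the gene sequence with translation table 11. The following is a minimal sketch of such a check, not the method used by the database. The function names (clean, gc_percent, translate, span_length) are hypothetical, and the sketch assumes the sequence is given in reading-frame orientation, as it is displayed above.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in TCAG codon order. NCBI translation table 11 assigns the same
# amino acids as the standard code; it differs only in permitted start codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Drop the display spacing (blocks of 10, line breaks) and upper-case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate an in-frame coding sequence up to the first stop codon.

    Assumption: the sequence starts with ATG. Alternative start codons,
    which table 11 reads as Met at position 1, are not handled here.
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


if __name__ == "__main__":
    # First line of the gene sequence as displayed in this record.
    first_line = ("ATGAATATCA ACGGTAGAGA AATTAATGAG "
                  "AAAAACACTC CCTTAATAAT AGCGGAACTA")
    print(translate(first_line))         # MNINGREINEKNTPLIIAEL
    print(span_length(809477, 810505))   # 1029
```

Passing the full gene sequence to clean, gc_percent, and translate in the same way allows the Gene Length, GC content, and Protein sequence fields to be compared with the values listed in this record.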