Gene P9211_14951 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9211_14951 
Symbol 
ID5730416 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9211 
KingdomBacteria 
Replicon accessionNC_009976 
Strand
Start bp1338307 
End bp1339515 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content36% 
IMG OID641285873 
Producthypothetical protein 
Protein accessionYP_001551380 
Protein GI159904036 
COG category 
COG ID 
TIGRFAM ID[TIGR03573] N-acetyl sugar amidotransferase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0184708 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAAAGT CACTCTTAAA TATTAGCAAT CTTCCCAAAC TTCCACCCTT AACAGATATT 
GAGAAGCAAC TTCTAGAAGA AAAAGTAGAT ATTGATTCAA AGTATAAATT GCCAAAAGAA
ATAAAATTGT GTCATAAATG TGTAATTACA AACCAACGCC CAAGGATCAC TATTAATGAA
GATGGTATAT GTAATCCATG CAAATACTGG GCGAGAAAGC ATTCCTCATT TGACTGGAAT
TCATTAGCAG ATGAATTTCG AGAGCTGTGT GACAAGTATC GCTCATCAGA TGGTTCATAT
GACGTATTAG TTCCTTCTAG TGGAGGTAAA GATAGCTCCT ATGTTGCCTA TAGATTAAGA
GATGAATACG ATATGCATCC TCTTACAGTT ACATGGTCAC CTTCTTTATA TACAGAGATA
GGGTTTGAAA ACTTTCAGAA CCATATACAT CATGGCTTAG ACAATGTTTT AGTAACAGCG
AATGGATTGG TTCATAGGCG ACTATGTAGA AGTTCAACAA TTATTATGGG TGATCCTTTT
CAACCTTTTG TATATGGTCA ATGCAATGTT CCATTAAGAA TAGCCAAAGC CTATGATATC
CCTTTAATAG TTGATGGAGA GAATGGAGAA GTTGAGTATG GAGGAGATGA CAATACAGAA
CAATTGACTG GTTTTCAGAA TGATGAATCA GTTGAGTTTT GGCAATCTGG TATGGCAGTT
GAGGAATGGC AAAAATATGG TTATTCCGAT TCCGAGTTGT TTATTTATCA ACCACCTAAG
CAGCAAATTA ATGTTCGTAG AGTATTTTTT AGCTATTACC ATAATTGGAT GCCCCACGAC
CATTACTATT ACGCAAGTCA AAATGCAGGT TTCGTCTCTA ACCCTGATAG ATCTGAATGT
ACTTTTTCCC GCTATGCAAG CCTTGATGAT TCAATAGATC CATTTCATTA TTACTTTGCG
TTGCTAAAAT TTGGTATTGG AAGAGCGACC TCGGATGCCG CCCATGAATT AAGAGAAGGG
GTTCTAGAAA GAGATGAAGC GATTCAATTA GTTAACAAGT TTGATTGCCA AGCTCCATCT
AAAGAAACTA CCGAGATTTT TCTGAAATAC TGTTCTATAG ACAAAGGTGC TTTACAGAAA
ATCGTGGATA GATGGACTAA TAGCAGAATA TGGTCTGCTA GAAATGACTT ACCTTCTCTT
CAATTCTAA
 
Protein sequence
MRKSLLNISN LPKLPPLTDI EKQLLEEKVD IDSKYKLPKE IKLCHKCVIT NQRPRITINE 
DGICNPCKYW ARKHSSFDWN SLADEFRELC DKYRSSDGSY DVLVPSSGGK DSSYVAYRLR
DEYDMHPLTV TWSPSLYTEI GFENFQNHIH HGLDNVLVTA NGLVHRRLCR SSTIIMGDPF
QPFVYGQCNV PLRIAKAYDI PLIVDGENGE VEYGGDDNTE QLTGFQNDES VEFWQSGMAV
EEWQKYGYSD SELFIYQPPK QQINVRRVFF SYYHNWMPHD HYYYASQNAG FVSNPDRSEC
TFSRYASLDD SIDPFHYYFA LLKFGIGRAT SDAAHELREG VLERDEAIQL VNKFDCQAPS
KETTEIFLKY CSIDKGALQK IVDRWTNSRI WSARNDLPSL QF