Gene P9303_20301 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_20301 
Symbol 
ID4778332 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp1787759 
End bp1788946 
Gene Length1188 bp 
Protein Length395 aa 
Translation table11 
GC content33% 
IMG OID640087544 
Producthypothetical protein 
Protein accessionYP_001018037 
Protein GI124023730 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG3914] Predicted O-linked N-acetylglucosamine transferase, SPINDLY family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.236649 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTCTAG AAGATAACCC TTTTAATCAT CTTCAAAGGG CCAGACATTT TTATGAGAAA 
AAATATTCAC GAAGTACTAC AGAAATTGAT ATTTCTAATA ATAAAAAAAT ACATATTGGA
TACTTTTCCT CTGACTTTTA TGATCATGCA ACATTACACT TGATTTCAAA GCTATTCGAA
TTACATGATA AGGCTGTGTT CAAGATATAT GCATACTCAA TTGGATCTAA TCCTTCAGAT
CACTATACTT ATCACCTTGT ATCTAATGTT GAGGTGTTTC GTGATATTCA TTTAGTGGAT
GATCAATCGG CTGTGTCAAT TGTCCGTAAA GACAATCTTG ATATTGCAAT TGATTTAAAT
GGTTATACTA AAGGCAATAG ATTTTCCATT TTTGCTAATA GAATAGCACC TATACAGATT
AATTACTTGG GCTATCCAGG CTCTACTGGT GCTGAATGTA TTGATTACCT TATAGCTGAT
AAAGTCGTAA TACCGGAGAG ATTTGAGAAA TACTATAGCG AAAAAATTTT ATATTTACCT
AATTCCTTTC AATTTAATCA TGACAGAAGG GAACAAAACC ATCCTACTTT AAGACGAAGT
GATTTTGGGC TCCCTGAGTC TTCATTTGTT TTTGTATGTT TTTGTGCAAA TTATAAGATT
ACTCCATCAG TCTTTAATGT TTGGATGAGG TTACTTAAAC AAGTAGATGA TAGTGTATTG
TGGCTCTATA GATCAAATAA ATGGGCAGAA ATAAATCTTA GGCGTCAGGC AGAGTCGAGA
GATATAGATC CAGAAAGGCT TATTTTTGCA GGTCGCTTAC CTTTAAATAA GCATCTTGCA
AGACACTCCT TAGCAGACCT ATTCTTAGAT ACCTTTAATG TAAATGCACA TACAACGGCA
TCTGATGCCT TGTTAGCAGG TTTACCACTA TTAACTCTCG CTGGTAAAAG TTTTACCTCA
AGAGTTGCTG CAAGTCTTCT TGTGACTTTG AACTTACCTG AGTTAATTAC ATATACAATT
AAGGACTACG AGGAAAAGGC ATTAATGATC GCTTTGGACC CAAAACTTAA TAGAAGATTG
CATGAAAAAT TAAAACTATC AATTAAAGAG TCTGCTTTGT TTAAACCGGA ATTAACGACT
AAATCACTTG AAGATATTTA CAAAGAACTC GTTGTAAAAC ATCGTTAA
 
Protein sequence
MSLEDNPFNH LQRARHFYEK KYSRSTTEID ISNNKKIHIG YFSSDFYDHA TLHLISKLFE 
LHDKAVFKIY AYSIGSNPSD HYTYHLVSNV EVFRDIHLVD DQSAVSIVRK DNLDIAIDLN
GYTKGNRFSI FANRIAPIQI NYLGYPGSTG AECIDYLIAD KVVIPERFEK YYSEKILYLP
NSFQFNHDRR EQNHPTLRRS DFGLPESSFV FVCFCANYKI TPSVFNVWMR LLKQVDDSVL
WLYRSNKWAE INLRRQAESR DIDPERLIFA GRLPLNKHLA RHSLADLFLD TFNVNAHTTA
SDALLAGLPL LTLAGKSFTS RVAASLLVTL NLPELITYTI KDYEEKALMI ALDPKLNRRL
HEKLKLSIKE SALFKPELTT KSLEDIYKEL VVKHR