Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_20301 |
Symbol | |
ID | 4778332 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 1787759 |
End bp | 1788946 |
Gene Length | 1188 bp |
Protein Length | 395 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 640087544 |
Product | hypothetical protein |
Protein accession | YP_001018037 |
Protein GI | 124023730 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG3914] Predicted O-linked N-acetylglucosamine transferase, SPINDLY family |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.236649 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTCTAG AAGATAACCC TTTTAATCAT CTTCAAAGGG CCAGACATTT TTATGAGAAA AAATATTCAC GAAGTACTAC AGAAATTGAT ATTTCTAATA ATAAAAAAAT ACATATTGGA TACTTTTCCT CTGACTTTTA TGATCATGCA ACATTACACT TGATTTCAAA GCTATTCGAA TTACATGATA AGGCTGTGTT CAAGATATAT GCATACTCAA TTGGATCTAA TCCTTCAGAT CACTATACTT ATCACCTTGT ATCTAATGTT GAGGTGTTTC GTGATATTCA TTTAGTGGAT GATCAATCGG CTGTGTCAAT TGTCCGTAAA GACAATCTTG ATATTGCAAT TGATTTAAAT GGTTATACTA AAGGCAATAG ATTTTCCATT TTTGCTAATA GAATAGCACC TATACAGATT AATTACTTGG GCTATCCAGG CTCTACTGGT GCTGAATGTA TTGATTACCT TATAGCTGAT AAAGTCGTAA TACCGGAGAG ATTTGAGAAA TACTATAGCG AAAAAATTTT ATATTTACCT AATTCCTTTC AATTTAATCA TGACAGAAGG GAACAAAACC ATCCTACTTT AAGACGAAGT GATTTTGGGC TCCCTGAGTC TTCATTTGTT TTTGTATGTT TTTGTGCAAA TTATAAGATT ACTCCATCAG TCTTTAATGT TTGGATGAGG TTACTTAAAC AAGTAGATGA TAGTGTATTG TGGCTCTATA GATCAAATAA ATGGGCAGAA ATAAATCTTA GGCGTCAGGC AGAGTCGAGA GATATAGATC CAGAAAGGCT TATTTTTGCA GGTCGCTTAC CTTTAAATAA GCATCTTGCA AGACACTCCT TAGCAGACCT ATTCTTAGAT ACCTTTAATG TAAATGCACA TACAACGGCA TCTGATGCCT TGTTAGCAGG TTTACCACTA TTAACTCTCG CTGGTAAAAG TTTTACCTCA AGAGTTGCTG CAAGTCTTCT TGTGACTTTG AACTTACCTG AGTTAATTAC ATATACAATT AAGGACTACG AGGAAAAGGC ATTAATGATC GCTTTGGACC CAAAACTTAA TAGAAGATTG CATGAAAAAT TAAAACTATC AATTAAAGAG TCTGCTTTGT TTAAACCGGA ATTAACGACT AAATCACTTG AAGATATTTA CAAAGAACTC GTTGTAAAAC ATCGTTAA
|
Protein sequence | MSLEDNPFNH LQRARHFYEK KYSRSTTEID ISNNKKIHIG YFSSDFYDHA TLHLISKLFE LHDKAVFKIY AYSIGSNPSD HYTYHLVSNV EVFRDIHLVD DQSAVSIVRK DNLDIAIDLN GYTKGNRFSI FANRIAPIQI NYLGYPGSTG AECIDYLIAD KVVIPERFEK YYSEKILYLP NSFQFNHDRR EQNHPTLRRS DFGLPESSFV FVCFCANYKI TPSVFNVWMR LLKQVDDSVL WLYRSNKWAE INLRRQAESR DIDPERLIFA GRLPLNKHLA RHSLADLFLD TFNVNAHTTA SDALLAGLPL LTLAGKSFTS RVAASLLVTL NLPELITYTI KDYEEKALMI ALDPKLNRRL HEKLKLSIKE SALFKPELTT KSLEDIYKEL VVKHR
|
| |