Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | A9601_11701 |
Symbol | |
ID | 4717883 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. AS9601 |
Kingdom | Bacteria |
Replicon accession | NC_008816 |
Strand | - |
Start bp | 983774 |
End bp | 985357 |
Gene Length | 1584 bp |
Protein Length | 527 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 640078885 |
Product | hypothetical protein |
Protein accession | YP_001009561 |
Protein GI | 123968703 |
COG category | [S] Function unknown |
COG ID | [COG1543] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0208787 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATGGGC AATACTCCCC AAAAAATGTT TTAGGTCAGT TAGCGATAGT TTTACATGCT CATCTACCTT ATGTCAGAAA AAATGAAAAG AATTCCTTAG AAGAGGATTG GTTATTTCAG GCAATTTTGG AATGTTATGT ACCACTACTT CAATCAATAG AATCTTCCAA AAAAGAAAAT CCTGAAAATA CAAAACTTAC TATTAGTTTG TCTCCAACTT TATTATCACT TCTAAATAAT AAACAAATTC AAGAAACTTT CCCAAGCTGG ATTAAAACAA GGAATGATTT CTTAAACGAA CTACCAATAG AAGAAAAAAA TGCCTCTGCA TTTTTAATTA AGAATCTCAA TGAAAAATAC TCATATTGGC AAAATTGTTC TGGAAATTTA ATTGAGAAGT TTAGGGTCTT AAATAACTCT GGAAATTTGG ATATTCTTAC TTGTGCGGCT ACTCATGGAT ATTTGCCGAT TCTAAGGGAG AATCCTGAAA CTGTTAAAGG CCAAATTAAT ACTGCAATAA GGAGCCATAA AAATATTTTT GGAACAAAGC CTTTAGGTAT TTGGTTACCT GAATGCGCTT ATTATGAGAA TTTAGACGAA ATACTAATTA ATTCCGGAAT AAGATATGCA ATCTTAGACG GTCATGGGAT TCTTAATGCG ACACCAAGGC CTAGGTATGG GGTGTACGCC CCAATCTGCT CGAGAAAAGG AGTTGCCTTC TTCGGAAGAG ATAGTGAGTC AACATTGCCC GTTTGGTCTG CTAAGGATGG ATTCCCGGGA GATAAAGTTT ATAGAGAATT TCATAAAGAT TTGGGATGGG AATTGCCTAT CTCTAAACTC CAAAAGAAAG GTATTTCAAC TAAAAGACCT TTGGGTTTGA AGTTTCATAA GATTACAGAT GATAAGGTAC CCCTAGGGGA AAAGGCGTTT TACTTAGAAA ATGAAGCCAA AAAGAAGGCT GCAGAACATG CTGATGATTA TCTTCTCGCG AGATCCAAAC AATTAGAAAA ATTAATATTA TCCTCTTCCT TCAAGCCCTT ATTGGTAGCT CCATTTGATG CAGAGTTATT TGGTCATTGG TGGTATGAAG GGCCTTTTTT TATTGAAAAT ATTTTAAAGA ACTCTAGTAA ATATTCAATT AAGCTTACAA ATTTAAAAGA ATTTTTACTT CAAAAGCCAA AACTTCAGAT TTGCGATCCA TCACCATCAA GCTGGGGACA AGGAGGTTAC CACAACTACT GGATTAATGA TGCAAATGCA TGGATTGTCC CAGAGATCAC AAAAGCAGGC TCAACTTTTG TTGATTTATG CTCGAAAAAT TTCAATAATG ACTTGTCCAC AAGACTTTTC AAGCAAGCAG CAAGAGAATT ACTTCTCTCT GAGTCTTCTG ATTGGAGTTT TATCCTAAGA GCTGGAACTA CAACTGAGCT TGCAAAAGAG AGGATAGAAA GACACTTGTT TAGATTCTGG AAAATAGTTG AAATGATTAA AAATCATTCC AATATTAATT TAAAATTTCT TGAAGATATC GAGGAAGAAG ATAAAGTTTT TCCAGATATT AATATTGATG ATTGGCGAAA ATAA
|
Protein sequence | MNGQYSPKNV LGQLAIVLHA HLPYVRKNEK NSLEEDWLFQ AILECYVPLL QSIESSKKEN PENTKLTISL SPTLLSLLNN KQIQETFPSW IKTRNDFLNE LPIEEKNASA FLIKNLNEKY SYWQNCSGNL IEKFRVLNNS GNLDILTCAA THGYLPILRE NPETVKGQIN TAIRSHKNIF GTKPLGIWLP ECAYYENLDE ILINSGIRYA ILDGHGILNA TPRPRYGVYA PICSRKGVAF FGRDSESTLP VWSAKDGFPG DKVYREFHKD LGWELPISKL QKKGISTKRP LGLKFHKITD DKVPLGEKAF YLENEAKKKA AEHADDYLLA RSKQLEKLIL SSSFKPLLVA PFDAELFGHW WYEGPFFIEN ILKNSSKYSI KLTNLKEFLL QKPKLQICDP SPSSWGQGGY HNYWINDANA WIVPEITKAG STFVDLCSKN FNNDLSTRLF KQAARELLLS ESSDWSFILR AGTTTELAKE RIERHLFRFW KIVEMIKNHS NINLKFLEDI EEEDKVFPDI NIDDWRK
|
| |