Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9211_03631 |
Symbol | |
ID | 5731703 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | - |
Start bp | 340145 |
End bp | 341599 |
Gene Length | 1455 bp |
Protein Length | 484 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 641284712 |
Product | putative neutral invertase-like protein |
Protein accession | YP_001550248 |
Protein GI | 159902904 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.072592 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAGGTC GTTTTAGTCA ACAGCACAAA AGGTTGAGAC CCAACTCTAA TGAGGATGCG GTTATCAAAA GAGCTCAAGA GCATTTCGAG AGATCACTAG TTGAAATTTC TGGGAGCATT TCAGGCAGTG TTGCAGCTCT TGAGCATCCA GCAAACAATG ATGCATTGAA TTATGGAGAG ATCTTCCTTA GGGACAACGT GCCCGTAATG ATTTATCTAT TGACGCAAAA TAGATATGAC ATAGTCAAAA AATTCTTAAC TGTATGCCTT GATCTTCAGA GTACTACTTA TCAAACAAGA GGTATTTTCC CTACTAGCTT TGTTGAAGAG AACGGAGAAC TTATTGCAGA CTATGGGCAA AGATCGATAG GAAGAATTAC ATCTGCAGAT GCGAGTTTAT GGTGGCCAAT CTTGTGCTGG TTATATGTCA GGAAAAGCAA AGATACCAAT TTTGGAGTCA GTCAGCAAGT GCAGCGTGGA GTGCAACTGC TTTTAGACCT AGTTTTACAT CCCACTTTTG AAGGAACTCC TGTCCTTTTT GTACCTGACT GTTCATTCAT GATTGACCGT CCCATGGATG TATGGGGTGC TCCCTTAGAA GTTGAAGTTC TCTTATATGC ATGCTTAAGC AGTTGTATAG AACTTATGGA TCTAAGCAGC AAGCACCAAG TAAGCCGCCT GCTAGACCAA AGGCTTCTTC TTACAAGGCA ATGGGTACAT GATCTGAGGC AATTTCTTCT TAAGCACTAT TGGGTCACCA GTAAAACAAT GCAAGTACTA AGGAGAAGGC CTACAGAACA ATATGGAGAA GACCAGCATC AAAATGAATT TAATGTTCAA CCTCAAGTTG TCCCCTCTTG GCTGCAAGAT TGGCTAGAGA ATCGTGGTGG ATATCTGATT GGGAATATTC GAACTGGACG ACCTGATTTT CGTTTTTATA GTCTTGGAAA TTCGCTTGCA TGCATGTTCG GAGTATTAAC TGCTCCACAA CAGAGAGCAC TCTTCAGATT GGTTCTTCAC AACCGACAAC ATCTAATGGC ACAAATGCCT ATGCGAATAT GCCATCCCCC AATGGAAGTA GAAGAATGGC AAAACAAAAC TGGATCTGAT CCTAAGAATT GGCCATGGAG CTATCACAAT GGAGGGCATT GGCCCAGCAT TCTTTGGTTC TTTGGAGCAT CAATACTTAT GCACGAGAAA AGATATCCCA AAGCAGATGT TCTGTTGATG GGCCAAATGC GCACTCTGTT GGAAGAATGT TACTGGAGTC AATTAAATCA ACTTCCAAAA CAAAAATGGG CCGAGTATTT TGATGGTCCT ACAGGAACCT GGGTAGGTCA ACAATCAAGG ACATACCAAA CTTGGACAAT TGTTGGCTTT CTTCTTTTAC ATCACTTTCT CAAAGTATGT CCTGATGATA TATCAATGCT CGATCTAGAC TTAGAAAAGA CCTAG
|
Protein sequence | MAGRFSQQHK RLRPNSNEDA VIKRAQEHFE RSLVEISGSI SGSVAALEHP ANNDALNYGE IFLRDNVPVM IYLLTQNRYD IVKKFLTVCL DLQSTTYQTR GIFPTSFVEE NGELIADYGQ RSIGRITSAD ASLWWPILCW LYVRKSKDTN FGVSQQVQRG VQLLLDLVLH PTFEGTPVLF VPDCSFMIDR PMDVWGAPLE VEVLLYACLS SCIELMDLSS KHQVSRLLDQ RLLLTRQWVH DLRQFLLKHY WVTSKTMQVL RRRPTEQYGE DQHQNEFNVQ PQVVPSWLQD WLENRGGYLI GNIRTGRPDF RFYSLGNSLA CMFGVLTAPQ QRALFRLVLH NRQHLMAQMP MRICHPPMEV EEWQNKTGSD PKNWPWSYHN GGHWPSILWF FGASILMHEK RYPKADVLLM GQMRTLLEEC YWSQLNQLPK QKWAEYFDGP TGTWVGQQSR TYQTWTIVGF LLLHHFLKVC PDDISMLDLD LEKT
|
| |