Gene Information |
Locus tag | P9211_09661 |
Symbol | |
ID | 5731079 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | + |
Start bp | 858720 |
End bp | 860375 |
Gene Length | 1656 bp |
Protein Length | 551 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 641285333 |
Product | hypothetical protein |
Protein accession | YP_001550851 |
Protein GI | 159903507 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
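The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not say so explicitly, but the listed gene length is consistent with that reading) and that the CDS ends in a single stop codon. The function names are illustrative and are not part of any database API.

```python
# Values copied from the Gene Information fields above.
start_bp = 858720
end_bp = 860375
gene_length_bp = 1656
protein_length_aa = 551


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = expected_protein_length(computed_length)
print(computed_length == gene_length_bp)       # True
print(computed_protein == protein_length_aa)   # True
```

Both checks pass for this record: 860375 - 858720 + 1 = 1656 bp, and 1656 / 3 - 1 = 551 aa.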
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.308392 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.00593947 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGAAAACT ATATAGAAAA GAAATTGACC GAAGGTGGAT ACCAACAAAT CTTGTTGTTA GCTCCATCCT TGCTTGGGGA GTCTTTAGCA GCTCAATTGC AATCTGCCAA TAAATCCAAT GAGATTATAT TGCGACAAGA AAATTTAACT AAAGCTCCAG CTCTTGTTAT ATGGGCAATT GATAATGTAG TAATTCCATC GACGATTAGA TTTGAGCTAA GGACGCTATC CGAAAGATGG GCACCATCGC CAATACTTCT TCTACTGCCA AGTAAAACAT CAATACCTCC TAATGAAATT TTGAATTTTG AAAGTGATGG AATACTCCAA GATCCAGATA TAAAGACATT AGTTGACTCT ATTTCAACAA TTCTAGAAGG GGGGAGGGTA TTTCAATTAA AACAAGCTCA GAATCCAATA AATAGCACTA GGAAAAGATC AATTGGTATA GGTCAGTATC TTTTGAAACA AGGGATAGAC CAAATTGATA TGAAAATTGC TCAATTGGAA CCAATTCTTA GCCCTCTACC CATAAACCCC TTTCTGCGTA TAGCCATAAA CGGTAGGAAG AGAGAACTAA ATAGCGCAAA AAACATATTA ATTTGGGTAT GGGGACCAAT TCATAACATG CCAATAAGTG ATATTTCAGC TAAATTAACT TCAGATATTG ATTATATATA CGATAATTTT GTAGCAGATA TAGTACTCCC CCAAAGGGAT TCGAAGGCTG TAATAGAACT CATAATAACT CGCCTGAGAA AATCAGTAAG TGATCCTTTA TCAAACTCTA CAGGGACTAT ATTTGCCTTA CAGGCAATTA CTGAATGCAA ACAAAAAACA CTTTTACTGG AATTGATTAC TCAACTAGAA AAGTTACTAT TGCGGTTAAT TTCCCTGGAT AAGAATGAGT CTAAAATAAT AGATACCTGG AATTCATTTC AACTTAACCT TCGCAAAGAG GCTATCCGCT CAATAGCAGA ACCTTATACA ACAATAGAAT ATGAAGGAAA CTCTGTACTA TTAAGAGATC GTCTAGAGAA ACTAACTGAA TTAGACGAGA TTGATGAGGA TATGCCTAGT CCTAAAAATA TTGTTCAAAC CCTCATTTTA AATGAATCCT TAAAAGTTGA TGACCAATAC CTTCCCTACG ATCACCCAAA GTCAGTTATA AGAACGGAAA TGATCTTAAC TAATTGGCTT ATAAGAACAG CTGAAATTAT TAGTTCAGAG CTTCTTAATC AAGCATCAAT TTGGCCAGAC CTTAGACAAT ATTTGCTAAC TTCAAATCTT ATTTCTACAA GAGAACTTGA ACGTCTTCGC AATCAATTAA ATTCGCAATC TAGAATACAA AGTCTATTTA CTCGTCCTAT TCATTTATAC GAAAGTAAAA GACTTCTCTA CCGTATCAAC CAAAGCTCTA TTGAATCTTA TATATTAACA GAGTTACGAG ATAAAGAATT AAGGGAACTG GGTTGGCTCC AAAAACAAGT TACGTTATTA GTAGAAGCAA GAGATGCATT GGCTCCGCAG ATACAATCCC TGGTAAAATA TATAGGTAAT TTCATGGTGA TACTACTAAC TAACGTACTT GGTCGTGCCA TTGGTTTAGT TGGCAAAGGA ATAGCTCAAG GGATGGGTAG ATCTCTATCC AGATAA
|
Protein sequence | MENYIEKKLT EGGYQQILLL APSLLGESLA AQLQSANKSN EIILRQENLT KAPALVIWAI DNVVIPSTIR FELRTLSERW APSPILLLLP SKTSIPPNEI LNFESDGILQ DPDIKTLVDS ISTILEGGRV FQLKQAQNPI NSTRKRSIGI GQYLLKQGID QIDMKIAQLE PILSPLPINP FLRIAINGRK RELNSAKNIL IWVWGPIHNM PISDISAKLT SDIDYIYDNF VADIVLPQRD SKAVIELIIT RLRKSVSDPL SNSTGTIFAL QAITECKQKT LLLELITQLE KLLLRLISLD KNESKIIDTW NSFQLNLRKE AIRSIAEPYT TIEYEGNSVL LRDRLEKLTE LDEIDEDMPS PKNIVQTLIL NESLKVDDQY LPYDHPKSVI RTEMILTNWL IRTAEIISSE LLNQASIWPD LRQYLLTSNL ISTRELERLR NQLNSQSRIQ SLFTRPIHLY ESKRLLYRIN QSSIESYILT ELRDKELREL GWLQKQVTLL VEARDALAPQ IQSLVKYIGN FMVILLTNVL GRAIGLVGKG IAQGMGRSLS R
|
| |
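As a rough illustration of how the protein sequence relates to the gene sequence, the sketch below translates the first three 10-base groups of the gene sequence with a hand-built codon table. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares for elongation (table 11 differs mainly in which codons may act as start codons). The helper `translate` is a hypothetical name, not a function from any particular library.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    # The record groups bases in blocks of ten, so drop whitespace first.
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence, as grouped in the record.
first_codons = "ATGGAAAACT ATATAGAAAA GAAATTGACC"
print(translate(first_codons))  # MENYIEKKLT
```

The output matches the first ten residues of the listed protein sequence. Passing the full gene sequence to the same function would be expected to reproduce all 551 residues, with translation stopping at the terminal TAA codon.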