Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_01351 |
Symbol | |
ID | 4776396 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 148158 |
End bp | 150467 |
Gene Length | 2310 bp |
Protein Length | 769 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640085634 |
Product | hypothetical protein |
Protein accession | YP_001016155 |
Protein GI | 124021848 |
COG category | [S] Function unknown |
COG ID | [COG3551] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCATTCAA GTGGAGCTTC CCTTTTAGCT GAACAGTTGC AAAATCTAGG AATTTTTCTA CCTGGCCAGA TGATTGCAGC TGATAATGAT AATCCTCAGG GTTACTTTGA ATGGGATGAG GTAGTTGAAC TGCAAGAGAA TTTGTTGATT GCTCTAGATC GATGGTGGCC CTCTCATACT GGTTCGTTTT GTTTACCATC AAATTGGTTG ATTCACCCCG CTACTCATAA ATTTCGTGCT CAGCTTACTA ACTTGCTTCG TCATCAAATT TCTACTAGTC AAGCGCTATT CTTAATCAAT GACCCTCGTA GCAGTATTTT ACTGCCTCTC TGGCGTGATA TTTGCAACCA ACTTGATATC TCTCTTCGTT TGATTCTGGC ATTTCGCAAA CCGGATGATG TGGTTTCTTC GATCATGTCA TGTAACGAGC GTCTGGCAGG TATGACATAT TGGAGAGCTC AACAGCTTTG GTGGAGGTTT AATTCATCCG TACTTGCCTC CGTGCCTTCC AAGTGCGAAG CAGAGCTACT AGTTGTTCAT TACGACACTT GGTTTGATGA TCCTATAGGT CAGGCACTTT TCTTAGCATC TCATCTTGCT TTAGAAAAGC CGAATTCAAA TCAGCTTAAT TCTGTCCAAA AAGCAATCTT CTTTCAACAG CAGAAAGCTA AGTCTCTTCC AGATTATGCA CCGCCTTTGG ATGATCGAAT TAGCAACCTT TATCATTGGC TGTCTAAGCA AAAGACTGTT CATTTGCCTC TTAAACTGTT TAAAGGCTCT TTGCAACCAA GGCGCACTTT TCGGCACAAA ATCTTCCATA GAATTGACTG GTTATGGCTT ATCCGTTCTT CTTTATTGCC GAAGGGCGGA TTGTTTGCCT ATAGAAAAAA CTTTTTGCAG GGTGTTGGTG CAGGACCTTT AGCTTTACCT GTTTGGATTG CGCGGCAAAG GCCAAGCTTG TTGCGCTATC ACCGCGATCC TTTGGCTTGG TATCAACGCG TTGGTTGGCG TTTGGGTGTG AACCCTCATC CATTACTGGA GTCAGCTCGG CTCTGGTCGC ATCTGGGATT CCAGAAAGAG GCGGTTGCCC TTTATCGGCG AGAGGCCATG TTTGAAAACA TTCCGGTCCA TCCCCGTTTT GACTCTGTGT ATTACAGGCA GCAATGTCGG AATGCCTATT GCATTCCTCA ACCCACGCCC TTGGAGCATT ATTTGGTCGA GGGTTGGCAG CAAGGCCTGG CCCCGCATCC AGCTGTCGAT CCACTCTGGA TGAAGAGGCG GCATGGCTTG CCTGGTGAAC CACTGGTGGC CTTGATCCTC GATGGGGGAG ATCCCACTGA CCCCGGCCTG ACTCATCCCT GCGGCAATCT TTATGGTGCG GCCCTAGCCG AGCCACTGTG CTCCACTCGC CTGCCCGTTG CCCTTGTTGA TCTGCTGCGA CTTTGGAACC AGCGAGGACT ATGGCCAGCA GAACGTTGGC TTGATCATGA GTGCATGCAA GATCCTTTGC CGAGTTTCAA TCTTTTTGAT ACTGAGCAGG CATCTTTGTT TGCCTTGGGC TTGCAAGTTC AATTGACTTC TGCTTCACGT CTGCAGATGC CCCCAACCTT AGGTTTTGGC CATGATTTGC CTTGGCGTGC AGAGCGGTTG TTGGCTGGTT GCAGTGATCA ACTCGCCATT ACAAAATCCG CCTCGGTAAG GCTGCATGTG TTGGAGGATG CTGATGATTG TCTGCGCTGG CAACAGACGA GTTCCCCTGG AGATTGGCTG ATCAATTTCC ACTGGCCTCC TACTAAAAGT CTTGCTAGTT GGATCCAGGG TCTACGCGGC ATGGAAGCAG TGCTGGATCC TGATCCTCAG CGAACAGCTT TTTTGCAGTT GTTTGGCGTT AAGGCTGTTC ATCAGCCTTT TCAACCTTTG GAGTTTGCAG CTGGTAGTGA TGAGGATCTC TTGCGTTTGG CACAGTTGAA ACTTGGTCTG CCAGATCCCC GTTGGTTTGA GCCTTCCCTT GAGCTCGCTG TTATTGGTAG CAGTGGGCCG ACCCAGGAAC GGCGCTGGGG AGAGTTGGGC CTGAAGTTAG AAGCTGCTGG ATTGCTGCTG TTGCCGCGTT TGCCTCAGAT TGAAATTGCC AACCTTGATC AGCTCAAGGC GTTGCAGGCT TGGCTCAATC AATTGGCTCA GAATTGTAAG AGGGTTCTCT GGCTTGAGCC AATACAGCAA GGTGCTTGTC AGTTGTCATC TGAAGCCGTT GTGTTGGCTC CAGAAGTAGA GCTGGATTTG CTTCTGCAAT GGGAATCTCG TTGCCGCTGA
|
Protein sequence | MHSSGASLLA EQLQNLGIFL PGQMIAADND NPQGYFEWDE VVELQENLLI ALDRWWPSHT GSFCLPSNWL IHPATHKFRA QLTNLLRHQI STSQALFLIN DPRSSILLPL WRDICNQLDI SLRLILAFRK PDDVVSSIMS CNERLAGMTY WRAQQLWWRF NSSVLASVPS KCEAELLVVH YDTWFDDPIG QALFLASHLA LEKPNSNQLN SVQKAIFFQQ QKAKSLPDYA PPLDDRISNL YHWLSKQKTV HLPLKLFKGS LQPRRTFRHK IFHRIDWLWL IRSSLLPKGG LFAYRKNFLQ GVGAGPLALP VWIARQRPSL LRYHRDPLAW YQRVGWRLGV NPHPLLESAR LWSHLGFQKE AVALYRREAM FENIPVHPRF DSVYYRQQCR NAYCIPQPTP LEHYLVEGWQ QGLAPHPAVD PLWMKRRHGL PGEPLVALIL DGGDPTDPGL THPCGNLYGA ALAEPLCSTR LPVALVDLLR LWNQRGLWPA ERWLDHECMQ DPLPSFNLFD TEQASLFALG LQVQLTSASR LQMPPTLGFG HDLPWRAERL LAGCSDQLAI TKSASVRLHV LEDADDCLRW QQTSSPGDWL INFHWPPTKS LASWIQGLRG MEAVLDPDPQ RTAFLQLFGV KAVHQPFQPL EFAAGSDEDL LRLAQLKLGL PDPRWFEPSL ELAVIGSSGP TQERRWGELG LKLEAAGLLL LPRLPQIEIA NLDQLKALQA WLNQLAQNCK RVLWLEPIQQ GACQLSSEAV VLAPEVELDL LLQWESRCR
|
| |