Gene Information |
Locus tag | P9303_25231 |
Symbol | |
ID | 4775938 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 2220592 |
End bp | 2222082 |
Gene Length | 1491 bp |
Protein Length | 496 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640088044 |
Product | hypothetical protein |
Protein accession | YP_001018519 |
Protein GI | 124024212 |
COG category | [S] Function unknown |
COG ID | [COG0391] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR01826] conserved hypothetical protein, cofD-related |
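
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not state the convention, but the listed lengths are consistent with it) and that the CDS ends in a single stop codon. The function names `span_length` and `coding_codons` are hypothetical helpers, not part of any database API.

```python
# Values copied from the record above.
START_BP = 2220592
END_BP = 2222082
GENE_LENGTH_BP = 1491
PROTEIN_LENGTH_AA = 496


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Both checks hold for this record under the stated assumptions.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```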
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | GTGCCCGGGA TGTCGAAGAA GGAAAGCTTT CGTACAGTTG CGTTAAAGAC GTATGGCCGG TTTGTCCCCG CGCATGTGAC CTCGCCCAAA CACCGCTTTA GTCAGCGCCG CTCTCATCAG CGCCGCACCA GTATCAAAGG GATAAAGGCT GCTAGCGAAA GGGGTCTAAT GATCCGTTCT CGCAGAGCAG TGAGTTGGCT GCTACCTGGA CTGGTCGTCA AGCGCTGGCT GCTCACTTCG GGCCTGGGCC TTCTCATGGC ACTCCTAGGA GCAGCCATCT GGGCGGACCT CAAGCCCATC TACTGGATCA TCGAGACCCT CATCTGGCTA CTGGGAACAA TCACAACTGT GCTGCCACGC AGCATTACAG GTCCTTTAGT GGTCTTAATC GGAGCCGCCC TAGTGCTGTG GGGACAAAGC CGCAGCTTTG GCTCGATCCA ACAAGCACTA GCGCCCGACA AGGACACCGT GCTCGTTGAC GCCCTTCGTG CCAAAAGCAA ACTCAACCGC GGCCCTAACA TTGTCGCCAT CGGAGGTGGA ACAGGCCTCT CCACTCTGCT CAGTGGTCTG AAGCGCTACA GCAGCAACAT CACAGCGATC GTCACAGTGG CTGACGATGG CGGCAGTAGC GGGGTCCTGC GTCGCGAACT TGGAGTTCAG CCCCCAGGCG ACATTCGTAA CTGCCTGGCT GCACTCTCAA CCGAAGAACC CCTGCTCACA CGACTGTTTC AATATCGTTT CTCAGCAGGT AGTGGTCTAG AGGGCCACAG CTTCGGCAAT CTCTTTCTCT CTGCACTATC AGCCATAACG GGCAATTTAG AGACAGCCAT CACAGCGTCT AGTCGAGTCC TCGCAGTTCA GGGTCAGGTC GTTCCCGCTA CCAATGCAGA TGTACAGCTC TGGGCGGAAC TGGAAAACGG CCAGCGCATT GAAGGGGAAT CAGCGATCGG CAAGGCCCCA AGTCCAATCG TGAGGCTTGG CTGCTTGCCC GCACAACCGC CAGCCCTGCC TCGCGCCCTA GAGGCAATCT CTAATGCAGA CCTGATTCTT CTAGGACCTG GCAGCCTCTA TACATCCCTA TTACCCAATT TGCTTGTACC GGCATTAGTT CGAACCATCC AGCAGAGCCG AGCACCAAAG CTCTACATCT GCAACTTAAT GACTCAACCC GGCGAAACAG ATGGCCTGGA TGTTGTTGGT CACCTCAGAG CAATCGAAGC CCAACTTGCC AGCCTGGGCA TCAGCCAGAA ACTATTCAAT GCCGTATTGG CCCAGGATGA CCTAGGAGAA TCCCCATTGG TGAAGCACTA TCAAGCACGA GGTGCTGAAC CGGTCAACTG TGACGCTCAA ACACTCATCG CCAAGGGCTA TGAGTTGATG CAAGCTCCAT TGCAAGGGAA AAGGCCTCGC GCAACTCTGC GCCATGACCC ACGCAGCCTT GCCCTAGCAG TGATGCGCTT CTATCGAAAG CATAAAAAGA ATGCTCAATA A
Protein sequence | MPGMSKKESF RTVALKTYGR FVPAHVTSPK HRFSQRRSHQ RRTSIKGIKA ASERGLMIRS RRAVSWLLPG LVVKRWLLTS GLGLLMALLG AAIWADLKPI YWIIETLIWL LGTITTVLPR SITGPLVVLI GAALVLWGQS RSFGSIQQAL APDKDTVLVD ALRAKSKLNR GPNIVAIGGG TGLSTLLSGL KRYSSNITAI VTVADDGGSS GVLRRELGVQ PPGDIRNCLA ALSTEEPLLT RLFQYRFSAG SGLEGHSFGN LFLSALSAIT GNLETAITAS SRVLAVQGQV VPATNADVQL WAELENGQRI EGESAIGKAP SPIVRLGCLP AQPPALPRAL EAISNADLIL LGPGSLYTSL LPNLLVPALV RTIQQSRAPK LYICNLMTQP GETDGLDVVG HLRAIEAQLA SLGISQKLFN AVLAQDDLGE SPLVKHYQAR GAEPVNCDAQ TLIAKGYELM QAPLQGKRPR ATLRHDPRSL ALAVMRFYRK HKKNAQ
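
The relationship between the gene sequence, translation table 11, and the protein sequence can be reproduced in a few lines. This is a minimal sketch, not the database's own pipeline: it builds the codon table from the NCBI amino-acid string for table 11 (identical to the standard code for sense codons), treats an alternative start codon such as GTG in the first position as methionine, and stops at the first stop codon. `clean`, `translate`, and `gc_fraction` are hypothetical helper names. Whitespace is stripped so the 10-base blocks shown in the record can be pasted directly.

```python
from itertools import product

# NCBI ordering: bases T, C, A, G with the first codon position varying slowest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove whitespace (e.g. the 10-base blocks) and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a CDS, reading the first codon as M if it is a valid start."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence above.
prefix = "GTGCCCGGGA TGTCGAAGAA GGAAAGCTTT"
print(translate(prefix))  # MPGMSKKESF, the first ten residues listed above
```

Passing the full gene sequence to `translate` and `gc_fraction` gives values to compare against the protein sequence and the GC content field in the record.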