Gene Information |
Locus tag | P9303_12041 |
Symbol | |
ID | 4777660 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 1050014 |
End bp | 1051939 |
Gene Length | 1926 bp |
Protein Length | 641 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 640086713 |
Product | hypothetical protein |
Protein accession | YP_001017218 |
Protein GI | 124022911 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
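The coordinates and lengths in the record above are mutually consistent, which a little arithmetic can confirm. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the reported protein length excludes the stop codon (the record states neither convention explicitly). The function names are hypothetical and used only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp 1050014, End bp 1051939.
length = gene_length(1050014, 1051939)
print(length)                  # 1926, matches "Gene Length"
print(protein_length(length))  # 641, matches "Protein Length"
```

Strand does not affect this check: a gene on the minus strand spans the same number of bases between its start and end coordinates.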
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.681037 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCATCC GGTACAGCTC TCTAGCCGAT CCTCTGATAG CCTACATAGA CAAAGATCAA GAAAATACAT CGGGCAGAGC ATCAAGCTTA CTGACCCCAA TCAACTACAA CGCAGATCTA TTTGCTAACG AGGCAAAGTT TTTCACTGGT AACTATGGAT TTGACGGATA TATTGGTGTA CCGGGCTTAC TCGGGCCAGG GGGCCAGCAG GCAGCTGCGC ATCACAATGT TGCCTGGGAA AGTGTTGATC CAGATCTGAG TCCAACGCTC CGCGCGCTAA CAAGTGCAGC GAGCGTTGTT GGATCTAAAG CTGTTTATGG CATGAATCTT TTGTTGCAAG ACACCATGCC ATTGGTGTTC AGTTATCCAG TCTTACCAAC AACCTTAGAT GGAAATGGAA GCGATTTTGA GATCACCCTG AACGATGGTT CGATCGTATC GCCTGCCCTT GCAGGATTCT TGCCAAACCT CGAATACAAT GAACGCCAAA CGATAGTGGT GGCAGGTGAT TTCGGCAATC GCCTCAACCC CGAAAGTGAA GGAGCCCGCT ATCCGGTATC GGTTCGAATC GTCAATGACG GCACACCCCT GCAGATGCTC TCAGCAAAAG GGCCTGTCTT TGCTACAGAA CTGTCCGTCG ATAGCAGCAA CTCCTATGTT CAAGGGGATG GCCCGAAGCT TGTTGCGGCC AAACTCAATA CCTTCAGTCC GCTTGGAGAA GGCGGACCTA TTGGCGTTGG CGCCACTTCA GCTAGCAACA GCGGGAGCGA TCTTTATGGG GATCAAGCCC AATATCGCCT ACGCCTTTAT ACCAGTGCAG GATTCTCACC AGATGGCATT GCAAGCCTGC AACCCAGCGA GTTCAATAAA TACTTTATTC TTGAAGCGAA AGGCGATAAC GGCGAAAAAA TTTCACTTAC TAAATCGAAT CAGGATTATC TTATTGGCAA ATATGGTTCC ATCAAAGTTG TAGGAATAGC AGATTTAGCA CCAGCAGGAA CAATAGAAAA TGCCGCTTAT GTGGAAGATC ATGATAATTA CTACGACATC ATCCTAGAAG GTGATCTCAG TGCAATTACG AGACTAAAAA GTGTTCGTAT GCCATCAAGG GGTAATTACC AGGCCGTTTA TAACCCTGGT GGACCCGGTA ATAATCCGGA CGCACAGGCT GCAGCACCAG GCCCGTTTAC GATGCCGAGC GTCGATCACA CCATTGCCAT CATCAACGAC CTTAATGGTG CGATGACTGC CACCTATGTC GAAATCGAAG GAGACGTACT GACAAATCCA TTGAGTAACT TGCCGGTTGG AAAGCTGCTA GGAGTAGCAG TGGAAGACAC GATCAGCGGC CAGCAAATTT ACGCGTACGA AGATCCCTAT GGACGTCGTT TCTACACAAG TTTCGAAGCC TCCAAAGACG TAGCCTCGGT ACTTCCTAGC AACCTCCTGA AGCCGAAGCC GATTGACCTG ATCGACACAA CAGGTTTTGC GCCCGACTCC AGCGTCATTA TTTCCGGGTC ATTTAGTCGC TCAGCATCTC ACAGCTCAAC ATTGCAGTTT TATGAGGTAG CAGGACCCGA TGGTGGTGTA GTTGACCCTG TTACTGGCAG AACGCTGATG CCCAATGAAA GCGGCTACAA CTATGTGGCG AGGAGCAATC TGCTTACCAG CCAAAATAGT TCATTGAAAA TCGAAAATAA GGAAATTAAT AAATTTCAGT TCAATGCCGA GGCCGGAAAG ATTTATGCAC CATTGCTGAT CAACGAAGTA ACTGGTGAAC AGTATTTTGC CTTCACTGGC 
GCTAACTCTG ACAAGTCCGC GCATTTCACT GCTCTTGGTC CCAATGGCTT TGGTATAGAG GATCTATTTG GTGGTGGAGA TAAGGATTTT GCCGACATGA TCGTGCAATA TACGATCACA GTCTGA
|
Protein sequence | MAIRYSSLAD PLIAYIDKDQ ENTSGRASSL LTPINYNADL FANEAKFFTG NYGFDGYIGV PGLLGPGGQQ AAAHHNVAWE SVDPDLSPTL RALTSAASVV GSKAVYGMNL LLQDTMPLVF SYPVLPTTLD GNGSDFEITL NDGSIVSPAL AGFLPNLEYN ERQTIVVAGD FGNRLNPESE GARYPVSVRI VNDGTPLQML SAKGPVFATE LSVDSSNSYV QGDGPKLVAA KLNTFSPLGE GGPIGVGATS ASNSGSDLYG DQAQYRLRLY TSAGFSPDGI ASLQPSEFNK YFILEAKGDN GEKISLTKSN QDYLIGKYGS IKVVGIADLA PAGTIENAAY VEDHDNYYDI ILEGDLSAIT RLKSVRMPSR GNYQAVYNPG GPGNNPDAQA AAPGPFTMPS VDHTIAIIND LNGAMTATYV EIEGDVLTNP LSNLPVGKLL GVAVEDTISG QQIYAYEDPY GRRFYTSFEA SKDVASVLPS NLLKPKPIDL IDTTGFAPDS SVIISGSFSR SASHSSTLQF YEVAGPDGGV VDPVTGRTLM PNESGYNYVA RSNLLTSQNS SLKIENKEIN KFQFNAEAGK IYAPLLINEV TGEQYFAFTG ANSDKSAHFT ALGPNGFGIE DLFGGGDKDF ADMIVQYTIT V
|
| |
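The gene sequence as listed begins with ATG and ends with TGA, so it appears to be given in coding orientation even though the gene lies on the minus strand. A minimal sketch of how the protein sequence can be checked against it is shown below. It assumes the standard codon assignments, which translation table 11 shares with the standard code (the two differ in permitted start codons, not in amino acid assignments). The `translate` helper is hypothetical, and only short fragments copied from the record are translated here rather than the full 1926 bp.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record.
print(translate("ATGGCCATCC GGTACAGCTC TCTAGCCGAT"))  # MAIRYSSLAD

# Last 9 bases of the gene sequence: two codons followed by the TGA stop.
print(translate("ACAGTCTGA"))  # TV
```

The first fragment reproduces the opening ten residues of the listed protein, and the second reproduces its final two residues before the stop codon.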