Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9301_14191 |
Symbol | |
ID | 4912028 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9301 |
Kingdom | Bacteria |
Replicon accession | NC_009091 |
Strand | + |
Start bp | 1185388 |
End bp | 1186458 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 22% |
IMG OID | 640161010 |
Product | hypothetical protein |
Protein accession | YP_001091643 |
Protein GI | 126696757 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG5653] Protein involved in cellulose biosynthesis (CelD) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.205889 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTTTTA CTCATAAAAT ATTTGACTCT TTTAATTCAG AACTTATAAG TTGTTGGCTA AATTTAGAGC AAAATTCAAT ATGCAATGCT TTTAATTCTT TATTTTTTCA TAACTCTATT TACCATATAA ATAAAAAATT TAAAAAGAAT TACTGGCCAT TAATAATTTG CTTATATGAA AAAAATAACA TTGCAGCGAT ATTGCCTCTA GAAAAATTCA ATTCTTCAAA TACATTAGTT GTACATGGTA GCGAGTTTAT AGACTACGAT GGTATTATTT TTAGCAGACA AATTAATCAC AAAGAATTTG AAAAATATAT ATTAAAAAAC ATATTATCCA AATACGATAT TTTTTTTCAT TCAATTGGCA GCAATAATAA ATTAATTAAT TCTTTATCTA ATAAATTAAT GTATAAACTA AAGTGGAGAT ATTCGATCAG ATATTTTATT GATGCAAAAA AATATGATGA TTTCTTCTTA AAGAAAAAAA AGTTTATAAA AAAAACAACA AAAAACATTA CTAAAAACAA TCTTTCTAGT ATTGAAATTA AACTTCCAAA TGAAAAATTA TCTCATTTAG AGAATCACAT AATACTAAAG TCTTTGCAAT ATAATTCTAC AAATAATAGA AATCCATTTA AAAACAAAAG ATATAAAAAA TTACTTGAGT ATTTAATTAA AAATTATTCG AAGAATATAG TTATCTATAT CCTAGCTATT AACAAAAGTC TTCAATCAAT TCTAATAGCC TTCAAAAGTA AAGACAAATT TTGCTATTAT CAACCTTCTT ATTCTAGGGA ATCAACCTTG GAATCACCAG GAAAAATTTT AATGTATTAT GCCATTAAAA TTGCAAATAA AAACAGTATT GAATTTGATT TCAGTACAGG AAATGAAAAT TACAAAATGC TTTATTCAAC TGATAATGAA ATAATACATT CCTATTTTTT ATCAAATAAG TTATTTCCTA ATTTTTTAAA CTTAATAATA TTTTGGATAT TATATTATGA AAAGTCCAGG AAAATTTTAC GAAGTCTTTT TAATCAAGCA AAAAAATTTA TGAATTTTTA A
|
Protein sequence | MIFTHKIFDS FNSELISCWL NLEQNSICNA FNSLFFHNSI YHINKKFKKN YWPLIICLYE KNNIAAILPL EKFNSSNTLV VHGSEFIDYD GIIFSRQINH KEFEKYILKN ILSKYDIFFH SIGSNNKLIN SLSNKLMYKL KWRYSIRYFI DAKKYDDFFL KKKKFIKKTT KNITKNNLSS IEIKLPNEKL SHLENHIILK SLQYNSTNNR NPFKNKRYKK LLEYLIKNYS KNIVIYILAI NKSLQSILIA FKSKDKFCYY QPSYSRESTL ESPGKILMYY AIKIANKNSI EFDFSTGNEN YKMLYSTDNE IIHSYFLSNK LFPNFLNLII FWILYYEKSR KILRSLFNQA KKFMNF
|
| |