Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9211_01141 |
Symbol | |
ID | 5731684 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | - |
Start bp | 117071 |
End bp | 118021 |
Gene Length | 951 bp |
Protein Length | 316 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 641284457 |
Product | hypothetical protein |
Protein accession | YP_001549999 |
Protein GI | 159902655 |
COG category | [S] Function unknown |
COG ID | [COG4243] Predicted membrane protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.450714 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATATGA GCTCATCTCG CCTTAAAAGT CGACGCCGTA ATGAACAGGG CTCCAAAGTG GCAAGAATTA CCATTGCAAT ACTTTCAACA ATAGGAGTAA TAGATACAGG TTCTATCACA CTACATAAAT GGGGATGGGT TGGCGCACTT ACCTGTCCCG GTGGTGCCGC AGGGTGTGAC AAAGTTCTAA ATAGTCCATG GGGCAATATT TTTCAGGGGA ATGGTTATTC AATTCCCCTG TCTTTTATTG GTTTTCTCAG CTATTTGACA GTATTAATTT TAGCAATATT GCCTTTCCTT CCAATACTTG CTGAAAGAAA AAGGGATTTC TCAAGAGCTA CATGGTGGAG TCTGTTTTTT CTCTCAACTG GTATGGCTAT TTTTAGCCTT CTTCTAATTG GGATAATGCT TTTAAAAATA AAAGCATTTT GTTTTTTTTG TATTCTTTCG GCATTTCTTT CAATATCAAT TTTAATTTTA ACTATGATTG GAGGAGCATG GGATGATCCA AGAGAAATGA TATTCAAAGG TTTCCTAATA TCAATTACTG TATTACTAGG TGGTTTAATT TGGTCCTCAT CTGTTGACTC AAGTCCATTA AAGGCTGGGC TAAATCCTGA AGCAGGTTCA GCCCCAATAG TACTTTCAAA AAGCACACCT TCTGCTATAG CCCTTGCAGA ACATTTAACC TCTATAGGAG CAGTTAAATA CTCTGCATAT TGGTGCCCTC ATTGTCATGA GCAAAACGAA ATGTTTGGGA AGGAAGCAAG TTCGAAGCTT TTATTAGTGG AATGTGCTCC AGATGGAATC AATAGCCAAA CTAAGCTTTG CCAAGAAAAA GAAATCACAG GATTCCCATC ATGGGAGATA AATGGAAAAA TTGAAGCTGG AATAAAGTCT CTAAATGAAT TAGCAAATAT TAGTAATTAC AAAGGTCCTA GGGATTTCTA G
|
Protein sequence | MNMSSSRLKS RRRNEQGSKV ARITIAILST IGVIDTGSIT LHKWGWVGAL TCPGGAAGCD KVLNSPWGNI FQGNGYSIPL SFIGFLSYLT VLILAILPFL PILAERKRDF SRATWWSLFF LSTGMAIFSL LLIGIMLLKI KAFCFFCILS AFLSISILIL TMIGGAWDDP REMIFKGFLI SITVLLGGLI WSSSVDSSPL KAGLNPEAGS APIVLSKSTP SAIALAEHLT SIGAVKYSAY WCPHCHEQNE MFGKEASSKL LLVECAPDGI NSQTKLCQEK EITGFPSWEI NGKIEAGIKS LNELANISNY KGPRDF
|
| |