Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9211_00671 |
Symbol | |
ID | 5730870 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | - |
Start bp | 73267 |
End bp | 74487 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 641284409 |
Product | hypothetical protein |
Protein accession | YP_001549952 |
Protein GI | 159902608 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0763] Lipid A disaccharide synthetase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.997015 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGAGAGGC TGCATTCCAA ATTGTTGATG CAACCTAAAG CAATTGATTC CAAAATTTGT ATTCGTCTAG TGCTAGTGCC CTGCCCTAAC GCAACAGGCA ATGAACATAA AGTTGCTAAT CAATGGCATT ACTTTGAAAG TATTTCTCCT GCAAAACACT TTTGGACCCT GTTAGTTAAT CCTCATAAAT ACAGTTATTG GCCTAAAAAA GGTCTTGTCA TTTTCTTAGG GGGCGATCAA TTCTGGAGTG TATTGCTTTC AGCAAGACTG AGGTACTTGC ATTTGACTTA CGCAGAGTGG ATTGCAAGAT GGCCTTTTTG GAATAATCGA ATCGCTGCTA TGTCAGAGAA AATAAAACAA TCTTTACCAA AACATTTGAG AAAACGTTGC ACAGTTGTTG GGGATTTAAT GGCGGATTTA ACAAGCCAAG CAAAAGATAT AAACCCTTTA CCTAATAAGC AAGGTGAGTG GGTAGCTTTA TTACCAGGTT CTAAGAAAGC AAAGCTTTGC GTAGGCATTC CATTTTTTCT CGAATTAGCA GATGAATTAT CCAAACTATT ACCTAGTTGC AACTTCCTTT TACCTATTGC ACCAACGACC AATATTCAAG AGTTAGAAAC TTTTAGTAAT TCAAGAAAAA ATCCCATAGC AAATAACTAT CAATCTGGAA TCAAAAAAAT AATATCACTC AAGAATGATC AATCTTTGAA AGTAATGATT ACTAAAGCTG GAACAGAGAT AACCCTCATT GAAGAACATC CTGCACATAA TGCTCTTAGT CAATGTGACC TAGCATTAAC AACAGTAGGA GCCAATACTG CAGAGCTAGG TGCTTTGGCT GTACCAATGA TTGTAATTAT TCCTACACAG CATCTCAATG TTATGCAAGC TTGGGATGGA TTCCTTGGAA TAATTGGCCG GCTACCAATT CTAAAGTGGT TTATTGGCAT TCTAATCTCG TTCTGGCGTC TAAGAAAAAA AGGTTATATG GCCTGGCCAA ATATATCTGC AAATAGACTA ATAGTTCCAG AAAGAATTGG GACTCTTTAC CCAAAAGATC TTGCCCAAGA AGCATTTGAC TGGCTACAAT CACCAAATCG ACTGGAAGGT CAAAAAGAAG ACTTAAGAAG TCTCAGAGGT CAGCCAGGAG CTACTAAAAG AATGACTAAA GAAATAATAA ATTTGATATC AAAAGAATTT TTATCATCCA GTGATGAATG A
|
Protein sequence | MERLHSKLLM QPKAIDSKIC IRLVLVPCPN ATGNEHKVAN QWHYFESISP AKHFWTLLVN PHKYSYWPKK GLVIFLGGDQ FWSVLLSARL RYLHLTYAEW IARWPFWNNR IAAMSEKIKQ SLPKHLRKRC TVVGDLMADL TSQAKDINPL PNKQGEWVAL LPGSKKAKLC VGIPFFLELA DELSKLLPSC NFLLPIAPTT NIQELETFSN SRKNPIANNY QSGIKKIISL KNDQSLKVMI TKAGTEITLI EEHPAHNALS QCDLALTTVG ANTAELGALA VPMIVIIPTQ HLNVMQAWDG FLGIIGRLPI LKWFIGILIS FWRLRKKGYM AWPNISANRL IVPERIGTLY PKDLAQEAFD WLQSPNRLEG QKEDLRSLRG QPGATKRMTK EIINLISKEF LSSSDE
|
| |