Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9211_17851 |
Symbol | |
ID | 5731605 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | - |
Start bp | 1614323 |
End bp | 1615636 |
Gene Length | 1314 bp |
Protein Length | 437 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 641286171 |
Product | glycosyl transferase family protein |
Protein accession | YP_001551670 |
Protein GI | 159904326 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCTTTG CAGCTACAAA TGGCAGAAAT CGCCGCGGAA AGGCCAGCTT ATTCTTGATT TGCTGCGTTT TATTAGGCTT ATGTCCTTAT TTAATTCCCG CATCAATAAG CTTATTGCCA GCATTAATAT TGGCTGTATT GCTAGGAATT TATGGTTTGT CGATTGTTTT AAGAGAGATA GATAATCGTT TTGATCAATC TAAATTGGTT GCTTTTACTC CAGAGGAATA CCAATCTCTC CCAATGGTAG ATGTTGTGGT TTCAGCTAGA GATGAAGAGA ATGTTGTTGA GAGATTAGTG GAACGTCTAA TTTCTATTAG ATATCCAAAG GATAAAATAA CTAGAATGAT TATTGATGAT GGAAGTAAAG ATAAGACGTC CATTTTATTG AATCAATTAA CCCAAACTTT TCAAGAAATT CAAGTCTTAA ATCGATCAAG ATCTTCCGGA GGAGGCAAGT CTGGCGCACT TAATTATGCG CTATCTAAAC TGAATGGTAA ATGGATTTTT ATTCTTGATG CAGATGCACA GTTTAATGAT GATATTTTGT TGAGGATTAT TCCTTTTGCG GAGAAATATG GTTTATCTGC CGTTCAATTA AGAAAGGCAG TTATAAACTC AGGAAAGAAT TTATTGACTC ATTGCCAGTC TATGGAAATG GCAATGGATG CTTTTATCCA GCAAGGAAGA ATCTTCGTAG GAGGAGTGGG TGAATTAAGA GGAAATGGTC AGCTTATAGA GAGAAATATA TTGAATAAAT GTGGAGGCTT TAATGAAGAT ACTTTGACTG ATGATCTTGA TTTGAGTTTT AGGCTATTGA TTGTTGGCGC TAATGTCGGT TTGCTTTGGA ATCCTCCTAT TCAGGAAGAA GCAGTTGAAT CTTTAGGGTC TTTGTGGCGA CAAAGAAATA GATGGGCAGA AGGTGGATTA CAACGTTTTT TTGACTATTG GTCTTTTCTC TTTTCCGGAA GGCTCGGCTT TGTCAAAAAA CTTGACTTAG GATGTTTCTT CACTCTTCAA TATGTATTGC CTGTCGTATC TTCTGTAGAC TTGTTAATTG CCACTTGGAC TCACTCATTT CCCTTGTACT GGCCTTTATC ATGTATTGCT TTGAGTGTAT CTGGAGTGGC CTATTTTCGA GGTTGCAGAA GAAAATCTCA AGGCCCAGAC TTACCTTCTC CTAAATTACT TAGATTATTA ATATCTGTCA TTTACCTGAT TCACTGGTTT ATTGTTATTC CTTGGGTAAC AATAAAAATG GCAGTCTTAC CTAAGAAATT AGTTTGGAAC AAGACTACTC ACCAAGGTAA TTAA
|
Protein sequence | MAFAATNGRN RRGKASLFLI CCVLLGLCPY LIPASISLLP ALILAVLLGI YGLSIVLREI DNRFDQSKLV AFTPEEYQSL PMVDVVVSAR DEENVVERLV ERLISIRYPK DKITRMIIDD GSKDKTSILL NQLTQTFQEI QVLNRSRSSG GGKSGALNYA LSKLNGKWIF ILDADAQFND DILLRIIPFA EKYGLSAVQL RKAVINSGKN LLTHCQSMEM AMDAFIQQGR IFVGGVGELR GNGQLIERNI LNKCGGFNED TLTDDLDLSF RLLIVGANVG LLWNPPIQEE AVESLGSLWR QRNRWAEGGL QRFFDYWSFL FSGRLGFVKK LDLGCFFTLQ YVLPVVSSVD LLIATWTHSF PLYWPLSCIA LSVSGVAYFR GCRRKSQGPD LPSPKLLRLL ISVIYLIHWF IVIPWVTIKM AVLPKKLVWN KTTHQGN
|
| |