Gene Information |
Locus tag | P9303_23431 |
Symbol | |
ID | 4778770 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 2059039 |
End bp | 2060067 |
Gene Length | 1029 bp |
Protein Length | 342 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640087864 |
Product | glycosyl transferase family protein |
Protein accession | YP_001018343 |
Protein GI | 124024036 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0463] Glycosyltransferases involved in cell wall biogenesis |
TIGRFAM ID | |
|
|
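The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched in a few lines. This is a minimal sketch that assumes the Start bp and End bp values are 1-based and inclusive and that the reported protein length excludes the terminal stop codon; both assumptions fit the numbers in this record but are not stated by it. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a complete CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp rows of this record.
length = gene_length(2059039, 2060067)
print(length)                  # 1029, matching the Gene Length row
print(protein_length(length))  # 342, matching the Protein Length row
```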
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGGCAC CACTCTCCCC TGAAGGGCAG CGCCTAAATT GCTCTCTACT GCGGACGGCC TCCGGCATGA CCTCGCCCAT GGACCTCTCA GTCGTAGTGC CCCTCTACAA CGAAGAGGAA AGCCTGCCAC AGTTGGTAGA TCAACTCCTT GCAGCCTTAC GCCCTACTGG TGAAATCTTC GAGCTAGTGC TCGTGAATGA TGGCTCCACC GACAGCACCG CCAAGGTGCT GGAGCAGCTA AGTATTAAAA CGCCAGAACT GGTGGGCGTA CTTCTACGCA AAAACTATGG CCAAACAGCT GCTATGTCAG CAGGGTTCGA CATATCACAA GGCAAGGTGA TTGTGTGCCT GGATGGCGAT CTACAGAACG ATCCCGCAGA TATCCCACTA CTACTTGAAA AGTTGAATGA GGGATACGAC CTCGTAAGCG GCTGGCGTCA TCAACGCCAA GACGCTGCCC TACAGCGCAA GCTGCCCTCA AAAATTGCCA ATCGCCTGAT CGGCCGCTTC TCAGGTGTAA AGCTGCATGA CTACGGCTGC TCACTAAAGG CCTACCGCAA AGAAATCCTC AGCGACATGC GTCTCTATGG CGAGCTGCAT CGCTTTCTAC CCGCACTGGC ATTTATAGAA GGCGCAAGGA TCACTGAAGT GAAGGTGCAC CACCGTGCAC GCCAATACGG CAGCAGCAAG TACGGCATTG ATCGGACCTT CCGTGTGTTG ATGGACCTTT TCACCGTTTG GTTTATGAAG CGCTTTTTGA CCCGACCGAT GTACTTCTTC GGCTTCGGTG GACTAGTGGC AATGAGCGGG AGCCTACTGA CGGGTATGTA TTTACTGGCG ATCAAGCTGA TGGGGGAAGA CATTGGCAAC CGCCCACTAC TCACCATGGC TGTGGTACTC GGCCTGACGG GAGTCCAATT GTTCTGCTTT GGACTACTAG GCGAGCTGCT GATGCGCACT TACCACGAAA GCCAAGGACG ACCGATCTAC CGCATACGGG CGACGTTGCG CGGCGGCGGA ACATCCTGA
|
Protein sequence | MTAPLSPEGQ RLNCSLLRTA SGMTSPMDLS VVVPLYNEEE SLPQLVDQLL AALRPTGEIF ELVLVNDGST DSTAKVLEQL SIKTPELVGV LLRKNYGQTA AMSAGFDISQ GKVIVCLDGD LQNDPADIPL LLEKLNEGYD LVSGWRHQRQ DAALQRKLPS KIANRLIGRF SGVKLHDYGC SLKAYRKEIL SDMRLYGELH RFLPALAFIE GARITEVKVH HRARQYGSSK YGIDRTFRVL MDLFTVWFMK RFLTRPMYFF GFGGLVAMSG SLLTGMYLLA IKLMGEDIGN RPLLTMAVVL GLTGVQLFCF GLLGELLMRT YHESQGRPIY RIRATLRGGG TS
|
| |
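The sequence fields can be checked against the GC content and Translation table rows with a short script. The following is a sketch only, not the method the database itself uses: it strips the 10-base spacing, computes percent GC, and translates with the amino-acid assignments of NCBI translation table 11, which are the same as those of the standard code (table 11 differs only in its permitted start codons). The helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group the sequence and upper-case it."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases and last 9 bases of the Gene sequence field above.
head = clean("ATGACGGCAC CACTCTCCCC TGAAGGGCAG")
tail = clean("ACATCCTGA")
print(translate(head))  # MTAPLSPEGQ, the first ten residues of the protein
print(translate(tail))  # TS, the last two residues, followed by the TGA stop
```

Passing the full Gene sequence field through `clean` and then `gc_percent` or `translate` gives values that can be compared with the GC content and Protein sequence rows.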