Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_16631 |
Symbol | |
ID | 4778776 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 1451450 |
End bp | 1452394 |
Gene Length | 945 bp |
Protein Length | 314 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640087172 |
Product | glycosyl transferase family protein |
Protein accession | YP_001017672 |
Protein GI | 124023365 |
COG category | [R] General function prediction only |
COG ID | [COG1216] Predicted glycosyltransferases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.505772 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTGGTTGC GAATGTTCGC AAGCGTCGTG ATACCCACCT ACAACCGTCG ATCGATCCTT GAGAAGTGTC TGTTGGCTCT TGAGCAACAG GTCCTTTTCT CTGAGCCAGA TGGTTATGAG GTGGTCGTGG TGGATGACGG CTCCACGGAT GGCACACCGA GCTGGCTGAG GGATGAGGCG TCTCGCTTTC CACATGTGCG CATGGTGGAG CAAGAGCATG GGGGCCCGGC AGAGGGGCGT AATCGCGGGG TAGACAATGC CAGAGGTGAT CTCATTGTTT TTATCGACAG TGATCTCGTT GTAACCGAAT CGTTTCTCGC CTGTCATGCG AGAGCTTTGC GAAAAAGTTG GCAGCAACGC GGTGATCGAC TTTCTTTCAC CTATGGCGCG GTGATCAACA CGGCAAATTT CGAGCAGCCC ACTACAGAAC CTCACAAATT CCAGGACAAC TCATGGGCCT ACTTCGCGAC AGGCAATGTG GCCATTGATC GTGAGGTGCT TGAACGTTCT GGACTATTCG ATACCAATTT CCGTCTGTAT GGCTGGGAGG ATCTAGAGCT GGGGGAACGA CTTCGACAGA TGGATGTTGA GTTGGTGAAA TGTCCCGAAG CAGTTGGTTA TCACTGGCAT CCTGCCCTGC GTCTTGAGCA AGTTCCTGAC CTGATCAGAG TTGAGAGGGA GCGGGCCAAA ATGGGCCTGG TCTTTTATCG CAAGCACCCG ACGCGACGGG TGCGTTTCAT CATCCAATTC ACCTGGCTTC ATCGGTTGTT ATGGGAGTTG CTTACGCTCG GTGGATTACT TAATGAGCGC AACCTCAGGC CTTTACTTGG CTGGTTGATC CGCAACGGTC ATCAAGGTTT GGCAATGGAG CTCTTGCGTT TACCCCTAAA CCGAATCGGG GTAAGAGCGC TCTTCAAAGA AGCAGCACTA ACAGGGCTGC GCTGA
|
Protein sequence | MWLRMFASVV IPTYNRRSIL EKCLLALEQQ VLFSEPDGYE VVVVDDGSTD GTPSWLRDEA SRFPHVRMVE QEHGGPAEGR NRGVDNARGD LIVFIDSDLV VTESFLACHA RALRKSWQQR GDRLSFTYGA VINTANFEQP TTEPHKFQDN SWAYFATGNV AIDREVLERS GLFDTNFRLY GWEDLELGER LRQMDVELVK CPEAVGYHWH PALRLEQVPD LIRVERERAK MGLVFYRKHP TRRVRFIIQF TWLHRLLWEL LTLGGLLNER NLRPLLGWLI RNGHQGLAME LLRLPLNRIG VRALFKEAAL TGLR
|
| |