Gene Information |
Locus tag | P9211_08111 |
Symbol | |
ID | 5731034 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | - |
Start bp | 712196 |
End bp | 714310 |
Gene Length | 2115 bp |
Protein Length | 704 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 641285175 |
Product | glycosyltransferase |
Protein accession | YP_001550696 |
Protein GI | 159903352 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family |
TIGRFAM ID | |
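The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions on the replicon, and that the CDS length includes a single terminal stop codon; neither convention is stated in the record, but both agree with the listed values. The function names are illustrative only.

```python
# Values copied from the Gene Information table above.
start_bp = 712196
end_bp = 714310
gene_length_bp = 2115
protein_length_aa = 704


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


computed_length = span_length(start_bp, end_bp)
computed_protein = expected_protein_length(computed_length)

print(computed_length == gene_length_bp)      # coordinates agree with Gene Length
print(computed_protein == protein_length_aa)  # CDS length agrees with Protein Length
```

Under these assumptions, 714310 - 712196 + 1 = 2115 bp, and 2115 / 3 - 1 = 704 aa, matching the record.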
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.000296604 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGAATTAT TTAGACCGCT TTATAGGATC AACAAAAAAC TTCTTTTTTG GGGACTAATT GCCATCATAT GGATCATTTC TACATATTAT GATCGAACTT GGTGGAACAA TAGTAATTTA GTTCCTGCAT GGGATCAGGC AGATTATTTA AATAGTGCTC TTTCTCATGG TAGAGCGCTT GGGGTATTAC AAGGTGGGGC ATGGGAAGGC TTTAGAGGAC TAATAAGCCA TTCTCCAAAA ATTCCACCTT TAGCCTCATT TGTAAATGGA ACAATAATTG CATTTTCGGG TGATGCTCCT AAAGACGCTG CCTGGAGCCT AAGTATTTGG CATGGATTAC TACTATTTAG TGTTGCAGGC ATAGGGGCTA AACTAAATGG AAGGTTTTTT AGCTGCATAT CAGTCATATT TGTATCTATA GCTCCTGCTC TTGTCGCCTT AAGGTTGAAT TATGTTCTAG AGATGCCTCT ATGTGCTGCA ACAACCTTGG CAATATGGCA ACTAGGTTCT TGGGTGAAAC CTGATAGTGA TAACAATAAA AATAGAGTCT ATATAGCAGC AGCTTGCTGT TCAATTGCCC TTCTAATTAA GCAAAGCTCA TTGCTAATAT TATTTCCCTC AGTTATATGG GCAATTATTA AAGTAGTAAA TACAAAAGAA ATTAATAACA GGCAAATATT TTACTTTTTA TTCATTACAT TTAGCAGCAT TCTGCCTTGG TTTAAACACA ATTGGATTAC AGTTATTAGC GGAACAAATA GGGCTGTTAT TTCATCTGCT ATCAATGAGG GAGACCCGTC TATATTTAAT TTAGATGGGT GGACTTATTA TTTCAGTATT ATACCTAAGC AAATAGGTAT ATTCTTTTTT ATAATTGGTC TAAGTGGTTT AATTTTCCAC TTCATTAAGA GAATTAAAGA GTTAAAGATC TACAATAGTA ATGAACAGCT AGGTAAAGTA TTTACAGCAG ATAGTTTTTT CTTTAAATGG CTGGTATTTA ATTCCATCTT TTGCTGGTTC CTTACGACAA TAGTTCCAAA TAAAGATCCA CGTTACATAA CTCCCATATT GCCATTATTA ATCATAATCT TATCTTTTGG TTGGTACGAA TTTGGCAAGT TCTTATTTAA AAAGCTATCA AAGATTACTT ATTTTGTTAC CTTAATAGTT ATTCCCTCTA TTATAATATC TTCACTTCCT TTAATGAGAA AAATCAGTCT TTCCAGTACA AATAGACATA AGGATTGGCA GATTGAAAAT ATAATAAATG AGGTTCGTTT AATGTCTGAG GATAATGAGA GTCTTACAGT AATTGTGGTT CCAAGTTTTG CAGAATTAAA CCAGCATAAT ATTAGTTATT ATGGAAGAAG GAACGGTACC AACATAATAG CTAGGCAGCT GGGTAAACAC TCTAGCGATA TAGAGCCATG TCTGAAACAA TGTGAATGGA TTCTTCTTGC TGAAGGTGAT ATCTCTACTG GAAATTCAGT ACGACCATCG GCTTACGCTT TAGATCATGC AGTGCGAAGC AGTAATTATT TCTATAAAAT CAAAGAATTT AATCGAGTAA AGGGTGGTCA ATATTCTCTA TGGAAAAGAA AACCGCAATT TGTTAATTAT ACTAATTTCG CGGATTCTTT CCAAGTATTA GCCAAAGGGT TAGAAGAAGG TCCTATAGGA GTAAAAAGAG TCTTTGATAT CATTGCTGTT CAACATATGA TTGATGGACA TAAAAAGTAT CAAGCAATAG TAAGAGAGCT ATCCTTGAAG AAGCTAGGCG AAAATCCTGA GGATACTAAT 
GCACATTGGA GCTTGGGCTT GCTTGCCATT CTTGAAAACA GACCAAGGGA AGCTGATTTC CATTTCTCAA AGCTTGAAAG CTTACTCCAG AGCAATCCGT GGCCAAGTGC TTATCGTTCT ATAGTAAATC TTGCAAATTG GAATCCATGG CAAGCTTATT CAATATCAAG TAATTCACTA AGGGCACATG ATGATCCTGT TCTCAAAGCA ATTAATGATA TTAGCTATAT CTTTTCTGGA GGATTATGGA GGATTACATC TACCTATAAT TCACTAAAAG AGGCAATTAC TCAAATTGAT AGCTCGACTA GTTAA
|
Protein sequence | MELFRPLYRI NKKLLFWGLI AIIWIISTYY DRTWWNNSNL VPAWDQADYL NSALSHGRAL GVLQGGAWEG FRGLISHSPK IPPLASFVNG TIIAFSGDAP KDAAWSLSIW HGLLLFSVAG IGAKLNGRFF SCISVIFVSI APALVALRLN YVLEMPLCAA TTLAIWQLGS WVKPDSDNNK NRVYIAAACC SIALLIKQSS LLILFPSVIW AIIKVVNTKE INNRQIFYFL FITFSSILPW FKHNWITVIS GTNRAVISSA INEGDPSIFN LDGWTYYFSI IPKQIGIFFF IIGLSGLIFH FIKRIKELKI YNSNEQLGKV FTADSFFFKW LVFNSIFCWF LTTIVPNKDP RYITPILPLL IIILSFGWYE FGKFLFKKLS KITYFVTLIV IPSIIISSLP LMRKISLSST NRHKDWQIEN IINEVRLMSE DNESLTVIVV PSFAELNQHN ISYYGRRNGT NIIARQLGKH SSDIEPCLKQ CEWILLAEGD ISTGNSVRPS AYALDHAVRS SNYFYKIKEF NRVKGGQYSL WKRKPQFVNY TNFADSFQVL AKGLEEGPIG VKRVFDIIAV QHMIDGHKKY QAIVRELSLK KLGENPEDTN AHWSLGLLAI LENRPREADF HFSKLESLLQ SNPWPSAYRS IVNLANWNPW QAYSISSNSL RAHDDPVLKA INDISYIFSG GLWRITSTYN SLKEAITQID SSTS
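The sequences above are displayed in space-separated blocks of 10 characters. A minimal sketch of how such a field might be cleaned before further analysis is shown here, along with a GC-fraction helper. The helper names are hypothetical, and the demonstration uses only the first two blocks of the gene sequence, so its result says nothing about the whole gene.

```python
def clean_sequence(field: str) -> str:
    """Remove all whitespace (block separators, line breaks) and upper-case the sequence."""
    return "".join(field.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First two 10-base blocks of the gene sequence, copied from the record.
sample = clean_sequence("ATGGAATTAT TTAGACCGCT")
print(len(sample))          # number of bases after removing the spaces
print(gc_fraction(sample))  # GC fraction of this 20-base sample only
```

Applying `clean_sequence` to the full Gene sequence and Protein sequence fields would allow their lengths to be compared with the Gene Length and Protein Length values listed in the record.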