Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9211_12641 |
Symbol | |
ID | 5731414 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | - |
Start bp | 1139159 |
End bp | 1140973 |
Gene Length | 1815 bp |
Protein Length | 604 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 641285633 |
Product | glycosyltransferase |
Protein accession | YP_001551149 |
Protein GI | 159903805 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.547801 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.00232598 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | TTGATCAATA GAGCTAAATA TCAAAGATGC CTTTGTTTAA TACTTTTTTT AGGTGTTTTG ATTTTTTGCT GGCAACTAGG TGCAACGGGC TTGGTTGATG AAACACCGCC ACTTTTTGCA GCGGCAAGTC GTGCAATGAG TAGAACTGGA GACTGGCTAA CCCCAAGAGT TAATGGATTA CCTCGATTTG ATAAGCCCCC TCTTGTCTAT TGGCTTATGG GTATCTTTTA TTCAGTGCCA GGGCAAAACA TTTGGGACCC TTTAGGGACG TGGTCCGCTC GGCTCCCTTC TGCTCTCTCA TCTTTAGTAA TGATGTTGAT GCTTGGGGAT ACATTATTGA GATGGCCACC TAAGGGTATT GTTTCACCTA GAAGAGCTGC TGTAGTTGCG GCTTTGGCTT TTGCGCTCTC TCCTTTAGTA CTTATTTGGA GTCGAATTGC TGTTAGTGAT GCCCTTCTAT GTTCAACATT AGGAATAAGT TTAATTCTTA GTTGGCGATG TTATGCCAAT CCAGAAAACA ATCAATGGTA CTGGTCTTGG ATATTTCTAG GACTTGCTGT CCTTACTAAA GGCCCTGTGG CAATTGTGCT AACGGCAATG ACATTTATTT GTTTTTCTAT TATACAACGA GACTATTTAA TATTTTTTAA GAGGGTTAAA CCTATTAGAG GAATCTTTCT GAGCTTATCA ATAAGTATTC CTTGGTACTT AATGGAATTA ATTAAAGAAG GAAAACCTTT TTGGGATAGT TTCTTTGGAT ATCATAATTT TCAGAGATTA ACCTCAGTTG TTAATAGTCA TTCTCAGCCT TGGTGGTTTT TCTTGCTGAT TTTGGTTCTT TCCTCTTTAC CATTTACACC TTTTTTAATT CTTGGACTTG GAGAAGTAAT ATCTTCTTCT TTGAAGGATA AGAATCAGAT GTTGAACCCT CCAAGCCAGT CTCTTTTGAA TTTTTCTGCA AGTTGGTTGA TTTCTGTTTT ATTACTATTT ACTTTCGCTG CTACTAAGTT GCCTAGCTAT TGGCTTCCAG CAACACCAGC AGCAGCAATT GTCATAGGAA TATCGACACA TAATATTGAA TCTAAAAGTG GTCGATATAA ATTATCACTT GCTTTTACAT CAGCAATTTC ATTTACATTA GCCTTAATTA TATTCACTTC TAGCTTATGG ATTTATTCTA TAAATGACCC TGAGATACCA AACTTCTCTC AAGAATTTAT TTACAACAGA TTATATTTAA AAGGGTCTAT ATGTTTTGCA ATATCAGCCC TTATATGTTT TATCTTGCTT GTCAAGAATA CTCCAAATAA AATCTTTTTA AGCCAAGTAC CGCTGATTTT TTTCCAGTTA TTATTTATGC TCCCTATGTG GAATATTAGC GATAGACTTC GTCAGTTACC TTTAAGGCAA GTGTCTACTT TATTGGTTTC TTCTCAGAAC AATGATGAGC CAATAGCAAT GGTTGGTATT AACAAGCCAT CGTTACATTT CTATACTGAT AGGGTTATTC TTTTTGAAGC AAATGATGCT GAGGGTCTTG TTAACTTAGC TGATAGATTA AAATTAGAAG TTAGGCAAGG TTGGAAAGGA TCATCTCTTT CCTCTGTAAA TGGATCAAGG ACTGTTCTGG TAGTAATAGA TGACCAAACA AGTAAGTATC GGCATTGGAG AGGCTTGAAC CCGGAGATAA TTGGTAAATA CAGTATTTAT AATATTTGGA GATTAGACAG AAATAAGCTA GAGCAACGAG CTATTTCTTT AATTGATGGA GGGCAATCGC CAGACTGGCA ACTCCCAAAG CCGGAGAGGA TTTAA
|
Protein sequence | MINRAKYQRC LCLILFLGVL IFCWQLGATG LVDETPPLFA AASRAMSRTG DWLTPRVNGL PRFDKPPLVY WLMGIFYSVP GQNIWDPLGT WSARLPSALS SLVMMLMLGD TLLRWPPKGI VSPRRAAVVA ALAFALSPLV LIWSRIAVSD ALLCSTLGIS LILSWRCYAN PENNQWYWSW IFLGLAVLTK GPVAIVLTAM TFICFSIIQR DYLIFFKRVK PIRGIFLSLS ISIPWYLMEL IKEGKPFWDS FFGYHNFQRL TSVVNSHSQP WWFFLLILVL SSLPFTPFLI LGLGEVISSS LKDKNQMLNP PSQSLLNFSA SWLISVLLLF TFAATKLPSY WLPATPAAAI VIGISTHNIE SKSGRYKLSL AFTSAISFTL ALIIFTSSLW IYSINDPEIP NFSQEFIYNR LYLKGSICFA ISALICFILL VKNTPNKIFL SQVPLIFFQL LFMLPMWNIS DRLRQLPLRQ VSTLLVSSQN NDEPIAMVGI NKPSLHFYTD RVILFEANDA EGLVNLADRL KLEVRQGWKG SSLSSVNGSR TVLVVIDDQT SKYRHWRGLN PEIIGKYSIY NIWRLDRNKL EQRAISLIDG GQSPDWQLPK PERI
|
| |