Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9211_12611 |
Symbol | |
ID | 5731230 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | + |
Start bp | 1135791 |
End bp | 1137329 |
Gene Length | 1539 bp |
Protein Length | 512 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 641285630 |
Product | glycosyltransferase |
Protein accession | YP_001551146 |
Protein GI | 159903802 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.00539614 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCGGAAAC TTAAATTTAA TCAATATCCA AGATTTATTG CATTGCCTTT ATTAATTGGT TTCATATTAA GAACTATTCT ACTAAATTCG CCAATTGTAG GAGTGCATAG TTGGCGGCAA GCTGATACAG CTGCAATGGC TAGACACTTT TATTTAAATA ATACTCCTAT ATGGCTTCCG CAAATAGATT GGGCTGGCAA TGGAGAAGGG TTTGTAGAAT CAGAGTTCCC TATATTCCCA TATTTAACTG CTCAACTCTA CAAATTATTT GGTATCCATG AATGGATAGG AAGAAGTTTT TCAGTAATAT TTAGCCTACT AACTATTTTG TTAATAATTA GAATTGGTGA GCTTTTAATT AATAAAGAAG CTGGTTGGTG GGGTGGAATA TTATTTTCAA TACTGCCAAT GAGTGTTTAT TATGGACGAG CATTTCAAGC TGAGTCCCTA TTACTTCTAT TATCAGCATT AAGCCTAGAA AGGTTAATTG AATCAATAAG ATCTTATAAA TATACAAATT TACTAATTAG CTGGGCTGCA TTTGTTTTAG CATGTCTAAT AAAAGTCCTA CCTTTGGTTT GGCTGGGTTT GCCTTTAATT ACTACTATAA TTTTAGCAAG AAAAGAGCGT GATCAAAAGA ATAGCTTTAT AGGCTTAACA AAAAGATTGG CATCTTCTCA AATCATTTAT ATTTATATGC TTGGTGCATT ATTAATTTTA TATTTATGGT ATTCCCATGC TTACGAAATA GGAAAATTAA GTGGCCTTAG TTTTGGTTTT TGGGAAGACT CTGATAGAAG TAGTTTTAGT ATGTTATTTA ACACTCAAAT GTGGGGTAAC CTTATATTAA GGACAAGTAT TAGATGCTTT GCAATACTTG GTTTACCTCT TCTTTTTATT GGAATAAGGA CTACTTACAA AGGGTATGGT GGCAAAATCC TAATCTCAGG CATTATAGGA ATACTTTTAA CATTATTAAT CTCAATCAGG TCGAGCAGCA TACATGAATA TTATCAACTT CCATTATTAA TATTTAGCTG CCCATTTATG GGTCAAGGAG TCCAAGTATT AATTCATGCA ATTAAGAAAG AAAAGTTACA CAGAAAAGTA CTTAGGATTA CATTAATATT TATTGGTTTG ATAAGTATTA ATATTTTAGC ATTTGATTAT TATTTATTAG AAAAACGTCA GACGAAGATA TGGCTACCTC TTGCGTTAGA AATTAGGAAC AATGTACCTT TAAATGAAAG AATTGTAAGT GTTACCAACC ATGACCCTAC TTTATTAAAT CTAGGCAGAA GGCAAGGATG GTTAGTATCA GCAAGCAGCA TCAATAAAAA TAATATTGAG CTGTGGTTTA AAGAAGGCGC AAGTTTTATA GTTGGTAGTC TAAATTGGGC TGAGACTTAC GCCACTCTTC CAGAAGGAAC TACAAAAAAG AATTTAAATG ACTTACTTTG CAATGCGCAT GTATCCGAAT CATGCCCTAA GCCACCTAAT TTTACTTATA TAATTCCTAT AAAAAATCTA ATCAATTAG
|
Protein sequence | MRKLKFNQYP RFIALPLLIG FILRTILLNS PIVGVHSWRQ ADTAAMARHF YLNNTPIWLP QIDWAGNGEG FVESEFPIFP YLTAQLYKLF GIHEWIGRSF SVIFSLLTIL LIIRIGELLI NKEAGWWGGI LFSILPMSVY YGRAFQAESL LLLLSALSLE RLIESIRSYK YTNLLISWAA FVLACLIKVL PLVWLGLPLI TTIILARKER DQKNSFIGLT KRLASSQIIY IYMLGALLIL YLWYSHAYEI GKLSGLSFGF WEDSDRSSFS MLFNTQMWGN LILRTSIRCF AILGLPLLFI GIRTTYKGYG GKILISGIIG ILLTLLISIR SSSIHEYYQL PLLIFSCPFM GQGVQVLIHA IKKEKLHRKV LRITLIFIGL ISINILAFDY YLLEKRQTKI WLPLALEIRN NVPLNERIVS VTNHDPTLLN LGRRQGWLVS ASSINKNNIE LWFKEGASFI VGSLNWAETY ATLPEGTTKK NLNDLLCNAH VSESCPKPPN FTYIIPIKNL IN
|
| |