Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9211_14261 |
Symbol | |
ID | 5731027 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | - |
Start bp | 1286369 |
End bp | 1288036 |
Gene Length | 1668 bp |
Protein Length | 555 aa |
Translation table | 11 |
GC content | 31% |
IMG OID | 641285803 |
Product | glycosyltransferase |
Protein accession | YP_001551311 |
Protein GI | 159903967 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.259667 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.00324745 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGATTAAAA AGAGCTTAAG TAGCAGAATT GAGAATATTT ATAGTCCTAT ATGCTCATAT ATGAAAAAAC TTTATGCGGA GAAATATTTT TCAATATTGG TCATTATATG TATTTGTGGA TTTGTTGGAT TTGTTTGCCT TTTCAATGAA AGCACACAAA GTTTTTTTGC TCACGATGAA GGTCTTTATG CAAGAAGAGC AAAGTTGATA TTGGATACAG GAGATTGGTT TGCACCTTTT TCTAAAGCTC ACCATAAGAC AATTGGTAGT TATTGGTTAA TAGCATTAAG CATGAAGCTT TTTGGCTTAG GTGAATACTC AGCAAGAATA CCAAGTGCTT TGTTTTCGGT TCTTAGTTCA ATAGTTGTTT ATAAGATAGG CTTAGAAATA TCAAATAAAA GATCAGCATT TATATCGGCT TTAATACTTC CATCTATGCC GCTTTGGTTT CAATATAGTC ATTACGCCAG CCCTGACATG GCGTTTGTTT TTTTAAATTT ATTTGCTATT TATATGATTC TAAGAGCAAG TAGTGAAAAT AACAGACAAT CTAACCATCA ATTCTTTTAT TGGTTTTTAA CAGGCATTTC CTTCTCTCTT GCTTTCCTGA TAAGAAGTTT TTTAGCTCTA CTACCGATTT TTGCTTTACT GCCTTTTATA TGCCTATCTT TAAGACAGCA GGGTAGAAAA AAAGTTTATT TCTTATTGGG GGGGATGATA ATAGGCTTTA TACCCGTCAT AGTCAGCATT TTCTATGCAT ACAATGCATA TGGATCTGAG GCTTTTCTGG AGTTATTTGA CTTTGCTAGA AGAAAGTCTA TGGGAGGGAA TTTGTTTAAG GGATTATTCT ATTACCCTAT TATTCTCATT ATTCTTTCTT ATCCATCAGG ATTGATCAGT ATATTTGGCT TTATCAGGGT AAATCAATAC AAGAACTTAA AGCTCAGATA CCTCTTGTCT ATTTTCCCGT TGGTCATATT ATTTGCCTTA ATGGTTGCAT CGACGGCTTT AAGTCATTAT GCCTTGATGC TAATACCTTG GATTGCTATA GCATCAGGAA TTGCAATTGA TTCATTGATT TCATCAGAGC TTCTTAAATC ATATCAATTT AAGAAGCTTT GCGCGTACAT ATTTTTATTA ATAGGATTAT CTCTGATGGT AATTCTTTTA TTTAAAATTA CAGGACTTGT ATCAATAGAT GTGCTTGATA AGCCAATAAT AATAAGCTCT TTCTTTGTTG TATCATTAGT TAATATATCT GCTGGATTAG TTGGAATTAG GGGATTAAAT AATCATAGAA ACTTTTCAAT TTCAATTGGT TTGATGGTAA TTACACAAAC TATTCTTTTA ACTATACTTT ATGGAATTGG AATCTTGGGT AATCCTAATC AAGAGATAAA AACGTTTGTT CGAGAGCCAT TCGTAAATGA AATACTACGT TCAAATACTG TTTATCTTAT AGGTGTGAAT AGAAATACTA AAGTTAGAAC TTTAATGGAA TTTTATCTGC CTAATTATCA GGATTATCAA AAGTCACTTG ATCAAGTCAA GGGAAACTCC TATTTTATGG TTAGCAAGGA TGCTCTTTTG GAACTCTTAA AATTTAAAAA ATATAGTATT AATAAAATAG CGAAGTACAA GGAATTCTTT TTTATAAGAG TTAATTAA
|
Protein sequence | MIKKSLSSRI ENIYSPICSY MKKLYAEKYF SILVIICICG FVGFVCLFNE STQSFFAHDE GLYARRAKLI LDTGDWFAPF SKAHHKTIGS YWLIALSMKL FGLGEYSARI PSALFSVLSS IVVYKIGLEI SNKRSAFISA LILPSMPLWF QYSHYASPDM AFVFLNLFAI YMILRASSEN NRQSNHQFFY WFLTGISFSL AFLIRSFLAL LPIFALLPFI CLSLRQQGRK KVYFLLGGMI IGFIPVIVSI FYAYNAYGSE AFLELFDFAR RKSMGGNLFK GLFYYPIILI ILSYPSGLIS IFGFIRVNQY KNLKLRYLLS IFPLVILFAL MVASTALSHY ALMLIPWIAI ASGIAIDSLI SSELLKSYQF KKLCAYIFLL IGLSLMVILL FKITGLVSID VLDKPIIISS FFVVSLVNIS AGLVGIRGLN NHRNFSISIG LMVITQTILL TILYGIGILG NPNQEIKTFV REPFVNEILR SNTVYLIGVN RNTKVRTLME FYLPNYQDYQ KSLDQVKGNS YFMVSKDALL ELLKFKKYSI NKIAKYKEFF FIRVN
|
| |