Gene Information |
Locus tag | P9301_13781 |
Symbol | |
ID | 4912344 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9301 |
Kingdom | Bacteria |
Replicon accession | NC_009091 |
Strand | - |
Start bp | 1146768 |
End bp | 1148573 |
Gene Length | 1806 bp |
Protein Length | 601 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 640160968 |
Product | glycosyltransferase |
Protein accession | YP_001091602 |
Protein GI | 126696716 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family |
TIGRFAM ID | |
|
|
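The coordinate and length fields in the table above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 1146768
end_bp = 1148573
gene_length_bp = 1806
protein_length_aa = 601

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: one codon per residue plus one terminal stop codon.
codon_count = gene_length_bp // 3
residue_count = codon_count - 1

print(span_bp)        # 1806
print(residue_count)  # 601
```

Both results agree with the Gene Length and Protein Length fields, which is consistent with the two assumptions stated above.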
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.938605 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTCTTC TTAACTCAAA AAAAAGGCTT ATAACCTTAC TGATAGTTTT AGTTTGTGGG ATCATTATAT TTTTTCTAGG TTTAGGTACT ACAGGACTGG TGGATGAAAC GCCCCCTTTA TTTGCTGCTG CGGCAAGGGC GATGAGTGAA TCTGGTGACT GGATAACTCC AAAGGTTAAT GGAATGTTCC GTTTTGATAA GCCACCATTA ATATATTGGC TTATGGGTTT TTTTTACTCA TTGCCGAAAA ACGAGATCTG GGATAGTTAC GGGACACTCT CAGCAAGACT TCCTTCAGCT TTGGGATCAT TATTTTTAAT GTTGATGATT GGAGATACTT TGTTTTGTTG GCCACAGAAG AGTGATAGGC AATTCCTCAC TCCAATAGTT GCATCATTAG GCTTTGCTTT GTCTCCACTG ATAATTATAT GGAGTAGAAC TGCTGTGAGT GATGCCCTTT TAACTGGAAC CTTGGGGATA AGCCTGCTTT TGTTTTGGAG AAGAATGGCA AGTGAAAATA ATGACCAATG TATTTCAGCG TGGGTATTTT TAGGTTTTGC AATTTTAACT AAAGGACCTG TTGCATTCGT TTTGGCATTA TTAACTATTA CATCTTTTTT GTTTAGTCAG AAGAATTGGA AAACGTTGCT CTGCAAGATA AATCCTAAGA AAGGTTTTTT AATAACAATT CTTATAAGTG TTCCATGGTA TTTATTAGAA CTAGTAAAAG AGGGAAAGCC TTTTTGGGAC AATTTTTTTG GTTACCATAA TTTTCAAAGA TATACCTCAG TTGTAAATAA TCATGCCGAA CCATTCTGGT TTTTTCTTTA CATAATGATA TTGGCTTCAT TACCATTCAC GCCTTTTTTG TTTCACGGAA TATTCAAAGC CTTTGAGGAT TTCTTGAAAA GTTCAAAACA AAGTTGCAAT GTCGCTGAGA CCCTTTATAT CTATTCTCTA TGTTGGCTAA TATCAGTTTT GATTTTCTTT AGCCTTTCTG CTACAAAACT ACCAAGCTAT TGGTTGCCAG CGATTCCAGC AGCGGCAATC TTAATTACTA ATAGCTTTAT AAACTTAAAA AATTCAAGTA AAAGCTATCT ATTTTTATGG AATTTTAATA TTTTAATTTT CTTCGGTGTT ACGATGGCAT TCTTTTTCTC AAATAATTGG TTAAGCTCAA TAAATGATCC CGAAATGCCT AATCTCGCAT CTGAACTAAT AAGTTCTGGG ATAATTTTTA AAGCTAAATT ATTCTTCTCT TTATTTACAC TACTTGCAAT AATTTTATTT TCTTTAAAGT CCAGAAATAT CCTTCTTTAT CTTCAAATTT TACTTTTAAT TGGACAATCC TCTTTGATGC CGCCAATAAG AAAATTAGCA GATACTTCTA GGCAATTACC TTTGAGAAAT ATCTCAAAAT TAATTTTAGA TATTCGCGAG AGGAGGGAAA CCTTAGCAAT GATCGGTATA AGAAAGCCTT CATTGCATTT TTACTCCAGG CAAATAGTTT TTTATGAACC AAGTACTGAA GAGGGATTAA TTAATCTTTC AGACAGGCTA AATACTGATA GGCGAGAAAA TTATGAGGAT CAACCTGATT ATGAATACAA ATCTCTATTG GTCGTCATAG ATGAATACTC TTCAGGCCGA CACCAATGGT CAAAAATTAA TCATCAAAAA TTGGGTAAAT TTGGGATTTA TAATTTATGG CGAATTCAAA AAAGTGATTT AAATAATTAT TCGAAATTTT TAGAGAATAG TGGTTATAAA TCCGACTGGA AAAATAGGAA AGTTGAAAAA 
TTTTAA
|
Protein sequence | MILLNSKKRL ITLLIVLVCG IIIFFLGLGT TGLVDETPPL FAAAARAMSE SGDWITPKVN GMFRFDKPPL IYWLMGFFYS LPKNEIWDSY GTLSARLPSA LGSLFLMLMI GDTLFCWPQK SDRQFLTPIV ASLGFALSPL IIIWSRTAVS DALLTGTLGI SLLLFWRRMA SENNDQCISA WVFLGFAILT KGPVAFVLAL LTITSFLFSQ KNWKTLLCKI NPKKGFLITI LISVPWYLLE LVKEGKPFWD NFFGYHNFQR YTSVVNNHAE PFWFFLYIMI LASLPFTPFL FHGIFKAFED FLKSSKQSCN VAETLYIYSL CWLISVLIFF SLSATKLPSY WLPAIPAAAI LITNSFINLK NSSKSYLFLW NFNILIFFGV TMAFFFSNNW LSSINDPEMP NLASELISSG IIFKAKLFFS LFTLLAIILF SLKSRNILLY LQILLLIGQS SLMPPIRKLA DTSRQLPLRN ISKLILDIRE RRETLAMIGI RKPSLHFYSR QIVFYEPSTE EGLINLSDRL NTDRRENYED QPDYEYKSLL VVIDEYSSGR HQWSKINHQK LGKFGIYNLW RIQKSDLNNY SKFLENSGYK SDWKNRKVEK F
|
| |
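The sequences above are printed in 10-character blocks separated by spaces, and the gene sequence wraps onto a second line. A minimal sketch of how such a listing could be normalized before analysis follows; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first three blocks of the gene sequence are used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three 10-base blocks of the gene sequence, used here as sample input.
first_blocks = "ATGATTCTTC TTAACTCAAA AAAAAGGCTT"
seq = clean_sequence(first_blocks)

print(len(seq))   # 30
print(seq[:3])    # ATG
print(round(gc_fraction(seq), 3))
```

Applied to the full gene sequence, the same helpers could be used to compare the cleaned length with the Gene Length field and the GC fraction with the GC content field.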