Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_17181 |
Symbol | |
ID | 4777430 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 1498642 |
End bp | 1499865 |
Gene Length | 1224 bp |
Protein Length | 407 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640087227 |
Product | glycosyl transferase family protein |
Protein accession | YP_001017727 |
Protein GI | 124023420 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.793323 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGTTGATCC TTACTTGGAT CTTGATTGTC TTTGCTGCAG GAGCAGCACT CGGCTTGTTA CTGCTGCTGT TCAGTCTTGT ACGTGTGTTC CAAAATGCTC CAACCTTAAA TTCCAAGCAG TTTCAGCCTG GGCAGCTACC TACTAGCAAG GTGGAAACCC TCAAGAAAAC TGAGCTCACA GTGGTTGTTC CTGCTTATAA CGAGGCCACC AACATCAAGG TGTGCCTGAG CAGCATTTTG GCCAGTGATC CACCCTGCAA TAACTGGAGA GTGTTGTTGG TTGATGACGG GAGTACTGAT GAAACCGTTC AAGTTGCAAG CGATGTGGCC TCAGTCTTAA AGCTCGAGGA AGGTCGCTTT ACAATTTTCA ACGCCGGACC AAGACCCTTG GCGCAGCGTT GGGTAGGCAA GAACTGGGCC TGCTCGCGCG CAATGGAGCT TGTGAGTAGC ACCTGGGTGC TGTTTGTAGA TGCAGATGTA GAGCTTCACC CAGCAACCCT CAAGCGTGCA CTGAATCAGG CGATTGAGGA AGAAGCAGAT CTACTGAGCC TGGTACCACG TATTAACTGC AGCTGCTCCG CTGAATGGAT GGTGCAACCG ATCATGGCCT GTCTTTTGGC TGTAGGTTTT CCGATTAAAG CCGCCAATGA TCCAGCTGAA TCAACAGCAT TTGCCGCTGG ACCATTTATG TTATTTCGCC GATCAACGTA TGAGGAGATT GGCGGTCATC GCGCACTGGC TGATGTAGTG ATAGAGGATC TGGCTCTAGC TCGTGAAGTG AAAGGCGGGG GATTTCGACT TCGTTATCTG CTGGGGTTAG ATGCATTGCA ATTGCAGATG TACGACAATT TTCCTGCGCT TTGGGAAGGC TGGAGCAAAA ATTGGTTTCT AGGCCTAGAT AGCAGTATCG TCAAAGCAAT CGGAGCATCA GCACTAGTGT TCTGGATGTT CACAGGCCCC TGGCTAGTCT TGTTTCTGAT CATAGTTAGC CTGCTCTGGA TCCCTTTTTA TGGGGCACAA TTAGTTGCCT TTACTTTTTC GGTAATTGGT GTATTGCTTC AGTTCATTCT ACGTCTTTGG ACACGTCAGA AATTCGAGGT GCCGCTAACG AATTGGTGGT TGATGAGTGC AGGTGGCATC CTTATTGGCT TACTTGGGCC AACTTCAGTT TGGAGGACAC TGACTGGACG CGATTGGACC TGGAAAGGCC GTTCCTTGGC TTGA
|
Protein sequence | MLILTWILIV FAAGAALGLL LLLFSLVRVF QNAPTLNSKQ FQPGQLPTSK VETLKKTELT VVVPAYNEAT NIKVCLSSIL ASDPPCNNWR VLLVDDGSTD ETVQVASDVA SVLKLEEGRF TIFNAGPRPL AQRWVGKNWA CSRAMELVSS TWVLFVDADV ELHPATLKRA LNQAIEEEAD LLSLVPRINC SCSAEWMVQP IMACLLAVGF PIKAANDPAE STAFAAGPFM LFRRSTYEEI GGHRALADVV IEDLALAREV KGGGFRLRYL LGLDALQLQM YDNFPALWEG WSKNWFLGLD SSIVKAIGAS ALVFWMFTGP WLVLFLIIVS LLWIPFYGAQ LVAFTFSVIG VLLQFILRLW TRQKFEVPLT NWWLMSAGGI LIGLLGPTSV WRTLTGRDWT WKGRSLA
|
| |