Gene Information |
Locus tag | A9601_18681 |
Symbol | |
ID | 4718606 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. AS9601 |
Kingdom | Bacteria |
Replicon accession | NC_008816 |
Strand | - |
Start bp | 1603446 |
End bp | 1604747 |
Gene Length | 1302 bp |
Protein Length | 433 aa |
Translation table | 11 |
GC content | 31% |
IMG OID | 640079602 |
Product | glycosyl transferase family protein |
Protein accession | YP_001010258 |
Protein GI | 123969400 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis |
TIGRFAM ID | |
|
|
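The coordinate and length fields above are arithmetically linked, and a short check makes the relationship explicit. This is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the protein length excludes a single terminal stop codon. The record does not state either convention, and the function names are hypothetical.

```python
# Values copied from the Gene Information table.
START_BP = 1603446
END_BP = 1604747


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1302
print(protein_length(length_bp))  # 433
```

Under these assumptions the results agree with the listed Gene Length (1302 bp) and Protein Length (433 aa).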
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTAAGG GTTTTTATAA AAATCGAAGA TTGAAGTCGT TTATATTTCT TAGTGCTTGT TTTTTAGTAG CTTTTATTCC TCACGCTAAC AATATCGAAA ATTTCTTTTA TATAATGTTG ACTCTTTCTT TTGTGATTGT TTTTTACGGT TTAATAGTTA TTTCGAGAAA TTTCAAAAGG AACAACGTTT TAAATACTTT AAGCAGAAGA ATTAGCAATA AAGAGTTACC TGTGCTTGAT ATTTTAGTCG CAGCTAGAGA TGAAGAGAAT GTCATATCAA GATTAGTTGA AAGATTATTT AGTTTAGATT ATCCAACAAA TAAATTAAAT ATTCATATAA TCGATGATGG TAGTTCTGAT AAGACGCCTT TAATTTTAGA TCGATTATCA AGACAATATG AAAAGCTAAA AGTCATAAGT CGTTCTCCAA ATGCAGGAGG AGGAAAGTCA GGAGCTTTGA ATTATGCCTT GAAATTTACC CATGGTGAAT GGTTACTAGT TTTGGATGCT GATGCTGAAT TAAAAAAAGA TTCTTTGATA AGGTTATTTA GTTTTGTAGA AGAGGGTGAT TGGTCTGCAG TTCAACTAAG AAAATCAGTA ACAAATGTAA GTAAGAATTT TTTAACTTCC TGTCAGTCAA TGGAGATGGC TATGGATGCA ATCTTTCAAT ATGGAAGATT ATCAGTTGCT GGAGTTTCCG AATTAAGGGG AAATGGTCAA TTAATTAAGA AAGAAACATT ATTAGCATGT GGTTCTTTTA ATGAAGATAC AGTTACAGAT GATCTTGATT TGAGTTTAAG ATTATTATTA TCAAAATCTA GAATTGGAAT CTTATGGGAT CCTCCAGTCA TGGAGGAGGC AGTTGAGAAT TTAAATGCTT TATTAGCCCA AAGGCAAAGA TGGGCAGAGG GGGGGTTGCA AAGATTCTTC GATTATGGAG ATCAATTATT TACTAATAAA ATTGATTATT TGCAGAAATT TGATTTAACT TACTTTTTCA TCTTGCAATA TGCATTACCA ACCATTTCTA TTTTTGATTT AGTTTTCAGT ATTGCTTTTT TAGATTCACC AATTTACTGG CCTATTTCAT TTACAGCTTT TATGTTATCT GGAATTGCTT TTTGGTACGG TTCTTCTTGT AAAAGTGAAG TACCTGTATT GCAAAAAAGC AATTTTTTGA TGGTATTTGT ATCGGTTTTT TATTTATCAC ATTGGTTTTT AGTAATCCCT TGGGTAACTA TAAAGATGTC TATTTTTCCT AAAAAGATAC TCTGGCGAAA GACTCTTCAT ACTGGAGTTT AA
|
Protein sequence | MSKGFYKNRR LKSFIFLSAC FLVAFIPHAN NIENFFYIML TLSFVIVFYG LIVISRNFKR NNVLNTLSRR ISNKELPVLD ILVAARDEEN VISRLVERLF SLDYPTNKLN IHIIDDGSSD KTPLILDRLS RQYEKLKVIS RSPNAGGGKS GALNYALKFT HGEWLLVLDA DAELKKDSLI RLFSFVEEGD WSAVQLRKSV TNVSKNFLTS CQSMEMAMDA IFQYGRLSVA GVSELRGNGQ LIKKETLLAC GSFNEDTVTD DLDLSLRLLL SKSRIGILWD PPVMEEAVEN LNALLAQRQR WAEGGLQRFF DYGDQLFTNK IDYLQKFDLT YFFILQYALP TISIFDLVFS IAFLDSPIYW PISFTAFMLS GIAFWYGSSC KSEVPVLQKS NFLMVFVSVF YLSHWFLVIP WVTIKMSIFP KKILWRKTLH TGV
|
| |
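The gene sequence above begins with ATG and ends with TAA, so it appears to be given in coding orientation even though the gene lies on the minus strand. The sketch below shows one way to translate such a sequence and to compute its GC percentage. It assumes the amino acid assignments of NCBI translation table 11 (the table named in the record), strips the spaces between the 10-letter blocks, and stops at the first stop codon. The helper names are hypothetical, and alternative start codons are not handled because this sequence starts with ATG.

```python
BASES = "TCAG"
# Amino acids for NCBI translation table 11, listed in TCAG codon order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove whitespace between sequence blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence, copied from the record.
first_blocks = "ATGAGTAAGG GTTTTTATAA AAATCGAAGA"
print(translate(first_blocks))  # MSKGFYKNRR
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow a comparison with the Protein sequence and GC content fields.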