Gene Information |
Locus tag | A9601_12261 |
Symbol | |
ID | 4717941 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. AS9601 |
Kingdom | Bacteria |
Replicon accession | NC_008816 |
Strand | - |
Start bp | 1043975 |
End bp | 1045819 |
Gene Length | 1845 bp |
Protein Length | 614 aa |
Translation table | 11 |
GC content | 25% |
IMG OID | 640078943 |
Product | TPR repeat-containing sulfotransferase |
Protein accession | YP_001009617 |
Protein GI | 123968759 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
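The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only. It assumes that the start and end positions are 1-based and inclusive, and that the gene length counts the terminating stop codon (the gene sequence in this record does end in TGA). The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1043975
END_BP = 1045819
GENE_LENGTH_BP = 1845
PROTEIN_LENGTH_AA = 614


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    # Both comparisons should agree with the values listed in the record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 1045819 - 1043975 + 1 gives 1845 bp, and 1845 / 3 gives 615 codons, that is 614 residues plus one stop codon.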
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAAAAAAA TTGACTTATT AAAAAAAATT GAAGAAGCAA AAATAAAACA TAAAGCAGGG AACTCTTTAG AGGCAAATCA GATATTTCAA GAGTTATTAA AATCAAATAA TGATTCTTTT GATTTACTTT ACGCTTATGG GTTGTTTTGT AGAGATTTAA AAAATTTTAA TTTAGCAAAA AGAGTATTTC TCAATCTAAT TAATAAATTC TCATCATCAA TTAATTCTTA TATTTTATTA GCTGAAATAT TAAAAATTGA GAACAAATTC AATGATGCAG AAAGGGTACT GCAAAAGGCA ATAAAAATTA ATCCTAATCA TGGAGATTTA CTTTATAATC TTTCTCTTTT GTACTTTACA TTGAGAAACT TTGATTATGC ACTAGATTAT ATAGATAAAG CTATTAAAGT ATCGATAAAT AATGATATTT ATAAACTTTT AAAGTCTGAG ATTTATATCA ATAAATTCAA TATTGATGAA GCATTGTATA TCTTGGAAAA TCTAAATAAT AAAAATAGAA TTAAAAAAGA TAAAAATAAA GAAATAAGAA TAAATATTCT TCTAGCCAAT GCATTCCTAA AAAAAAGGAA GTACGAAGAA GCAGAAACAA TTCTTTTAAA ATTGACCAAA AAATATCAAG GATTGGAATT GGCTTATTTA AATTTAAGTA TTCTGTATAA GGATAAGAAT CAATTAAGTA AAAGTATACA AATACTAAAA AAGGGAATAA ACCTATCTCC CAATTTCATG CCTTTTTATA AAAATTTAGC AAGTTTCTAT AGAAATTCAG GACAGCTTAA ACTTGCTATT GAGACTAACT TATTTATTAT TTCTAGAAAT AAATTTGACT TCAATAGTTT TTATGAATTA TCTGGGATTT ATGATTTTAA GAATCATAAA AATGAATTAG ATTTTTTATT AAATACTAAA CTTGAGAATC TTAATCCAAA CTCAAAGATA TACGCAGCTT TTGCAATCTC AAATTTGCTG CACAAACAAG GAAAATTTAA AGAAAGTGCA AAATATCTAA AAATCGCCAA TGACGAAGGC ATGAAGTATA AAAAATCTGA CTCAAGTTTG AAGATTAAAC ATACTGAATC TTATAGATCA CTAAAAATCA AAAAATCAAA AAATAAATAT TTGAAGAATT CTTCTAATTA TGTCTTTATT GTTGGCATGC CAAGATCAGG AAGTACTTTA CTGGAAAACA TATTAAGTTT AAATTCTGAA GTAACTGATA TGGGCGAGGT TAGCTTTTTA GAGGAATCCA TCAAGGAAGC TAAAGATTTT GAAGAAATAT ATGATTTATA TGAAAAAAAA GTTATTAATC AATTTAAATC CGCTACCTTT TACACCGATA AAAGTTTATT TAATTATATG TATATTGCCA TTATTTCTAA TTTTTTTCCT AAAGCAAAAA TAATAAATTG CATAAGAAAC CCTCTCGATA ATATTTTATC CATTTATAGA GCAAACTTTT TAAATCAGTC ATTCTCTTTC TCTTTATCTG ATATTTCTTG TTTATATAAA CACTATTTTG AAACTATGGA GGAATATAAA ATTAAATATG GTGTAAATAT TTATGATTAT TACTATGAAG ACTTAATTGA AAATCCCAAT AATGTAATAC CTAGGATAAT AAATTGGCTT GGTTGGGATT GGGACGAAAA ATATCTTTCT CCCCATCAAA ACAAAAGAAA TGTACACACC GCAAGTAGCG CTCAAATAAG AAAGAAATTT TATTCTTCTT CTATAGGAAT TTGGAAAGAA TATAAGGAAC TTTTGGAACC TGCAATAGAA 
ATTATTAAAA CAAATAAACT TCTTGCAGAA AAGATTTCTA GGTGA
|
Protein sequence | MKKIDLLKKI EEAKIKHKAG NSLEANQIFQ ELLKSNNDSF DLLYAYGLFC RDLKNFNLAK RVFLNLINKF SSSINSYILL AEILKIENKF NDAERVLQKA IKINPNHGDL LYNLSLLYFT LRNFDYALDY IDKAIKVSIN NDIYKLLKSE IYINKFNIDE ALYILENLNN KNRIKKDKNK EIRINILLAN AFLKKRKYEE AETILLKLTK KYQGLELAYL NLSILYKDKN QLSKSIQILK KGINLSPNFM PFYKNLASFY RNSGQLKLAI ETNLFIISRN KFDFNSFYEL SGIYDFKNHK NELDFLLNTK LENLNPNSKI YAAFAISNLL HKQGKFKESA KYLKIANDEG MKYKKSDSSL KIKHTESYRS LKIKKSKNKY LKNSSNYVFI VGMPRSGSTL LENILSLNSE VTDMGEVSFL EESIKEAKDF EEIYDLYEKK VINQFKSATF YTDKSLFNYM YIAIISNFFP KAKIINCIRN PLDNILSIYR ANFLNQSFSF SLSDISCLYK HYFETMEEYK IKYGVNIYDY YYEDLIENPN NVIPRIINWL GWDWDEKYLS PHQNKRNVHT ASSAQIRKKF YSSSIGIWKE YKELLEPAIE IIKTNKLLAE KISR
|
| |
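The gene sequence above is printed in blocks of ten bases and begins with TTG rather than ATG, while the protein begins with M. Under translation table 11, TTG is an accepted initiation codon, and an initiation codon is read as methionine. The sketch below is a minimal check of the first and last codons. The start and stop codon sets are an assumption taken from the standard definition of table 11 and are not stated in this record. The helper names are hypothetical, and only the first and last blocks of the sequence are used.

```python
# Initiation and termination codons for NCBI translation table 11
# (assumed from the standard table definition, not given in the record).
TABLE11_STARTS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}
TABLE11_STOPS = {"TAA", "TAG", "TGA"}


def ungroup(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases for display."""
    return "".join(seq.split()).upper()


def has_valid_start(seq: str) -> bool:
    """True if the first three bases are a table 11 initiation codon."""
    return ungroup(seq)[:3] in TABLE11_STARTS


def has_valid_stop(seq: str) -> bool:
    """True if the last three bases are a table 11 stop codon."""
    return ungroup(seq)[-3:] in TABLE11_STOPS


if __name__ == "__main__":
    head = "TTGAAAAAAA TTGACTTATT"  # first two blocks of the gene sequence
    tail = "AAGATTTCTA GGTGA"       # last two blocks of the gene sequence
    print(ungroup(head)[:3], has_valid_start(head))
    print(ungroup(tail)[-3:], has_valid_stop(tail))
```

The same `ungroup` helper can be applied to the full gene sequence before passing it to a translation tool, since most tools expect a sequence without embedded spaces.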