Gene A9601_12261 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagA9601_12261 
Symbol 
ID4717941 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. AS9601 
KingdomBacteria 
Replicon accessionNC_008816 
Strand
Start bp1043975 
End bp1045819 
Gene Length1845 bp 
Protein Length614 aa 
Translation table11 
GC content25% 
IMG OID640078943 
ProductTPR repeat-containing sulfotransferase 
Protein accessionYP_001009617 
Protein GI123968759 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAAAAAAA TTGACTTATT AAAAAAAATT GAAGAAGCAA AAATAAAACA TAAAGCAGGG 
AACTCTTTAG AGGCAAATCA GATATTTCAA GAGTTATTAA AATCAAATAA TGATTCTTTT
GATTTACTTT ACGCTTATGG GTTGTTTTGT AGAGATTTAA AAAATTTTAA TTTAGCAAAA
AGAGTATTTC TCAATCTAAT TAATAAATTC TCATCATCAA TTAATTCTTA TATTTTATTA
GCTGAAATAT TAAAAATTGA GAACAAATTC AATGATGCAG AAAGGGTACT GCAAAAGGCA
ATAAAAATTA ATCCTAATCA TGGAGATTTA CTTTATAATC TTTCTCTTTT GTACTTTACA
TTGAGAAACT TTGATTATGC ACTAGATTAT ATAGATAAAG CTATTAAAGT ATCGATAAAT
AATGATATTT ATAAACTTTT AAAGTCTGAG ATTTATATCA ATAAATTCAA TATTGATGAA
GCATTGTATA TCTTGGAAAA TCTAAATAAT AAAAATAGAA TTAAAAAAGA TAAAAATAAA
GAAATAAGAA TAAATATTCT TCTAGCCAAT GCATTCCTAA AAAAAAGGAA GTACGAAGAA
GCAGAAACAA TTCTTTTAAA ATTGACCAAA AAATATCAAG GATTGGAATT GGCTTATTTA
AATTTAAGTA TTCTGTATAA GGATAAGAAT CAATTAAGTA AAAGTATACA AATACTAAAA
AAGGGAATAA ACCTATCTCC CAATTTCATG CCTTTTTATA AAAATTTAGC AAGTTTCTAT
AGAAATTCAG GACAGCTTAA ACTTGCTATT GAGACTAACT TATTTATTAT TTCTAGAAAT
AAATTTGACT TCAATAGTTT TTATGAATTA TCTGGGATTT ATGATTTTAA GAATCATAAA
AATGAATTAG ATTTTTTATT AAATACTAAA CTTGAGAATC TTAATCCAAA CTCAAAGATA
TACGCAGCTT TTGCAATCTC AAATTTGCTG CACAAACAAG GAAAATTTAA AGAAAGTGCA
AAATATCTAA AAATCGCCAA TGACGAAGGC ATGAAGTATA AAAAATCTGA CTCAAGTTTG
AAGATTAAAC ATACTGAATC TTATAGATCA CTAAAAATCA AAAAATCAAA AAATAAATAT
TTGAAGAATT CTTCTAATTA TGTCTTTATT GTTGGCATGC CAAGATCAGG AAGTACTTTA
CTGGAAAACA TATTAAGTTT AAATTCTGAA GTAACTGATA TGGGCGAGGT TAGCTTTTTA
GAGGAATCCA TCAAGGAAGC TAAAGATTTT GAAGAAATAT ATGATTTATA TGAAAAAAAA
GTTATTAATC AATTTAAATC CGCTACCTTT TACACCGATA AAAGTTTATT TAATTATATG
TATATTGCCA TTATTTCTAA TTTTTTTCCT AAAGCAAAAA TAATAAATTG CATAAGAAAC
CCTCTCGATA ATATTTTATC CATTTATAGA GCAAACTTTT TAAATCAGTC ATTCTCTTTC
TCTTTATCTG ATATTTCTTG TTTATATAAA CACTATTTTG AAACTATGGA GGAATATAAA
ATTAAATATG GTGTAAATAT TTATGATTAT TACTATGAAG ACTTAATTGA AAATCCCAAT
AATGTAATAC CTAGGATAAT AAATTGGCTT GGTTGGGATT GGGACGAAAA ATATCTTTCT
CCCCATCAAA ACAAAAGAAA TGTACACACC GCAAGTAGCG CTCAAATAAG AAAGAAATTT
TATTCTTCTT CTATAGGAAT TTGGAAAGAA TATAAGGAAC TTTTGGAACC TGCAATAGAA
ATTATTAAAA CAAATAAACT TCTTGCAGAA AAGATTTCTA GGTGA
 
Protein sequence
MKKIDLLKKI EEAKIKHKAG NSLEANQIFQ ELLKSNNDSF DLLYAYGLFC RDLKNFNLAK 
RVFLNLINKF SSSINSYILL AEILKIENKF NDAERVLQKA IKINPNHGDL LYNLSLLYFT
LRNFDYALDY IDKAIKVSIN NDIYKLLKSE IYINKFNIDE ALYILENLNN KNRIKKDKNK
EIRINILLAN AFLKKRKYEE AETILLKLTK KYQGLELAYL NLSILYKDKN QLSKSIQILK
KGINLSPNFM PFYKNLASFY RNSGQLKLAI ETNLFIISRN KFDFNSFYEL SGIYDFKNHK
NELDFLLNTK LENLNPNSKI YAAFAISNLL HKQGKFKESA KYLKIANDEG MKYKKSDSSL
KIKHTESYRS LKIKKSKNKY LKNSSNYVFI VGMPRSGSTL LENILSLNSE VTDMGEVSFL
EESIKEAKDF EEIYDLYEKK VINQFKSATF YTDKSLFNYM YIAIISNFFP KAKIINCIRN
PLDNILSIYR ANFLNQSFSF SLSDISCLYK HYFETMEEYK IKYGVNIYDY YYEDLIENPN
NVIPRIINWL GWDWDEKYLS PHQNKRNVHT ASSAQIRKKF YSSSIGIWKE YKELLEPAIE
IIKTNKLLAE KISR