Gene A9601_14861 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagA9601_14861 
Symbol 
ID4718207 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. AS9601 
KingdomBacteria 
Replicon accessionNC_008816 
Strand
Start bp1266344 
End bp1268449 
Gene Length2106 bp 
Protein Length701 aa 
Translation table11 
GC content31% 
IMG OID640079207 
Producthypothetical protein 
Protein accessionYP_001009876 
Protein GI123969018 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGGAACTTC CTTTAGACCA TTTTCGTTTA ATTGGCGTAA GCCCCTCTGC AACTTCTGAG 
GAAATATTAA GGGCGTTTCA ATTGCGGTTA GATAAAACAC CTGATGAAGG TTTTACTTAT
GAAGTTTTAA CCCAAAGATC TGAGCTACTT CGCCTCACTG CCGATCTACT TACAGATCCA
GAAAGCAGAA GAGAATACGA AAATTTGTTA TTAAATGGGA ATTCTGGATT GGATTTTTCC
TCAAATAGAG AAGTAGCAGG ATTAATACTT CTTTGGGAAT CAGGTTCACC AAAAGAAGCT
TTTAAAATAA CGAGAAAAGC ATTGCAACCC CCTCAAACCC CAGCTTTAGG AAGTAGTAGA
GAAGCTGATT TAACATTATT GGCTGCTTTA ACAGCTAGAG ATTCTGCAAT ACAAGAACAA
CAGCTTAGAT CCTATTCGAG CGCGTCAGAC TTTTTACATG AAGGTATAAA ACTTCTACAA
AGAATGGGAA AGCTTGGAGA AAAAAGAAAA GAACTTGAAG AAGATTTGGC TGCTTTGCTT
CCTTACAGAA TACTAGATCT ACTTAGTAGA GATCTAAATG ATCAAGACTC TCATAAAAAA
GGTTTAAGTA TGTTGGAAAA TTTAATAATC AAAAGAGGTG GTTTGGAAGG TAATAATAAA
TCTGAATATA AAGATTATTT AAATCAGCAA GAGTTTGAAG CTTTTTTTCA ACAAATAAAG
CCATTTTTGA CAGTGCAAGA ACAGATTGAT TTGTTTCTTG AATTACAAAA AAGAGGATCA
TTAGAAGCAG GATTTTTAGC GTTTCTATCT TTAACAGCTA TTGGTTTCTC TAGAAGAAAG
CCAGAAAAAT TATTTGAAGC GAGAAGAATT TTAAAAAAAT TAAATTTATC AGGTCTTGAT
TCAATGCCTC TAGTTGGTTG TTTAGATTTA CTTTTAGCTG ACATTGACCA AGCCTCTGCA
AGGTTTTCAA GTAGTTCTGA TGAAAATTTA CGAGATTGGC TCAATAATTA TCCTGGAAAT
AAGTTAGAAG CTATATGTAT TTTCTGTAAA AATTGGTTAG AAAATGATGT TTTAGTTGGG
TATAGAGACA TTAACTCAAA AGAGGTGGAT TTAGATTCTT GGTTTGAAGA TAGGGAAATT
CAAGAATTTA TTGAAAAATT AGAAAAGAAA ACAAAAAAAA TTGCAATTAG ATCAAATCTT
CAAAACCAAC AAACTGAGAA GGAATCCTCC ACAAAAACGA CTGAAGATTT TGATAATGTA
TTGGGGAATA TTGATGAAAG AAGATTACCT TGGCCTGGTG GCATAAAACA AGGCTATGAG
AAGGTTGAGA CCAAAAAAAC AGAATTCAAT GAGGAATACT TTAAGAAAAA ACCAATTGAG
TTTTATAATT TTTTAATTGA AAAAATTGCT GAATTTAAGT TTAGTTTTGG GGAATTCTTA
AAGGATAAAG AGATAATTAA TCGGTCTCCG TATTTAATTT ATATCTATGC ATTTTTGATC
TTATTTGCAT TTGGTATTGG TATTGGATTT TTAAGAAATA ATTTTAAAAA ATCAATTCAG
GACGAATCTA TTGCTGAAAA ACCATTAATT GCAAAAGATA AAAATCAAAA GATTAGTGAG
ATAGATATTA TTCAAGAAAT AAAAAAAAAT CCTTCAAATA AATTGAATTC TATTTCTGAG
AAATCTACTT CAATTATTTC TTATGAATTC AAAGAACTTA ATACTGCTTC ACCTACTTTG
GAAGATATAA AGAATTTAAT TAATAGATGG CTTCTTAATA AAAGTAATTA CTTAGAGGGA
AAGGGTGAAA TTAATCTTTC TAAGATTGTT AGTAAAGGTC TAATTGATCG AACAATCGAA
GAAAGACAGA ACGATATCAA GAAAGGAATT TATAAGGAGA TTAATTCCCA AATACTTAAA
ATTGATTTGG AATCGCAAAC TTCATCTAGG ATAGTTGTTT TAGTAGAATT GAATTATTTA
GAGAGGTTAG TAAAGAATTC GGGAGAATTT ATTAATGAAA CATCTTTAAA TCCCCTTAAA
GTTAAATATA TTTTGGGCTT TTCAAATAAA TCGTGGAAAT TGGTTGATTT CGTGAGCGGC
TTGTAA
 
Protein sequence
MELPLDHFRL IGVSPSATSE EILRAFQLRL DKTPDEGFTY EVLTQRSELL RLTADLLTDP 
ESRREYENLL LNGNSGLDFS SNREVAGLIL LWESGSPKEA FKITRKALQP PQTPALGSSR
EADLTLLAAL TARDSAIQEQ QLRSYSSASD FLHEGIKLLQ RMGKLGEKRK ELEEDLAALL
PYRILDLLSR DLNDQDSHKK GLSMLENLII KRGGLEGNNK SEYKDYLNQQ EFEAFFQQIK
PFLTVQEQID LFLELQKRGS LEAGFLAFLS LTAIGFSRRK PEKLFEARRI LKKLNLSGLD
SMPLVGCLDL LLADIDQASA RFSSSSDENL RDWLNNYPGN KLEAICIFCK NWLENDVLVG
YRDINSKEVD LDSWFEDREI QEFIEKLEKK TKKIAIRSNL QNQQTEKESS TKTTEDFDNV
LGNIDERRLP WPGGIKQGYE KVETKKTEFN EEYFKKKPIE FYNFLIEKIA EFKFSFGEFL
KDKEIINRSP YLIYIYAFLI LFAFGIGIGF LRNNFKKSIQ DESIAEKPLI AKDKNQKISE
IDIIQEIKKN PSNKLNSISE KSTSIISYEF KELNTASPTL EDIKNLINRW LLNKSNYLEG
KGEINLSKIV SKGLIDRTIE ERQNDIKKGI YKEINSQILK IDLESQTSSR IVVLVELNYL
ERLVKNSGEF INETSLNPLK VKYILGFSNK SWKLVDFVSG L