Gene A9601_11521 details

Gene Information

Locus tag: A9601_11521
Symbol:
ID: 4717865
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. AS9601
Kingdom: Bacteria
Replicon accession: NC_008816
Strand:
Start bp: 966978
End bp: 968528
Gene Length: 1551 bp
Protein Length: 516 aa
Translation table: 11
GC content: 27%
IMG OID: 640078867
Product: putative dienelactone hydrolase
Protein accession: YP_001009543
Protein GI: 123968685
COG category: [R] General function prediction only
COG ID: [COG4188] Predicted dienelactone hydrolase
TIGRFAM ID:
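
The coordinate and length fields above agree with one another, which a short sketch can confirm. It assumes 1-based, inclusive coordinates (as the reported values imply) and a CDS whose final codon is a stop codon. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(966978, 968528)
print(length)                  # 1551
print(protein_length(length))  # 516
```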


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAAATACA TTTTTATAAT TTTTTTTAGT TTTTGTGGTT TATTTTTTAA TAATGGTTTA 
AAGGCTGCTG AAAAGATAAA TATTAAGTTT GAAGAGATGG AAATCCCTCT TACTATAGAA
CAATTATCAA AATTAGAAAA ATACAAAGAT AATTCAACAG AATTAATAGA TTGGTTAAAA
AAAAATGGAT TTATAAGAGT TTTTGAATTA TCAAAGTTTT TAGAATTTCC AGTTTTCAAA
GAAGATGGAT TAAATAGAGA AATATTAAGA AGTTGGATAG GGCGTAAAAT TCTTACAGAA
TTAAGCAAAA GCATTAAAGT TCCAAATGAC AATAATGGAA CAGAAATATA TAACACTATA
GAAAATTTAT TAGATCAAAA AAAACAAATT TCAACTTTAG ACATCATAAA GGCATTACCA
TCAGAAGAAA TTTCACTGGA TATTGATAAT CTAATTTTAA TAATTTCATC TTGGAAAAAT
GAATTATCAA TGCAACAAGA ACTGTTGTCC AAATTAAATC AACTTGAAAG AACTAAACAA
AATGTCTCTA AAAATACTGA AAAAAAATCA ATTGAAGATC TAATAAAAAT TGAAAAAAAA
ATTTATGCTC CTCACCGAGT GAAACCTTTT GAAATTGAAA TATGGAAAAG CAATAAAACA
AATTCTGATA GAGAATTAAT AATATTTATG CCAGGACTTG GAGGAGAAAT TAATAATTTC
AAATGGATAG GTAACGAATT AGCTAGAAGA GGCTGGCCAA TATTATTCAT AGATCATAGA
GGAAGTAATT TAGAATCATT CATAGAAGTA CTCGATGGTA AGGAAACAAT ACCAGGAAGT
GCAGACTTTT ACTTATATAG AATTAAAGAT TTAGATGCTG TATTGAAAGC TCATGAAAAT
GGAGAATTTG ATTTACCTAA TAATTCTTAC ATTTTAATGG GGCATTCACT TGGTGCTTTA
ATAGCACTTT TATATGAAGG CAAGAAACCT ACTGATCAAC TAGAGGAAAA ATGTGATTTG
GCATTAAAAG ACTTTGCGGT TACAAATTTA TCAAAATTAC TTCAATGTCA GTTGAGTGAA
ATACCATTCC CTAAGAACAA TAACACTAAT AAGGCCAGTG CCATAGTAGG CTTTAATTCA
TTTGGAAGTC TAGTATGGCC AAAAGAAAAT AGTACAGGCA TTAAGGTACC AACTCTTCTA
ATAGGAGGTA CTTATGACCT TATTACACCG TTAATGAATG AACAATTTAA AGTTTTTTAT
GCTTTAGATA ATCCATCAAA TAGATTTCTA ATTATTGAAG GAGCAAGTCA TTTCTCTCCA
ATAAGAATTA ATAAAAGCTA TGAAAAAAAT AATGACCTCT TCAAAATAAG TGAATCTTTT
ATTGGTTCAG AGCCAATATT AGTACAAGAT TTATCTACTA AATTTATAGT TGAATTTTTA
AAAAATATTA AAGACCAAAA GATCCCTAAT GTAGTTAAAA ACCAAAGAGA TTTGGGACTT
GACTTCCATT TTTTAGATCT TGAAACGATA AAAGAAATTT CCGAAAATTA G
 
Protein sequence
MKYIFIIFFS FCGLFFNNGL KAAEKINIKF EEMEIPLTIE QLSKLEKYKD NSTELIDWLK 
KNGFIRVFEL SKFLEFPVFK EDGLNREILR SWIGRKILTE LSKSIKVPND NNGTEIYNTI
ENLLDQKKQI STLDIIKALP SEEISLDIDN LILIISSWKN ELSMQQELLS KLNQLERTKQ
NVSKNTEKKS IEDLIKIEKK IYAPHRVKPF EIEIWKSNKT NSDRELIIFM PGLGGEINNF
KWIGNELARR GWPILFIDHR GSNLESFIEV LDGKETIPGS ADFYLYRIKD LDAVLKAHEN
GEFDLPNNSY ILMGHSLGAL IALLYEGKKP TDQLEEKCDL ALKDFAVTNL SKLLQCQLSE
IPFPKNNNTN KASAIVGFNS FGSLVWPKEN STGIKVPTLL IGGTYDLITP LMNEQFKVFY
ALDNPSNRFL IIEGASHFSP IRINKSYEKN NDLFKISESF IGSEPILVQD LSTKFIVEFL
KNIKDQKIPN VVKNQRDLGL DFHFLDLETI KEISEN
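
The relationship between the two sequences above can be sketched in a few lines of Python. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code but permits alternative initiation codons, so the leading GTG is read as methionine. The sketch assumes the gene sequence is given in coding orientation, and it treats only ATG, GTG and TTG as initiators (table 11 lists more). The helper names are hypothetical, and the space-separated blocks of ten bases are stripped before use.

```python
BASES = "TCAG"
# Amino acids in NCBI order: first, second and third base each cycle through TCAG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Assumption: a subset of the initiation codons allowed by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; upper-case the rest."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # an initiator codon is always read as Met
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence shown above.
fragment = "GTGAAATACA TTTTTATAAT TTTTTTTAGT"
print(translate(fragment))  # MKYIFIIFFS
```

Passing the full gene sequence to `translate` and `gc_fraction` in the same way allows the listed protein sequence and GC content to be checked against the record.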