Gene OSTLU_13471 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_13471 
Symbol 
ID5006594 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009374 
Strand
Start bp255611 
End bp257041 
Gene Length1431 bp 
Protein Length476 aa 
Translation table 
GC content58% 
IMG OID640422015 
Productpredicted protein 
Protein accessionXP_001422694 
Protein GI145356967 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0465] ATP-dependent Zn proteases 
TIGRFAM ID[TIGR01241] ATP-dependent metalloprotease FtsH 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.155639 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.170565 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGTTTG TTGACTTTTT GAAGAGTCCG AAGAAATACG AAGCCCTGGG CGCGAAAATT 
CCCCACGGCG CGCTTCTCGT CGGACCGCCG GGGACGGGAA AGACGCTTCT CGCGAAAGCC
ACCGCGGGCG AGGCTGGGGT CCCGTTTCTT TCCATTTCTG GGTCGGATTT CATGGAGATG
TTTGTCGGCG TCGGCCCGTC GCGAGTTCGC GATTTGTTTG CCCAAGCGCG TCAGCAGAAG
CCGTCCATCA TTTTCATCGA CGAAATCGAC GCCATCGGTC GTCAACGTGG TCGCGGTGGC
TTCGCAGGAG GCAACGATGA GCGCGAAAAC ACGTTGAATC AGCTTTTGGT TGAGATGGAC
GGTTTCGGTA CCAAGGAGGG CGTCATCGTG TTGGCGGGTA CGAACAGACC GGATATTCTC
GACAGGGCGC TCCTGCGTCC CGGTCGATTC GATCGTCAGA TCACCGTCGA TCGTCCCGAT
ATTCAAGGTC GCGAACAAAT ATTCCGCGTG CATCTGGCCA AGATTGCCTT GGACGGACCA
GTGGATCACT ACAGTGAACG TCTCGCCGCG TTGACGCCCG GCTTCGCCGG GGCGGACATC
GCGAACATGT GCAACGAAGC TGCGCTCGCC GCGGCGCGTG ATAACATGAC CACGGTAACT
CTCACACACT TCGAGTACGC CGCCGATCGC GTCATCGCGG GTTTGGAGAA GAAGTCGAAG
GTTGTGAACA AGACGGAGCG TCGCACGGTG GCGTATCACG AAGCCGGACA CGCCGTCGTG
GGGTGGTTTT TGGAACACGC TGAGCCTTTG CTCAAAGTGT CCATCGTTCC GCGCGGTTCC
GCGGCTCTAG GCTTCGCGCA GTATCTGCCG AACGAGAACC TTCTCGCCAC GACGCAGCAG
CTGATCGATA TGATGTGCAT GACGCTCGGA GGCCGCGCCG CGGAGCAAGT CATGCTCGGA
AAGATTTCCA CCGGGGCGCA AAACGATTTG GAAAAGGTCA CGCAAATGGC GTACAACACC
GTGGCCGTGT ATGGCATGAA CGAGAAGATC GGGTTGCTTT CGTTCCCCAA AGACGAGCAA
AGCTTGAAGT CGCCGTATTC TGAGGACACG GCGAGAATGA TCGATGAAGA GGTTCGCCTG
CTCGTCGACA CCGCGTACAA GCGCACGTTG GCGCTCGTGA AGGAGAAGAA GCACCTCGTC
GAAGCCATGG CGCAAGGCTT ACTCGACAAG GAGGTTTTGC AGCGCCACGA TTTAGTTAAA
CTTCTCGGCG ATCGACCCTT CGTGTCTGAA AACCCGCAAA ACATTGATAT TTTGAACGAA
GGCTTCAAAA TGCACTATCC GAAGACGGCA ACGGCGCCAG AGGACGAACC CGCGGATACG
GACGAGCCGG AGGACGACGA GCCCAGTCCG GCGTTTCCAC TCGCGACTTA A
 
Protein sequence
MEFVDFLKSP KKYEALGAKI PHGALLVGPP GTGKTLLAKA TAGEAGVPFL SISGSDFMEM 
FVGVGPSRVR DLFAQARQQK PSIIFIDEID AIGRQRGRGG FAGGNDEREN TLNQLLVEMD
GFGTKEGVIV LAGTNRPDIL DRALLRPGRF DRQITVDRPD IQGREQIFRV HLAKIALDGP
VDHYSERLAA LTPGFAGADI ANMCNEAALA AARDNMTTVT LTHFEYAADR VIAGLEKKSK
VVNKTERRTV AYHEAGHAVV GWFLEHAEPL LKVSIVPRGS AALGFAQYLP NENLLATTQQ
LIDMMCMTLG GRAAEQVMLG KISTGAQNDL EKVTQMAYNT VAVYGMNEKI GLLSFPKDEQ
SLKSPYSEDT ARMIDEEVRL LVDTAYKRTL ALVKEKKHLV EAMAQGLLDK EVLQRHDLVK
LLGDRPFVSE NPQNIDILNE GFKMHYPKTA TAPEDEPADT DEPEDDEPSP AFPLAT