Gene OSTLU_17041 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_17041 
Symbol 
ID5003982 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009364 
Strand
Start bp222930 
End bp224069 
Gene Length1140 bp 
Protein Length379 aa 
Translation table 
GC content58% 
IMG OID640419403 
Productpredicted protein 
Protein accessionXP_001420116 
Protein GI145351505 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG5207] Isopeptidase T 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value0.852844 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0273457 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTGGCGAG GGGTGGCGAG CGAGGGGAAG CACATGTCGG CGTGGGAGCG CGAGACGGCG 
CGGAGCGAGC GACGGCGACC GGCGCGCGAG ACGGGCGGGA TTGAAAATTT AGGGAATTCG
TGCTACATCG CCGCGTCTTT GCAGTTGTTG CGAAGCATGC GTGGATTCGT GGAGTCGGTT
CACGAGGTGT CTGGAGACGA GGACGGTAAA CCGTTACTCG CCGCGCTCGG CGAGTTTTTC
AGATCGGACG CGAGCGAGTT GAGCGCGTCG GGCGTGAAGC GCGAAATGGG TCGCGTGCGG
GACGAGTACG GAGAGTTCGA TCAACACGAC GCAATGGAAT TCATGACGCA AATGTTGGAC
ACGATCGAGC GCGAAATGGG TGACGACGCG GCGCACTGTC CGAGTCGACA AAACTTCGCG
TGGCGCATCG AGCACGCGCT ATCGTGCGTC AGCTGCGGCG AAAGGAGCGT GATGGACGAA
TCGATGTACA TGCTGACGTT GCAGCTCATC ATCGACGAGA ACGAGTCTGT CGACGCGCTG
CTCGATCGGT ACTTCATTCC AGAAAAGCTC GAACGCAAGT GTTCGTGTGG ATGCTTGTTC
GCAATCTCGA CTCGACAAAT CGTCTCAGAG CCGAAATTCC TCCTCTTGCA CCTCAAGCGA
TTCAATGCGG TGATAGCGCG CGGTGTGTTG CGTTTGCAAA AGCTCACGGC TTCGATTCGT
CTTCCTTCAA AGATGTCGCT GATGCACGCT GGATCCGCCG CCGCCGAAAT CGTCGTCCCC
AAATCGTCTG GAGACGATTC GGATCTCGAA CACGCTAACA ACGCGTCCAA AAGTTCGCCC
GATACCCCGG GTGTGAAGCG TCACAACACG CGATCGGTGG CGGCTACGAG ACCGTTCGAC
TTGCTCGCCG TCATCTCGCA CCACGGGAAC ACGGTTGAGC TCGGGCACTT CGTCGCGCAC
ATCCGCGAGC GCAAATCGAA GGCGTGGAAG ACTTACGACG ACGAGCGCGT GACTTCCTAC
GTGGCGCGCG ATGAGCTCAT TTTCAACTCG CTTCAAGAAT TCGAACGCGA GTGCTACGTG
GTCGCTTACG AACGAGACGA TAACGAAAAT CTTCGCCAAG GAAACGCGCA AATATTCTGA
 
Protein sequence
MWRGVASEGK HMSAWERETA RSERRRPARE TGGIENLGNS CYIAASLQLL RSMRGFVESV 
HEVSGDEDGK PLLAALGEFF RSDASELSAS GVKREMGRVR DEYGEFDQHD AMEFMTQMLD
TIEREMGDDA AHCPSRQNFA WRIEHALSCV SCGERSVMDE SMYMLTLQLI IDENESVDAL
LDRYFIPEKL ERKCSCGCLF AISTRQIVSE PKFLLLHLKR FNAVIARGVL RLQKLTASIR
LPSKMSLMHA GSAAAEIVVP KSSGDDSDLE HANNASKSSP DTPGVKRHNT RSVAATRPFD
LLAVISHHGN TVELGHFVAH IRERKSKAWK TYDDERVTSY VARDELIFNS LQEFERECYV
VAYERDDNEN LRQGNAQIF