Gene OSTLU_44226 details

Gene Information

Locus tag: OSTLU_44226
Symbol:
ID: 5004424
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009365
Strand:
Start bp: 503692
End bp: 504678
Gene Length: 987 bp
Protein Length: 328 aa
Translation table:
GC content: 55%
IMG OID: 640419845
Product: predicted protein
Protein accession: XP_001420363
Protein GI: 145352032
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1404] Subtilisin-like serine proteases
TIGRFAM ID:
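
The Start bp, End bp, Gene Length and Protein Length fields can be cross-checked against each other. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 503692
end_bp = 504678
gene_length_bp = 987
protein_length_aa = 328

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
codons, remainder = divmod(span, 3)

# Assumption: the last codon is a stop codon and contributes no residue.
residues = codons - 1

print(span, codons, remainder, residues)
```

Under these assumptions the span matches the reported gene length, and the codon count minus the stop codon matches the reported protein length.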


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.0107542
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGATTCG CTCGTTTATT TGCCGTGTTT GCCGACGACA AGGAAGTCAA CGCACTTCGC 
ACGTGCGTTC CCGCGCTAGA TTTCGAAGAA GACGTTGAAG TGTTTTCACA AGGCGTGACA
ACCTGGTCCA GAGACCGTCT GAACGGTAGA GACGGGCTTG ATGGTAAGTT GCTTCCTCGC
GATCCCCCTG AAGATGGCTA TGGGAGCATC GTGCACGTTT ATCTGCTTGA CACGGGTGTG
AGAAGGACGC ACGTGGAATT CAAAGATCAA GCGTTTGGGC GCGGCGTCGA TTTAGTCGAC
GATGACGCCG AACCTGATGA CTGCGATGGT CACGGTAGTC ACGTGGCTTC GACCATCAAT
CAAATCGCGT ATAGCGGCAA GACAGTTCTC CACTCTGTGA GAGTTCTCGA CTGTAACGGA
AATGGTGAGC TCTCGGGGCT GATTGAGGGA CTTGAATGGG TGCTCGGTGT GGCGACGCCG
TCCGAGCCAG CCGTGGTCAG TTTAGCGCTC GGAGTGCGAA ATGGAATTTG GTCACGCGCA
CTCGAACGCG TCGTCCAAAC GCTCACTGGA CGTGGAGTAT TCATCGTTTG CGCCGCTGGT
AACCAAAAAG GAGACGCATG CACGATTTCG CCTGGAAACG TCGCGGAGAC GCTGACTGTC
GCAGCCAGCG ATCAAGCAGA CGCGCCGTAC GCCTATGGAA ACTCTGGAAG GTGCGTTGAT
TTATTCGCGC CCGGCGTGCA AATTCTCGGT GCATGCGGTG GAAGCACTGC GTGTGAGCAT
CCAAGCGACA CGGCATATGC ATTTCAAAGC GGTACGAGTA TGGCGGTCGC GCACGCCGTC
GGCGCAGCGA CGCGCCTTCT TATGTTTTCC CCACGCATGA GCCCTGAAAA TTTGAAGAAG
CATCTCACAT CAACTGCGTC GCGCGACAAA ATTCGAGGTG GGTCATTACT CCCAGGAACT
CCGAATCTAC TGTTGTACGT GAAATAA
 
Protein sequence
MRFARLFAVF ADDKEVNALR TCVPALDFEE DVEVFSQGVT TWSRDRLNGR DGLDGKLLPR 
DPPEDGYGSI VHVYLLDTGV RRTHVEFKDQ AFGRGVDLVD DDAEPDDCDG HGSHVASTIN
QIAYSGKTVL HSVRVLDCNG NGELSGLIEG LEWVLGVATP SEPAVVSLAL GVRNGIWSRA
LERVVQTLTG RGVFIVCAAG NQKGDACTIS PGNVAETLTV AASDQADAPY AYGNSGRCVD
LFAPGVQILG ACGGSTACEH PSDTAYAFQS GTSMAVAHAV GAATRLLMFS PRMSPENLKK
HLTSTASRDK IRGGSLLPGT PNLLLYVK
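
The relationship between the gene sequence and the protein sequence can be illustrated by translating codons directly. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1), since the Translation table field of this record is empty; `CODON_TABLE` and `translate` are hypothetical helper names, not part of the record.

```python
from itertools import product

# Assumption: standard genetic code (NCBI translation table 1).
# Amino acids are listed in the conventional TCAG codon order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon."""
    # Remove the spaces and line breaks used to group the sequence in blocks of 10.
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
first_block = "ATGCGATTCG CTCGTTTATT TGCCGTGTTT"
print(translate(first_block))  # MRFARLFAVF
```

The ten residues produced match the first block of the protein sequence. Passing the full gene sequence to `translate` would extend the same comparison to the whole protein.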