Gene OSTLU_35481 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_35481 
Symbol 
ID5002670 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009361 
Strand
Start bp121524 
End bp122583 
Gene Length1060 bp 
Protein Length307 aa 
Translation table 
GC content58% 
IMG OID640418091 
Productpredicted protein 
Protein accessionXP_001418840 
Protein GI145348817 
COG category[R] General function prediction only 
COG ID[COG1310] Predicted metal-dependent protease of the PAD1/JAB1 superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.0686481 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.028016 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAACGCT TGATGCAATC TGGCGGTATG CCCGGCGCCA TGCCCGGCGC TGGTGACGCC 
GCGCAGGTCG ACACCGCGGA ACAAGTGTAC ATATCTAGCT TAGCGCTGCT CAAAATGCTC
AAACACGGTG AGCGCGCGAC GCGAGCGACG AATCATCGTC GAGTTGACTG TTGGATCTGA
CGCGAAATCT CGAGACGATC GCGGTCGTGT GCGCGACGCG CGGGCGTGGA CTGACCGATG
GACGCCGCCG TCGCTCGAAA TAGGTCGCGC GGGAGTGCCG ATGGAAGTCA TGGGCCTGAT
GCTCGGACAG TTCGTCGATG AGTACACGGT GACGGTGGTG GACGTCTTCG CGATGCCGCA
GAGCGGCACG GGGGTCAGCG TGGAGGCGGT CGATCCGGTG TTTCAAACGA AAATGTTAGA
CATGTTGAAG CAGACCGGGC GAGAGGAGAT GGTGGTGGGT TGGTACCACT CGCACCCTGG
GTTCGGGTGC TGGCTGTCGG GGGTGGACAT CAACACGCAG CAGTCGTTCG AGCAGTTGAA
CCCGAGGCTC GTCGCCGTGG TCATCGACCC GGTGCAGAGC GTGCGAGGGA AGGTGGTGAT
CGATGCGTTT CGATTGATTA ATCCGCAGAC GATCATGCTC GGTCAGGAGC CGAGACAAAC
GACGTCGAAT CTGGGGCACT TGAACAAACC TTCGATCAGC GCGTTGATCC ACGGGTTGAA
TAGGCACTAC TACAGCATCG GCATTTCGTA CGCCAAATCG GTGCTGGAAG AAAAGATGTT
GTTGAATTTG AACAAGTCGA AGTGGAGCGC GGGTTTGAAG GTGAACAAGT TCGACGAGCA
AGAGAAGCAA AACGAAAACG TCGTGCTCGA GCTCAAGGAG TTGGCGACCA AGTACGAAAA
GGCCGTCGTG GAGGAGGACA AGCTGACGGC GCAAGAGCTC GTGGTGAAAA ACGTCGGCCG
GCAGGATCCG AAGAAACATC TCAGCGAGAA CGTGCAAAAG CTCATGGCGG ATAACATCGT
GCAGACGCTC GGGGTGATGC TGGACACGAT TTGTTTTTAA
 
Protein sequence
MQRLMQSGGM PGAMPGAGDA AQVDTAEQVY ISSLALLKML KHGRAGVPME VMGLMLGQFV 
DEYTVTVVDV FAMPQSGTGV SVEAVDPVFQ TKMLDMLKQT GREEMVVGWY HSHPGFGCWL
SGVDINTQQS FEQLNPRLVA VVIDPVQSVR GKVVIDAFRL INPQTIMLGQ EPRQTTSNLG
HLNKPSISAL IHGLNRHYYS IGISYAKSVL EEKMLLNLNK SKWSAGLKVN KFDEQEKQNE
NVVLELKELA TKYEKAVVEE DKLTAQELVV KNVGRQDPKK HLSENVQKLM ADNIVQTLGV
MLDTICF