Gene OSTLU_42117 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_42117 
Symbol 
ID5006317 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009371 
Strand
Start bp309808 
End bp311148 
Gene Length1341 bp 
Protein Length446 aa 
Translation table 
GC content58% 
IMG OID640421738 
Productpredicted protein 
Protein accessionXP_001422155 
Protein GI145355838 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value0.0484588 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGAAATG TCGATCGCGA ATGCGGGACG ACGGCGGAGC GACGAGGGAA GTACAGGTCG 
AGCGAGACGC ACGAGGAGCA CGTGAATAAG AAGATACACG CGGCGAAGCG GCGGACGAGG
AGCGAGCTGT CGGTGCGACA GGGAGGGTGG GCGAAGCACG GGTACGCGTA CGAGGACGCG
GACGCGACGT CGCGGTCGTT GAAACATTTA CACGCGATCG AGCGGATATC GGTGAAGACG
ACGACAAAGG AAGAGTTTAT CGAAAGGTTC GAGAGGACAC GGACGCCGTG CGTGATCACG
GACGCGATGG AGGATTGGGG GTGTTATAAA AACGACGGCG GGCGGTTTTG GAGCGTAGAC
ACGCTCGCGG AGCGGTTTCG AGAGGTCAAG TTCAAGGTGG GGACGGACGA CGATGGGTAC
CCGGTGCGGT TGAAGATGAA GCACATTCAG CATTACGTGA ACGATCCGGT GCACATGCGA
GACGATAGTC CGATGTATGC GTTCGACGGG AGCGTCTTTG ACAAGCCGGA GACAAAGTCG
TTATTGGAGG ACTTCAAGAT TCCAGATTGG TTCGAGGAGG ATTTGTTCAA GCACGTCGGG
GCGAAGCGTC GACCGCCTTA TCGATGGATC GTTTTTGGGC CGCCTCGGAG CGGTTCGTCC
GTGCACGTTG ACCCGTTGGC GACGAGCGCT TGGAATGCGT TGATTTCGGG ACGCAAGCGT
TGGGCGCTTT ATCCGCCGCG CTCGGTGGAC AAGGCGACGA TCAAGCCGCG AGGAATTGGT
CTGGATGGTG AATCGGTGAC CTGGTTCAAC AAAATGTACC CTCGAACGAC GACAGAGGAG
TGGAAGCGCC AAGGATTGCC CCCACCCATT GATGTCATCC AACATCCTGG GGAAATCATG
TTCGTTCCAG ACGGTTGGTG GCACGCAGTG CTGAATCTCG ACCACACCAT GGCGGTGACG
CAAAACTTTT CGACTTCCGC CCGATTCGAC GCGGTGTGGC GCATCACGCG TCGAGCGCGA
CCGAAAATGT CTGCTAGATG GCTGGAAAAG TTGCGACGGG TTCGTCCAGA TTTAGCCGAG
GTGGCGGATG CGCAGCCTCG TCGAAGCGAA GTCAGCGCAG GTGAACAAAC GAGTTCGACG
AGCAGTTCAT CCTCTGGTTC AAGCGATACC GAAGCCGAAG CCGAGGACGA GGTCATGACG
AAAGAACGCG AGACCTTTGA GCGAGCCGCG GGGGGCGGCG GCGACGCCTC GACCAAACGC
ACGAGAACTG GAGACGGTTT GATCGCCGAC CTGGCCGCGG AAAAGATGCG CGCGGCGTCG
AAATCGATGG ACATCAACTA A
 
Protein sequence
MRNVDRECGT TAERRGKYRS SETHEEHVNK KIHAAKRRTR SELSVRQGGW AKHGYAYEDA 
DATSRSLKHL HAIERISVKT TTKEEFIERF ERTRTPCVIT DAMEDWGCYK NDGGRFWSVD
TLAERFREVK FKVGTDDDGY PVRLKMKHIQ HYVNDPVHMR DDSPMYAFDG SVFDKPETKS
LLEDFKIPDW FEEDLFKHVG AKRRPPYRWI VFGPPRSGSS VHVDPLATSA WNALISGRKR
WALYPPRSVD KATIKPRGIG LDGESVTWFN KMYPRTTTEE WKRQGLPPPI DVIQHPGEIM
FVPDGWWHAV LNLDHTMAVT QNFSTSARFD AVWRITRRAR PKMSARWLEK LRRVRPDLAE
VADAQPRRSE VSAGEQTSST SSSSSGSSDT EAEAEDEVMT KERETFERAA GGGGDASTKR
TRTGDGLIAD LAAEKMRAAS KSMDIN