Gene OSTLU_18191 details

Gene Information

Locus tag: OSTLU_18191
Symbol:
ID: 5005421
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009368
Strand:
Start bp: 606731
End bp: 607912
Gene Length: 1182 bp
Protein Length: 393 aa
Translation table:
GC content: 73%
IMG OID: 640420842
Product: predicted protein
Protein accession: XP_001421337
Protein GI: 145354111
COG category:
COG ID:
TIGRFAM ID:
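
The start, end, gene length and protein length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: the function name `cds_lengths` is hypothetical, and it assumes the coordinates are 1-based and inclusive and that the final codon of this unspliced CDS is a stop codon that is not counted in the protein length.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1  # inclusive coordinates, so add 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # the stop codon encodes no residue
    return gene_len, protein_len


# Values taken from the record above.
print(cds_lengths(606731, 607912))  # (1182, 393)
```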


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000000000000101363
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.119889
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGCGCT CGACGCGCTC GGGCGATGCG ACGCGCGTCG ACGCCTTGAT GCGCGCGGTC 
GACGCGGTCG CGGACGCGCT CGGCGACGCG GGCGAGGCGA ATCATCGCGC GTTTCGGGAC
GACGCAGAAC GCATCGCGAC GGCGCTGGAG CGAACGCTGA CGCGCGACGA GGACGCGGAC
GCGTTCGCGG TGCTGGGCGT CGAACACACG GCTTCGATGT GGACGGCGTG CGTGGACGCG
TGGACGCGGG CGGCGGAGAC GCGCGAGGCG ACGCGCGCGA CGGCGGCGAC GCTGCGGGAC
GTCGCGGCGG CGGCGCGAGA CGCGGAACGC GCGTGGGCCG CGAAACGCGC GCGAGGGGTG
GCGAGAGAAC TGGAGGAGGT GGAATTGAGA CACGACGCGC GAGTGCGCCG AGGCGAGGAC
GCGGCGCGAA GAGCGCGACT CGTCGGACGA TTTCGAGCGG AGGAGACGGC GGCGCGGGAC
GCGCTCGGCG ACGCGGGCGA GGCGAGCGCG GGCGTGGACG TCGTGGCGCG GTTTCGAGCG
CTCGAGGGCG AGTTTTTGTG GCGCGAGGAG TGCGTGCTGC GAGGCGAGCG CGGGGCGAGC
GCGCTGGAGG CGAGCGTGCG GGCGTTCGAG CCGGATTCGA GGGTGGGCGC GTTGGCGCTC
GCGGTGACGC GCGCGGCGCG TTGGCGCGAG CGAGTGGCGA CGTTCGAGGG ACGGCGCGAC
GACGAATTCG AGCGCCGATG CGATCAAGAC GTCATCGCGC GCATGGAGGC GCAGCACCCC
GCGCTCGTGG CGCAGTACGT GATGTGGATG AAAGACGCCG CGTTGGAGTC GAGCGAACGC
GCGGCTTCGA GCGCCGTCGC CGCCGCGACG ACGCATCTCT ACGATTTCCT CCTCGCCCTC
GACGCCGACG TCTACGACGC CGTTTTCGAC GCCCAAGGCC GCGCGAACGT CCTCTGCGCG
TCGTTCAACG CCCTCCACGT CCTGCGCTCG CGCGCGTCGC CGTCCTCGCC GCCGCCTCGG
CTCGCCGCCG ACGCGCGCGC GTTAATCTCT CGTCTCGTCG TCCGCGTCGA CGCCGAGCGC
GACGCCGAGA TCGCGCTTCA CGGCGACGAC TGGGGCCAGG CCGCGTTCGC CGTCGTCGCC
GCGCGCTTGG CGAGCCAGAG ATTACTCGAA GTCGTCGCGT AG
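
A GC content figure like the one listed under Gene Information can be recomputed from the gene sequence. This is a minimal sketch, not the method used to produce the record: the helper name `gc_percent` is hypothetical, and it assumes the sequence is pasted as displayed here, in whitespace-separated blocks of bases.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence.

    Whitespace (spaces, newlines) between blocks is ignored.
    """
    bases = "".join(seq.split()).upper()  # drop block and line separators
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First 10-base block of the gene sequence above: 7 of 10 bases are G or C.
print(gc_percent("ATGGCGCGCT"))  # 70.0
```

Passing the full gene sequence, with all line breaks and spaces intact, gives the value for the whole gene.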
 
Protein sequence
MARSTRSGDA TRVDALMRAV DAVADALGDA GEANHRAFRD DAERIATALE RTLTRDEDAD 
AFAVLGVEHT ASMWTACVDA WTRAAETREA TRATAATLRD VAAAARDAER AWAAKRARGV
ARELEEVELR HDARVRRGED AARRARLVGR FRAEETAARD ALGDAGEASA GVDVVARFRA
LEGEFLWREE CVLRGERGAS ALEASVRAFE PDSRVGALAL AVTRAARWRE RVATFEGRRD
DEFERRCDQD VIARMEAQHP ALVAQYVMWM KDAALESSER AASSAVAAAT THLYDFLLAL
DADVYDAVFD AQGRANVLCA SFNALHVLRS RASPSSPPPR LAADARALIS RLVVRVDAER
DAEIALHGDD WGQAAFAVVA ARLASQRLLE VVA