Gene OSTLU_37673 details


Gene Information

Locus tag: OSTLU_37673
Symbol: PGE3501
ID: 5006015
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009370
Strand:
Start bp: 104265
End bp: 105482
Gene Length: 1218 bp
Protein Length: 391 aa
Translation table:
GC content: 64%
IMG OID: 640421436
Product: predicted protein
Protein accession: XP_001421975
Protein GI: 145355452
COG category: [R] General function prediction only
COG ID: [COG2319] FOG: WD40 repeat
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 43
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.0842145
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGCGCG CGCGGGAGGC GAAACGGGCG TCGTACGCGC CGACGTGCGC GATACGGGAC 
GGTCAGGAGC GCGCGCACGT GTACTGCGCG CGATTTTGCG CCATCGAAGG CGTCGATGGA
AAGTTTCAGC GAACGTTCGC CACGTGCGCG GGGACGCGGG CGGACGTGTG GGAGTGCGAA
AAGTCGGGAA ACGTCGTGCT CGTGGCGTCG TTTGAGACGC GCGACGCGAA TGAGGCGTTT
TACGCGTGCG AGTGGTGCGC GATTGACTCG GTGGGACGAC GGGAGAGCGG CGCCGACGCG
ACGACGACGG GAAAGGGGAA ATTGCGGCCG TGCCTGGCGC TCGCGGGGGA GGGAGCGGTG
GTGCGCGTCG TAGATTGCGT CACGGGGCGG CTGCACGTGA ATCTGGTGGG ACACGGAGGG
ACGGTGAATT CCGTCGTGTC GCACCCGTCG CGGCCGAGCG TGGTGGCGAC GGCGAGCAAG
GATTTGAGCG TTCGTCTGTG GCACGTCAAC ACCGGGGTGA CGATGGCGAT ATTAGCCGGG
GCTCGAGGCC ATAGAAATGA GTTGTTGAGC GTGGATTTTC ATCCCGCCAT CGACGCGAAA
GGGCAGATGA AGCTCGTCAC GGGCGCGATG GACAACTGCG TCAAGGTTTG GGCCACGCCG
CCGCTCGCGG ATTCCATGGC GAAGGCGGCG ACTTGGACGA AACCACTCGC GAATTTCAAA
ACGATCGTCA TCGATACGCC GATGTTTTCG AGCAGCAGCG TGCACGACGA TTACGTCGAT
TGTGTCGGGT GGTTGGGCGA CGCGGTGTTG AGCAAGAGCG TGGACGGCAT CGTGAAGCTT
TGGGTGCCAG ACGAACCCGT GGGCGTGGTG CACGCGCGAG GGAACCAATT TCGTTCGGTG
TCGGCGTTTG AGCAAAAAGA CGCGAATTTG TGGTGGATAC GCTTCGCCGT CTCGGGATCG
CGAAACGCCT TCGCTTTGGG CAACATTAAA GGTTTGGTGC TGGTGTGGCG CTTGGACGCG
CGCGGCGGGT TGACGCGCGC GCCCGCGAGA TTGGCGGCGT TTCCGGTCAG GCGTAGCGCG
TCAAACAACG TTGCGCCCGA AATCGCGCTC GACGGCTTCG CGGTCGTTCG TCAGTGCGCC
ATCAATCGCG ACGGCGACGT CGTCGTCGCG GCGTGCGATT CGGGCCTCAT CTGTCGCTGG
GATTTGGCGA CGCCGAGC
 
Protein sequence
MARAREAKRA SYAPTCAIRD GQERAHVYCA RFCAIEGVDG KFQRTFATCA GTRADVWECE 
KSGNVVLVAS FETRDANEAF YACEWCAIDS GKLRPCLALA GEGAVVRVVD CVTGRLHVNL
VGHGGTVNSV VSHPSRPSVV ATASKDLSVR LWHVNTGVTM AILAGARGHR NELLSVDFHP
AIDAKGQMKL VTGAMDNCVK VWATPPLADS MAKAATWTKP LANFKTIVID TPMFSSSSVH
DDYVDCVGWL GDAVLSKSVD GIVKLWVPDE PVGVVHARGN QFRSVSAFEQ KDANLWWIRF
AVSGSRNAFA LGNIKGLVLV WRLDARGGLT RAPARLAAFP VRRSASNNVA PEIALDGFAV
VRQCAINRDG DVVVAACDSG LICRWDLATP S
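
The coordinates and sequence blocks in this record can be cross-checked with a short script. The following is a minimal Python sketch and is not part of the original record. The function names (`clean_sequence`, `span_length`, `gc_fraction`) are hypothetical helpers introduced for illustration. It assumes the coordinates are 1-based and inclusive, which is consistent with the stated gene length.

```python
def clean_sequence(block: str) -> str:
    """Remove the spaces and line breaks used to group a sequence block."""
    return "".join(block.split()).upper()


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end_bp - start_bp + 1


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 if empty)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# Coordinates taken from the record: Start bp 104265, End bp 105482.
print(span_length(104265, 105482))  # 1218, matching the stated gene length

# First line of the gene sequence from the record, grouped in blocks of 10.
first_line = "ATGGCGCGCG CGCGGGAGGC GAAACGGGCG TCGTACGCGC CGACGTGCGC GATACGGGAC"
seq = clean_sequence(first_line)
print(len(seq))          # 60 bases on this line
print(gc_fraction(seq))  # GC fraction of this line only, not the whole gene
```

To check the stated GC content, pass the complete gene sequence block to `clean_sequence` instead of a single line. Because the record marks the gene as spliced, the protein length (391 aa) is not expected to equal the gene length divided by three.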