Gene OSTLU_17764 details

Gene Information

Locus tag: OSTLU_17764
Symbol:
ID: 5005111
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009367
Strand:
Start bp: 100665
End bp: 101786
Gene Length: 1122 bp
Protein Length: 373 aa
Translation table:
GC content: 64%
IMG OID: 640420532
Product: predicted protein
Protein accession: XP_001421052
Protein GI: 145353507
COG category: [R] General function prediction only
COG ID: [COG5273] Uncharacterized protein containing DHHC-type Zn finger
TIGRFAM ID:
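
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are inclusive and that the unspliced CDS ends in a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 100665
end_bp = 101786
gene_length_bp = 1122
protein_length_aa = 373

# Assumption: coordinates are inclusive, so length = end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon, so codons = residues + 1.
codons = gene_length_bp // 3

print(span_bp == gene_length_bp)         # coordinates agree with gene length
print(gene_length_bp % 3 == 0)           # length is a whole number of codons
print(codons - 1 == protein_length_aa)   # codons minus stop = protein length
```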


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.0000195954
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.0112396
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGCGCG CGCTCGCGCT CGCGCTCGCG CCGCTCGCGT GCGTCGCCGT CGTCGTGCTC 
GGTCCGACGA AGCGGTTCGA GCGCTCGGCG CTGGGACGCG CGCATCGCGC GCTCGGCGAC
GCCGCGCCGC GCGTCGCGAC GGCGCTGGTG ACCATCGCGG TGTGCGACCG AAAACGCGGC
GCCGCGGTCG CCAGCGCGTG TTTCGACGCG CTCGATCGGC CGAATCCGCT CGGTCAAATC
GTGTATCTGA CGCTCGCGAT CGGTGGTCAC GCGAGCTTCG TCGAGGGGGT GGAGAAGCGG
CTGTTGGACG GCGGCGACGC GCGCGGGTGG ACGGCGGCGA CGAGCGCGGC GTTCGCGTTC
GCGTGCGCGA CGTGGGCGTT GGTGTGCTGC AGCGAGCCCG GAACCATCAC GAGAGAGAAC
AACGAAGAGT ATTTGAAGGC GTACGCGTAC GATGAAATCG TGTATCACAG GAAACGATGT
CGGACGACGG GGAAGGACGC GCCGGCGCGA TCGAAGTGGT GCACGACGAC GGAGCGGCGC
GTGGCGAGGT TCGATCACTT CTGCGTGTGG GTGAATAACA CGATCGGGGC GAATAATTTG
AGGTGGTTTT TGCTGTTTTT ATTCGCGCAG CTCGTGTTGG TGGGGTACGT GACGTTGGCG
TGCGCGCACG CGGTGCGGCG TTCCATGACG CGCCGAGACT GTTGGTCGTT AAGGTTCCAG
CACGAGACGC CCTCGGGCGC GCGCGCGACG CTCGGGAACG ACAAGGCGCT GCTTTATAGA
TTTGTGGTGT ACCACTACGC CCCCGCGGTG ACGCTCGGCG TGTTTTGCGC GCTCGTCTTC
GTGCTGTTGT CGGTTTTCTT GGGCTACAAC GTGTGGCTCG CCGCGAAAAA CGTGACGACG
AACGAAACCT TCAAGTGGGA GCTCGTGCGC GAATCCGTGG AGACGATGAA GGGCGAGCGC
GCGGGCGGAA GCGGCGATGA ACAAATCGAT TGGGGCGAGA TGACGAGAAA TAAGTACGAC
GTCGGGATTT GGGGGAACAT CAAAGAAGTA CTGTTCCCGC CCGTGAAAAC GCCGAGCGCG
TTCGCGCTTC CGTGGGACGC GCTCAAAGCG AAGCGTCCAT AG
 
Protein sequence
MTRALALALA PLACVAVVVL GPTKRFERSA LGRAHRALGD AAPRVATALV TIAVCDRKRG 
AAVASACFDA LDRPNPLGQI VYLTLAIGGH ASFVEGVEKR LLDGGDARGW TAATSAAFAF
ACATWALVCC SEPGTITREN NEEYLKAYAY DEIVYHRKRC RTTGKDAPAR SKWCTTTERR
VARFDHFCVW VNNTIGANNL RWFLLFLFAQ LVLVGYVTLA CAHAVRRSMT RRDCWSLRFQ
HETPSGARAT LGNDKALLYR FVVYHYAPAV TLGVFCALVF VLLSVFLGYN VWLAAKNVTT
NETFKWELVR ESVETMKGER AGGSGDEQID WGEMTRNKYD VGIWGNIKEV LFPPVKTPSA
FALPWDALKA KRP
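
The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines of Python. This is an illustrative sketch only: the record leaves the translation table field empty, so the standard genetic code (NCBI table 1) is assumed here, and the helper names `clean`, `gc_content`, and `translate` are hypothetical. The functions accept the space-separated 10-base blocks used in this record.

```python
# Assumption: standard genetic code (NCBI translation table 1);
# the record itself does not state which table applies.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used for display and uppercase."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Return the fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate in frame from the first base, stopping at a stop codon.

    Raises KeyError if a codon contains a character other than A, C, G, T.
    """
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First row of the gene sequence as listed in this record.
first_row = "ATGACGCGCG CGCTCGCGCT CGCGCTCGCG CCGCTCGCGT GCGTCGCCGT CGTCGTGCTC"
print(translate(first_row))
print(round(100 * gc_content(first_row)))
```

Passing the full gene sequence to `translate` and `gc_content` gives values that can be compared with the protein sequence and the GC content field listed in the record.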