Gene OSTLU_18438 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_18438 
Symbol 
ID5005966 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009370 
Strand
Start bp7303 
End bp8841 
Gene Length1539 bp 
Protein Length512 aa 
Translation table 
GC content55% 
IMG OID640421387 
Productpredicted protein 
Protein accessionXP_001421808 
Protein GI145355103 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones50 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACGGGG CAACGTTCAA GTTGTCGGTC GAAATCGTGC CCGAGAATAG ATTGTTGAAT 
CCCGCGCAGA CAGCCGCGCT CGAGGGCATT TTTACGAAAT GCTGCTCGTC GACGACGTCG
TGCCCGGTTT GGAAAACTGC CAGAAGATAC GGTTACGATC CCTGCCACTG GGGCTTGGCT
GGATGCACGA AGGCGAACGA GACGAAATGG TTGAATCTGC AGGGGCTGTA CTTGGATTGC
GAGCTCGGTT CATCCGACGT CAGCGCATTC GGGTCGTCGT TGCGTAGGCT GTATGTCGGA
CAAAACACGG CGCTGAAACT CGCCGACGAA GACGCGACGA TTGGATTGTT GAAGGTGCTT
CCGAATTTGG TAGAAATAGA CGTCACCGGA ATCGATCTAC AGGGGCGAAC CGTGGATGGA
CTGTGTCACG CATCGGTGAG CGCGAACTTG ACGCGTATCG GTTTGAACAC GGCGAACGTG
TCGGGTGCGT TGTCACAGTG CGTGGTGGAT AAACCTCAGT TGATGGATCT CGCGATGCAG
TACAATTACT TGACCGGAAC GCTGCCATCA CTACCATCGT CGTCCAACTT ACGAACGTTG
TATCTCCACG AGCAAAGGTC GGCAGACAGC ATCAGCGGCG TTCTACCGCC GTCGTACGTC
AGCTCGACGA CGCTCGAGCA CTTGTGGCTG ACCAATCTGA AACTCTCGGG CGCGTTGCCC
GACGTGTTTT CGCCGACAGG CGTGTGGCGA GAGATATACT TGAACAAAAA TGCATTTAAC
GGTACGATTC CAGCGTCTCT GGGCTCGCAG CGATATCTGC CGGTGCTCGA CTTGTCCTTC
AACGCGTTCT CAGGAGCGGT TCCCGGCGGT ATTTACGATC ACCCGAACCG CACGCACGTT
GGCATCAAGT CGAACAAATT GACGCAAGTG AGCGTGTCCT CAATCCACAG CCCGCCGGGC
GCGTCGCTGA TACGTCTCGA TGCGTCCAAA AACGTAGTCA ACGAGACGGG CGTGTCGACG
ATATTCACTC GAATGCCGAA GCTGCAGTAT CTCTACTTGA ACGACAATGA ACTTCACGGC
GTCATCTTGG ACGACTCGAC GACGCCGGTT TGGGCGCTTC GGCAATTAGA CGTGAGTACA
AACTATCTCG AAGGCGAAAT TCCTGGCGCC TCGTATTGGG GTAAAATCTT CACGTCGAGC
GCGCCCGCGG GACGAAAGTT TGACATATCG CAAAACCTGT ACACCAGAGC CCCATCCTGG
TTCGGCGCTT ACACCGGCGA TTCTGGTCTG ACGATCACAC TGGGCAGCGG GTTGTACGAT
CCATCCTCCG ATCCCGACGC CGCGCTCGCG TCTGCGAATG CAAAGCCTAC GGTGTCGAAA
TTCATGCTAG CGCTGTTGCT CATAACGCTC TTCGCCATGA GTGGACTGGG CCTGTACTTG
GGCATTTATA TCTTGGTGCA ACGGCGAAAT CGGGCGCACG CGAATCGCTT TAGGCAGTTT
CATGACTTTG ACCAAGGTCA AGGCGTCGAG ATGGCGTAA
 
Protein sequence
MHGATFKLSV EIVPENRLLN PAQTAALEGI FTKCCSSTTS CPVWKTARRY GYDPCHWGLA 
GCTKANETKW LNLQGLYLDC ELGSSDVSAF GSSLRRLYVG QNTALKLADE DATIGLLKVL
PNLVEIDVTG IDLQGRTVDG LCHASVSANL TRIGLNTANV SGALSQCVVD KPQLMDLAMQ
YNYLTGTLPS LPSSSNLRTL YLHEQRSADS ISGVLPPSYV SSTTLEHLWL TNLKLSGALP
DVFSPTGVWR EIYLNKNAFN GTIPASLGSQ RYLPVLDLSF NAFSGAVPGG IYDHPNRTHV
GIKSNKLTQV SVSSIHSPPG ASLIRLDASK NVVNETGVST IFTRMPKLQY LYLNDNELHG
VILDDSTTPV WALRQLDVST NYLEGEIPGA SYWGKIFTSS APAGRKFDIS QNLYTRAPSW
FGAYTGDSGL TITLGSGLYD PSSDPDAALA SANAKPTVSK FMLALLLITL FAMSGLGLYL
GIYILVQRRN RAHANRFRQF HDFDQGQGVE MA