Gene OSTLU_1559 details

Gene Information

Locus tag: OSTLU_1559
Symbol:
ID: 5005774
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009369
Strand:
Start bp: 436982
End bp: 438508
Gene Length: 1527 bp
Protein Length: 486 aa
Translation table:
GC content: 56%
IMG OID: 640421195
Product: predicted protein
Protein accession: XP_001421664
Protein GI: 145354801
COG category: [G] Carbohydrate transport and metabolism; [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0702] Predicted nucleoside-diphosphate-sugar epimerases
TIGRFAM ID:
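
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal Python sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that Protein Length excludes a stop codon; neither is stated in the record, and all names in the code are illustrative.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 436982
END_BP = 438508
GENE_LENGTH_BP = 1527
PROTEIN_LENGTH_AA = 486


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_nucleotides(protein_length: int) -> int:
    """Nucleotides needed to encode the protein, assuming no stop codon is counted."""
    return 3 * protein_length


gene_span = span_length(START_BP, END_BP)
coding_nt = coding_nucleotides(PROTEIN_LENGTH_AA)
non_coding_nt = gene_span - coding_nt

print(gene_span)      # 1527
print(coding_nt)      # 1458
print(non_coding_nt)  # 69
```

Under these assumptions the coordinate span equals the reported Gene Length, and the 69 bp not accounted for by the protein is consistent with the gene being flagged as spliced.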


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAACCCGG TGAATCTCGG ACGGAAGTCG CGGGCGGCGT TTGAGAACGT CTTCAAGCAG 
CTGACGTCGT TGACGTCGTT TCAAAAGTCC ACGGCGCCGA CGAACGCGAG AGAGTTTGAT
CAGGTCTACG ACGCGGATTT GCTCAGTGGG AGCTCGGTGG GGGAGTTCGA GACGCCGAAC
GCGAAGTTTA CGACGGTGTT GGTCACGGGG GCGACGGGGC GCATAGGTCG CGTTTTGATT
CGCAAGCTCT TGTTGCGCGG ATACACAGTC AAGGCGCTCG TGCGTCGCCA GGAAGACGTC
GAGAAGCTTC CGGGTTTGGT ACAAGTCATC GTCGGGGACG TCGGGGAGAA AGAAGTGATC
AAAAATGCCA TGATTGGCGT GAACAAGGTG ATTTACTGCG CGAGCGCAAA AACCTCCGTC
ACGAGCGACT TGTACAACGT CGCCGACCAA GGTGTGAAGA ACGTGGTATC GTGCATGCAA
GACTACTATC ACATGCTCGC TTCCCGTCGC GCCGGTCGCA GCGCCAAGTC CAAGGTGATG
TTGACCAACT TCAAGCACCC GACGGCGTAC GAGGCGTGGG ACGTCGAAGA GATCGAAGCC
GACGCCGGCG CCGGCGCCGA CGGGCGATGG GCCGCCGCGG CGGAGATGCA GCGTGTGAAC
TTCGATCCGC TCTACCCCGA AGACGAGGAC AAACCTTTCG AATTCGCGAC GTTCAACGGT
TTCATCACCT CTCGTACGGG TAAGGCTGAA GTGAGCTCAA ACGTCGAAGG TTTGCAAGCC
GACGTCGACT TTTCAGCCAA GGAAGGTTTG TTGTTCCGTT TGAAGGGCGA CGGGAAGCGC
TACAGCGTGA TGCTCACGCA GGACGATGGT TCCAAGTTCA GATTTTCGTT CAACACCACT
GGGGGATGGC AAGTCATTCG TATGCCGTTT CACAAATTCG TCAGTGAAGG GAAAACTTCT
TGGGGAGACG ACGGCGACGC CATTCTCGAC TTGACGAGAA TCGAGAAGAT TGGCGTTCGC
TTCGATGCGA GAAAGAACCA ACGCGAGACG ACGATGTCAG ACGTGATGAG TGGGAACAAT
AACATGTTCA ACTTGACGCT CGAGTACGTC AAGGCGATTC CCAAGGGCGA GGAACCCGAT
GTCATTTTGG TTTCGTGCTT CGGCGCCGGT TTGGAAGAGG GCGAAGAAAA GGAACGTATC
CTGAAGATAA AGCGTGACGG TGAACGCGTG CTGCGCAACT CTGGTGTAGG ATACACCATC
GTTCGCCCGG GTGAGCTCGT CGAAGAGGCT GGTGGGGGCA AGGCGTTGGT TTTCGATCAA
ACCGAACGCA TCAACACGCC GATTTCTTGC GCCGACGTCT CCGACGTCTG CGTCAAGGCG
ATGCACGACG AAGAGGCGCG TAACAAGAGC TTCGATGTCG GCTACGAGTA CGAAAGCGAG
CAAGCCGAGT ACGAGCTGAT CACCCAAGTC AAAGGCAAAT CCGACAACTA CCTCACTCCG
GCGTTGAAGG TGCTCGAAAA GAACTCG
 
Protein sequence
VNPVNLGRKS RAAFENVFKQ LTSLTSFQKS TAPTNAREFD QVYDADLLSG SSVGEFETPN 
AKFTTVLVTG ATGRIGRVLI RKLLLRGYTV KALVRRQEDV EKLPGLVQVI VGDVGEKEVI
KNAMIGVNKV IYCASAKTSV TSDLYNVADQ GVKNVVSCMQ DYYHMLASRR AGRSAKSKVM
LTNFKHPTAY EAWDRVNFDP LYPEDEDKPF EFATFNGFIT SRTGKAEVSS NVEGLQADVD
FSAKEGLLFR LKGDGKRYSV MLTQDDGSKF RFSFNTTGGW QVIRMPFHKF VSEGKTSWGD
DGDAILDLTR IEKIGVRFDA RKNQRETTMS DVMSGNNNMF NLTLEYVKAI PKGEEPDVIL
VSCFGAGLEE GEEKERILKI KRDGERVLRN SGVGYTIVRP GELVEEAGGG KALVFDQTER
INTPISCADV SDVCVKAMHD EEARNKSFDV GYEYESEQAE YELITQVKGK SDNYLTPALK
VLEKNS
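
The sequences above are printed in space-separated blocks of ten residues, so they need to be normalized before any analysis. The following is a minimal Python sketch of that step together with a GC-content calculation. The helper names `normalize_sequence` and `gc_percent` are hypothetical, and the sample is only the first line of the gene sequence, so its GC value is not expected to reproduce the whole-gene figure reported in the record.

```python
def normalize_sequence(raw: str) -> str:
    """Remove all whitespace (block separators, line breaks) and upper-case the residues."""
    return "".join(raw.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a normalized DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First line of the gene sequence, copied verbatim with its block spacing.
first_line = "GTGAACCCGG TGAATCTCGG ACGGAAGTCG CGGGCGGCGT TTGAGAACGT CTTCAAGCAG "

dna = normalize_sequence(first_line)
print(len(dna))         # 60
print(gc_percent(dna))
```

To check the reported GC content, the same two functions can be applied to the full gene sequence pasted as one multi-line string.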