Gene OSTLU_5032 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_5032 
Symbol 
ID5003819 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009363 
Strand
Start bp473557 
End bp474684 
Gene Length1128 bp 
Protein Length376 aa 
Translation table 
GC content63% 
IMG OID640419240 
Productpredicted protein 
Protein accessionXP_001419817 
Protein GI145350867 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.127559 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GCGCACGTCG CCCTGCTGCT CTGCGCGCTC GCGCCGACCG TCGCGGTCGT CGATCCCAAC 
GCCCAGATCG TCGCCGTGAG CACGCTGAGC GTCGTCGCCG GCGCGTACCG AAGCGTGCGA
CCGGCGAGCG AGGGCTCGGG AGAGGTCATG ACCAAGGAAG ACGCGCAAAA GTTCCCTCTG
CTCGGGTCGT GCGTGCTGTT CGGGGCGTTT CTGGCGTTCA AGTTTTTGCC CAAAAACGTG
CTCGACGTGT GCGCGACGGC GTACTTTGGG ATGCTCGGCG TCGTGGCGAT GAGCGCGATC
CTGACCCCGG TCGTGCACAA ATTTGCGTTC GGGGGACGCG AGCTCGTGAG CTACGAACTG
TTTTCGGTGC CGGAGATGAA GTGGGTGAAC GGCGAGCGGT GGACGGCGGA GTGCACGCTG
GCGGAGGCGG CGGCGGGCGT CGCGGCGTTG GCGGGAACGG CGGCGTACGT TCGTTCGCGT
CATTGGTTGG CGAATAATGC GCTGGGAATG TCGTTTGCGC TGCAAGGAAT CGAGTATTTG
ACGATTGATA GCGTGCAGAT CGGGTCAATC TTGCTCGCGG GGTTGTTCGT GTACGACGTG
TTTTGGGTGT TTTGCACGCC GGTGATGGTG AGCGTGGCGC GGTCGTTCGA CGCGCCGATC
AAGCTACTTT TCCCGCGAGT CGCCGCCAGT GCGATCGAGG GCGCTAATAG ACCGTTTAGC
ATGCTAGGTC TGGGGGATAT CGTCGTTCCA GGGCTTTACG TGGCGATGAT TTTGAGGATG
GACAACGCGA GACGCGCGGC GGCGCTCGAG CCGAGAAAGT CGCTCACGAG ATCGGCGTCC
AAAAAAGCTG CGACCGCCTC TCGAACGGTC CGCGACGACG GAAAGACTGT GACAACGTAT
TTCCCCGCCG TCGCGTTCGG CTATCTCGTC GGGATCGTCA CCACCATCGT CGTCATGAAC
GTCTTTGACG CCGCCCAACC GGCGCTGTTG TACATCGTCC CGGGCGTCCT CGGCGCCACC
TTCATTCGCG CCGCTCTGGC GAAAGAAGTC GGCGTGACGT GGAATTACTG CGAAGGATTA
GAAGAGGCCC AGGCCGAGCG CGACGCCGCG GAAGCCAAGA CGAAGTCG
 
Protein sequence
AHVALLLCAL APTVAVVDPN AQIVAVSTLS VVAGAYRSVR PASEGSGEVM TKEDAQKFPL 
LGSCVLFGAF LAFKFLPKNV LDVCATAYFG MLGVVAMSAI LTPVVHKFAF GGRELVSYEL
FSVPEMKWVN GERWTAECTL AEAAAGVAAL AGTAAYVRSR HWLANNALGM SFALQGIEYL
TIDSVQIGSI LLAGLFVYDV FWVFCTPVMV SVARSFDAPI KLLFPRVAAS AIEGANRPFS
MLGLGDIVVP GLYVAMILRM DNARRAAALE PRKSLTRSAS KKAATASRTV RDDGKTVTTY
FPAVAFGYLV GIVTTIVVMN VFDAAQPALL YIVPGVLGAT FIRAALAKEV GVTWNYCEGL
EEAQAERDAA EAKTKS