Gene OSTLU_46154 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_46154 
Symbol 
ID5002903 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009361 
Strand
Start bp530850 
End bp532175 
Gene Length1326 bp 
Protein Length427 aa 
Translation table 
GC content55% 
IMG OID640418324 
Productpredicted protein 
Protein accessionXP_001418961 
Protein GI145349066 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value0.938893 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CGACCCGATC GCGCGCGCGC GCGCGGAATA CGAACGCGCA TAAAAATGGT CCTCAAGCGC 
ATGTTCAGCC GCGGTTCGCT CGGCGATCTC AACGAGCGTC GAGCGAAGAA ACGCTCGCGA
ACGAACCCCA AGGACCTGCG CGCGCCCGGA CGTAAATTCG TCATCGTCAC CACCGCCGCG
CTGCCGTGGA TGACGGGGAC GAGCGTTAAT CCGTTGCTCC GAGCGGTGTA TCTGGCGAAC
GGAGACACCA CGCGCGAGGT GACGCTCTTA GTACCGTGGT TGGCGAGAAA GGATCAGAGG
ATCGTCTACC CAAAGCGCGT CGAGTTCAAG ACGCCGAGCG AACAAGAGGC GTACATCATG
GACTGGACCA AAAAACGCGT GGGATTCGCG CCAAAAATTT TGATCGCGTG GTATCCGGGA
AGGTACGCGA CGGACAAGGG AAGCATCGTG CCCGTGGGGG ATATCACGTT GCGCGTGCCG
AAGGCGAGCA GAGACGTGGC GATACTCGAG GAGCCGGAGC ATTTGTGCTG GTATCACCCC
GGAGCGCGGT GGACGTCGCG GTTTAAGCAT GTGGTTGGAA TCATTCACAC AAATTACTTG
GAATACGCGC GACGAGAGGA GGATGGTGAA CGTAAGGAGC AGATACTGCG TTGGATTAAT
CATTTGACGG CGCGATGCCA CACACACAAG GTCATCAAAT TGTCGGACGC GGTACAAGAG
TTTGCGCGGT CGATCACGCA AAACGTGCAC GGTGTGTCAA ATGGATTCAT CGATGCTGGT
AGAGAGAAGG CGAAACGAAT CAAGAAGGAA GGCAGCGGGG CGTTTAGTCG CGGGGCGTAT
TTCATCGGAA AGTGCGTTTG GGCGAAAGGA TACTCAGAGT TGATGCACGT GGTGGGCGAT
TTCAACGAAA AGTACGCTAA AAGTGCAAAA GAACGCTTGG AAATGGACGT ATACGGTGAT
GGTGATGATT TTGCCGACGT GAAAGCAGCT GTAGCTGAGA AGGCTCTGCC GCTGAGCTTG
CTCGGTCGTC TGGATCACGC GAATGAGAAA ATTCTCGATT ACAAGGTATT CATTAATCCA
TCACTGTCTG ACGTCGTCGC GACGACGTCC GCGGAGGCGT TGGCGATGGG CAAATTCGTC
GTGTGCGCAG AACATCCGAG CAACGCGTTC TTTGCCACGT TTCCAAATTG TCGCACGTAT
TCCAATATGG ATGAGTTTGC AAAGTGCATT AGAGAAGTCA CAACGTCGAC GCCAAAACCG
ATGACGGATG ATGAAATCCA TCGTTTAACG TGGGAAGCAG CGACAGAACG TTTGTTAGAC
GCCGCT
 
Protein sequence
MVLKRMFSRG SLGDLNERRA KKRSRTNPKD LRAPGRKFVI VTTAALPWMT GTSVNPLLRA 
VYLANGDTTR EVTLLVPWLA RKDQRIVYPK RVEFKTPSEQ EAYIMDWTKK RVGFAPKILI
AWYPGRYATD KGSIVPVGDI TLRVPKASRD VAILEEPEHL CWYHPGARWT SRFKHVVGII
HTNYLEYARR EEDGERKEQI LRWINHLTAR CHTHKVIKLS DAVQEFARSI TQNVHGVSNG
FIDAGREKAK RIKKEGSGAF SRGAYFIGKC VWAKGYSELM HVVGDFNEKY AKSAKERLEM
DVYGDGDDFA DVKAAVAEKA LPLSLLGRLD HANEKILDYK VFINPSLSDV VATTSAEALA
MGKFVVCAEH PSNAFFATFP NCRTYSNMDE FAKCIREVTT STPKPMTDDE IHRLTWEAAT
ERLLDAA