Gene OSTLU_33079 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_33079 
Symbol 
ID5003461 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009362 
Strand
Start bp358898 
End bp360030 
Gene Length1133 bp 
Protein Length322 aa 
Translation table 
GC content60% 
IMG OID640418882 
Productpredicted protein 
Protein accessionXP_001419356 
Protein GI145349883 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value0.303449 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGATA CCGGGGCTGG GTTGACGTGT CACACCGCGC CGGTGCGTCG AAACGAGGCG 
GCGATGACGA GGGAAAGACG AAACGCGTGC CCTCGCGCGA TGGTCGCGCG ACGACGACGG
GTCGAACGAC GAGAGACGCG CGGTGGTCGC CGTCGACGCG CGCGGTGACT GACGACTGTG
ATGCACGTTT GTTCGCGCGA CGTTAGGGCG TGGTGTTCGA TACCATGGAT GCGCTGAAGC
TGCATTACAA ATCGGATTGG CATCGGTACA ACTTGAAGCG CGGCGTCGCC GGTTTGCCGG
TGGTCGGGAA GGACTTGTTC GACCGCGTGA TGACGCAAGC GGCGGCGCAG GAGGCGGCGA
GTAAGAAGAG GTCAGAAGGA GGGACGGCTA AGGCGGGAAA ATCGCATCTG AAGCGCAAGG
ATGAGCTTCC GAGAAGCGTG TTGCGGGCGC AGCGATTCGA GAAGTGGGCC GAGGCGCACA
AGGAGACGCT GGCCAAGGTG GACGCGTACA TCGCGCGGGG AGAAGAGGTT CCGGAGGCGT
TGTTGGATGA AATCTCGCGA CGACGAGGCG AAGAGGATGA CGACGACGAC GACGTGGACG
AGTACGATGG TGAATGGGAA GAAATGGACG AAGATGAAAC GCAAGAGGCG CTGGCGAACA
TCGAACGCGC CGCGCAAGAG GCGGAGAGTA GCGATGAAGA TATGGACGAT GACGCCCCGG
CATTTTCCAT GGAAGAACTC ACGAATGGTC CAGTGCGTCT GGCCGACAAC GGCTACGAAC
TCATCATTAT CGGCGCCGAT GGAAAGGCAA AGCGCATCGG TCCGCGAGAG TTTCGACGAT
ACTACAAGCA AAATCACCGT CCGAGCGACA GTCGCGATTC TGTTCGCGCC AACGCTCGAC
ACGCCGGCAT GCAAGTTTCA AGCGACGGCG TTTGTCGTGG TAGTGGCGGT GGAATCACTC
GCAGAGACTA CCCGACGTTG CCAACCCAAA TTTCCTTGGT GCACCGTCGA GCGCAGCGCG
CCTTGCGCAA GTACCAAGGC GACCTCATGG TCATGGGTGG AAGCGCGAAC AAGAAGTTTG
ACATGAGCGG CCGCAATGCC AAGACCAAGC TTCCGAAGGC GTGCCCGTAT TAA
 
Protein sequence
MSDTGAGLTC HTAPGVVFDT MDALKLHYKS DWHRYNLKRG VAGLPVVGKD LFDRVMTQAA 
AQEAASKKRS EGGTAKAGKS HLKRKDELPR SVLRAQRFEK WAEAHKETLA KVDAYIARGE
EVPEALLDEI SRRRGEEDDD DDDVDEYDGE WEEMDEDETQ EALANIERAA QEAESSDEDM
DDDAPAFSME ELTNGPVRLA DNGYELIIIG ADGKAKRIGP REFRRYYKQN HRPSDSRDSV
RANARHAGMQ VSSDGVCRGS GGGITRRDYP TLPTQISLVH RRAQRALRKY QGDLMVMGGS
ANKKFDMSGR NAKTKLPKAC PY