Gene OSTLU_33239 details


Gene Information

Locus tag: OSTLU_33239
Symbol:
ID: 5003213
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009362
Strand:
Start bp: 606887
End bp: 608623
Gene Length: 1737 bp
Protein Length: 452 aa
Translation table:
GC content: 62%
IMG OID: 640418634
Product: predicted protein
Protein accession: XP_001419217
Protein GI: 145349600
COG category: [Z] Cytoskeleton
COG ID: [COG5023] Tubulin
TIGRFAM ID:
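
The length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end coordinates are inclusive (which matches the reported gene length) and that the protein is followed by a single stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above; variable names are illustrative.
start_bp = 606887
end_bp = 608623
protein_length_aa = 452

# Assumption: coordinates are inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: one stop codon follows the last amino acid.
coding_bp = (protein_length_aa + 1) * 3

print(gene_length_bp)              # 1737
print(coding_bp)                   # 1359
print(gene_length_bp - coding_bp)  # 378
```

The 378 bp that do not encode protein would be non-coding sequence within the gene span, which is consistent with the record marking the gene as spliced.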


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.00514267
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.249426
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TCCGGCGCGC CATCGCAGCG ACCTCATCTC GCTCGACGCC CGCTCGATCG ACCGCATCGC 
GCCGACCGCA TCGCGCGACT TCCAGCGTTC CAACGACTTT GAATTCTTCG CCGAGACGAC
ATGCGCGAGG TGATCTCCAT TCACATCGGT CAAGCCGGGG TGCAAACCGG GAACTCGTGC
TGGGAGTTGT ACTGCCTCGA ACACGGGATC CAGCCGGAGT GCGTCGAGCG AGAGGCGACG
CGACGCGACG CGATGGACGC GATGGATGTC GGGATCCGCG GGCGCGACGC TCGCGGGTTC
GGGCGGGTGG TTCAGGGATA GCGCGACGCG CGCGCGACGC CGAAAATAGC GCCGCTTTCG
TTGACCGAGC CGAGCGACTG ACTGATTGAC CGATTTCGAT CGCCAGTGGG CAAATGCCGA
GCGACAAGAC GATCGGGGCG TCTGATGACG CGTTCAACAC GTTCTTCTCC GAGACCGGCG
CCGGGAAGCA CGTGCCGCGA TGCATCTTTC TCGATCTCGA GCCGACGGTG ATCGACGAGG
TGCGCACGGG GGCGTACCGT CAGCTGTTCC ACCCCGAGCA GTTGATCTCG GGCAAGGAAG
ACGCCGCGAA TAACTTTGCG CGCGGTCACT ACACGATCGG CAAGGAAATC GTGGACTTGG
CCCTCGATCG CATTCGTAAG TTGGCGGACA ACTGCACGGG TTTGCAAGGC TTTTTGGTCT
TCAACGCCGT CGGCGGCGGC ACGGGTTCGG GTCTCGGCTC GTTGCTCCTC GAGCGCTTGT
CCGTGGATTA CGGCAAAAAG TCCAAGCTCG GGTTCACCAT CTACCCCTCG CCGCAAGTCT
CCACCGCGGT GGTGGAGCCG TACAACTCTG TGCTGTCCAC GCACGCGCTG CTCGAGCACA
CCGACGTCGC GGTGATGTTG GACAACGAAG CCGTGTACGA CATCTGCCGC AGGTCTTTGG
ACATCGAGCG CCCGACGTAC ACCAACTTGA ACCGCTTGAT CGCGCAGGTC ATCTCTTCGC
TCACCGCGTC TCTGCGATTC GACGGCGCGT TGAACGTCGA CGTCACGGAA TTCCAAACCA
ACTTGGTGCC GTACCCGCGC ATTCACTTCA TGTTGTCGAG CTACGCCCCG GTGATCTCCG
CCGAGAAGGC GTACCACGAG CAGTTGTCCG TCGCGGAGGT GACGAACAGC GCGTTCGAAC
CGGCGAGCAT GATGGCCAAG TGCGACCCGC GTCACGGCAA GTACATGGCG TGCTGCTTGA
TGTACCGCGG CGACGTCGTG CCCAAGGACG TCAACGCCGC CGTGGCGAGC ATCAAGACCA
GGCGCACGAT TCAATTCGTC GATTGGTGCC CGACCGGGTT CAAGTGCGGG ATCAACTACC
AACCGCCGAC CGTCGTGCCG GGTGGCGATC TCGCCAAGGT GCAACGCGCC GTGTGCATGA
TTTCCAACTC GACGGCCATC GCCGAAGTGT TTTCGCGACT CGACCACAAG TTTGACTTGA
TGTACGCGAA GCGCGCGTTC GTGCATTGGT ACGTCGGCGA GGGCATGGAG GAGGGCGAGT
TCTCAGAGGC ACGGGAAGAT CTCGCGGCGC TTGAAAAAGA TTATGAAGAA GTTGGATCAT
CATCGCAATC TGGTGTTTCA GATTTTGTCG AAGAGACGGA GTACTGAGCC CGCGGCGCCG
TCGCACTGTC GCGCATGCAC CGCTCGAGGC ACTGACTCGC GGTGACGCGC CACCACT
 
Protein sequence
MREVISIHIG QAGVQTGNSC WELYCLEHGI QPDGQMPSDK TIGASDDAFN TFFSETGAGK 
HVPRCIFLDL EPTVIDEVRT GAYRQLFHPE QLISGKEDAA NNFARGHYTI GKEIVDLALD
RIRKLADNCT GLQGFLVFNA VGGGTGSGLG SLLLERLSVD YGKKSKLGFT IYPSPQVSTA
VVEPYNSVLS THALLEHTDV AVMLDNEAVY DICRRSLDIE RPTYTNLNRL IAQVISSLTA
SLRFDGALNV DVTEFQTNLV PYPRIHFMLS SYAPVISAEK AYHEQLSVAE VTNSAFEPAS
MMAKCDPRHG KYMACCLMYR GDVVPKDVNA AVASIKTRRT IQFVDWCPTG FKCGINYQPP
TVVPGGDLAK VQRAVCMISN STAIAEVFSR LDHKFDLMYA KRAFVHWYVG EGMEEGEFSE
AREDLAALEK DYEEVGSSSQ SGVSDFVEET EY
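
The sequences above are printed in space-separated blocks of 10 with line breaks, so they need cleaning before being passed to other tools. A minimal sketch of that cleanup follows, with a simple GC-fraction helper; the function names `clean_sequence` and `gc_fraction` are hypothetical and not part of the source database.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a sequence shown in blocks of 10."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for an empty string)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# Last line of the protein sequence above, used as a small example.
tail = clean_sequence("AREDLAALEK DYEEVGSSSQ SGVSDFVEET EY")
print(len(tail))  # 32

# First line of the gene sequence above.
head = clean_sequence(
    "TCCGGCGCGC CATCGCAGCG ACCTCATCTC GCTCGACGCC CGCTCGATCG ACCGCATCGC"
)
print(len(head))  # 60
print(gc_fraction(head))
```

Running `clean_sequence` over the full gene or protein block and taking `len` of the result gives a quick check against the Gene Length and Protein Length fields.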