Gene OSTLU_33434 details

Gene Information

Locus tag: OSTLU_33434
Symbol:
ID: 5003675
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009363
Strand:
Start bp: 197952
End bp: 199724
Gene Length: 1773 bp
Protein Length: 440 aa
Translation table:
GC content: 60%
IMG OID: 640419096
Product: predicted protein
Protein accession: XP_001419738
Protein GI: 145350701
COG category:
COG ID:
TIGRFAM ID:
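
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, although it does reproduce the listed gene length), and the variable names are illustrative rather than part of the record. It also compares the coding length implied by the protein length with the length of the gene span.

```python
# Values copied from the record above.
start_bp = 197952
end_bp = 199724
protein_length_aa = 440

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Coding bases implied by the protein: three per residue plus one stop codon.
coding_bp = protein_length_aa * 3 + 3

# Bases in the gene span not accounted for by the coding sequence
# (for a spliced gene, plausibly introns and any flanking non-coding bases).
non_coding_bp = gene_length_bp - coding_bp

print(gene_length_bp, coding_bp, non_coding_bp)
```

The first value is 1773, matching the listed gene length. The coding length of 1323 bases is shorter than the gene span, which is consistent with the gene being marked as spliced.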


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.00944369
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
CCGTCGCGAT GCCCGCGTGC GCGGCGTGTC GCGCGCGCGA CGCCGATATA TTTTGTCTCG 
CCGACGGTGC GCGCGCGCGA CGAGAGACGC GTCGCGAACG CGCGAAAAAG ACGCGGATGG
ACGCGCGCGC GGCGACACGA TGCGCGCTGA CGCGGATGGA CGACGACGAC GAGAGACGAG
AGACGAGAGA CGCGCGACGC GCGCGACGAG AGACGCGCGC GACGAGGATT GGGACTGATT
TTTTACGCCC GCGATCGCGC TTCGATTCGG CGCAGAGGCG TTCCTGTGCG CGACGTGCGA
CGCGCGCGTG CACGGGGCGA ACGCGGTGCG CGCGCGCGCG GATATTCGCG AAACGAACGC
GCGCGCGAAC GAGCGATGTC GACTGGGCGC GCGAGACGAC GACGCGACGG ACTGACGATC
GGGACGCGGT TGAACGCCGT AGGTCGCGGC GAGACACGAG AGGATCACGG TGGATGAGTG
GTATAAACGA ACGCTCGAGG CGGGATTGAG TGAGGCGAAG GAGTGCGGGG ATTGGAAAGC
GGCGGCGAGC GCGACGGCGA GCGCGAGGCG CGAGGACGAG GACGGACGAG GACGCGGGAC
GAGCGAGAGT TTGAGGGAGA AGAGTTTCAG TTTGTTTAAG AGGGACGATG CGAAGACGAC
GACGCACGAT TCGACGTCGA GCATGGACGC GACGATCAGT GCGTGGGATG TGGGCGTGTT
TTTGAATTTA GGTGAGAACG GAGAAGAGGA TACGAGTCCA AGGGCGCCGC GAATGTCGAG
TAATAGCGAC ACGATGATTT TCGATTTGGA CGACGATCCG TTGGCTTCTT TGCTGGAGAT
GCCCGAAACA GAGTCGGCGC TTTTATTCGA TGGCGATGCG GCGTCTATTT CCGCCGCTCT
GGAGGCGGTG GCGGATCAGA TTCAAGCGGT GAGCCCGAAT CAAGCCGCCG CCAAGGCGTA
CGTCTCGAAG AGCGCGAGAG ATAATTTCAC TCCGCGACCG TCGCCTCTTG GATTGGGACT
GCCCGCGGCG CAGCGTGGGC CCGCAGAGAC GGTAGGTTCC TATCCACCCG GGGCTTTTCC
ACCCATAATG ATGCCGATGT CGAGCGATCT CTTCGGCATC CCTCGACGCG TGGTGAGCAA
GGAGCGCCAA GCGCAGCTCG ACAGATATCG CGCCAAGCGC GAGCGTCGAT TGATGGGGTT
GAAGAAAGTG GTGCGCTATG AATGCCGTAA AACGCTCGCA GACGCGCGCG TTCGCGTGAA
GGGTCGTTTC GTTAAGGCTA ATCCGGATGA GAAGACTTCA GCGCTAAAGT CATTTCAAAG
CTGTCCAGAC TTGTCGGCGT TGGTTGAGGA TGAGGACAAC GCAAAGCCCC TATCTTTCGC
ACCGATGAAG CACACAACGC TCGACGATCA GCAGCTACAT CAACAAAACT CTAAACGGCG
CATCTCAGAT GATCGTTTGT CGAACTCTGA CGCGTCGCAT GACGATAAGT TGGATGTTCA
GTCGATGCGA TACGAGATTC TTCGCGACTC GGGCGCGCCG GCGCTGCATC CACCCACGAT
CCCAGAGACC TTACCTCTCC CCAGTGGCTT AAGACGCACG AAGCAGATGC GTCACTGCCA
AAGCGAAATT AATTTGATGG ACTTGGCTGG CTATTAGCAA GTGTACACCG GTGTAGTCAA
CTACGAGAAG CGCCGCTCAG CGCCGACGGC GCGCCATCCG CGGTAATATC GAGACTTCGT
ATTTATGAAT ATATATTGTA CATTCTGTAT CAT
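
The gene sequence above is laid out in blocks of 10 bases, 60 per line, so the blocks have to be joined before any analysis. The following is a minimal sketch in plain Python with no bioinformatics library; `join_blocks` and `gc_fraction` are hypothetical helper names, and only the first line of the sequence is used as sample input.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, as displayed in the record.
first_line = "CCGTCGCGAT GCCCGCGTGC GCGGCGTGTC GCGCGCGCGA CGCCGATATA TTTTGTCTCG"
seq = join_blocks(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the full sequence text through the same two functions gives a length and a GC fraction that can be compared with the gene length and GC content listed in the record.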
 
Protein sequence
MPACAACRAR DADIFCLADE AFLCATCDAR VHGANAVAAR HERITVDEWY KRTLEAGLSE 
AKECGDWKAA ASATASARRE DEDGRGRGTS ESLREKSFSL FKRDDAKTTT HDSTSSMDAT
ISAWDVGVFL NLGENGEEDT SPRAPRMSSN SDTMIFDLDD DPLASLLEMP ETESALLFDG
DAASISAALE AVADQIQAVS PNQAAAKAYV SKSARDNFTP RPSPLGLGLP AAQRGPAETV
GSYPPGAFPP IMMPMSSDLF GIPRRVVSKE RQAQLDRYRA KRERRLMGLK KVVRYECRKT
LADARVRVKG RFVKANPDEK TSALKSFQSC PDLSALVEDE DNAKPLSFAP MKHTTLDDQQ
LHQQNSKRRI SDDRLSNSDA SHDDKLDVQS MRYEILRDSG APALHPPTIP ETLPLPSGLR
RTKQMRHCQS EINLMDLAGY