Gene OSTLU_37239 details


Gene Information

Locus tag: OSTLU_37239
Symbol:
ID: 5001439
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009358
Strand:
Start bp: 494969
End bp: 496111
Gene Length: 1143 bp
Protein Length: 381 aa
Translation table:
GC content: 59%
IMG OID: 640416860
Product: predicted protein
Protein accession: XP_001417523
Protein GI: 145346081
COG category:
COG ID:
TIGRFAM ID:
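The listed lengths can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, which the record does not state explicitly. The helper name `feature_length` is hypothetical and introduced only for illustration.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, ASSUMING 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Coordinates copied from the record above.
gene_length = feature_length(494969, 496111)

# A coding sequence should split into whole codons.
codons, remainder = divmod(gene_length, 3)

print(gene_length, codons, remainder)  # prints: 1143 381 0
```

Under that assumption the result matches the listed gene length of 1143 bp, and 1143 / 3 = 381 matches the listed protein length, so the coordinates appear to cover the coding codons only, with no stop codon counted.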


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value: 0.00188315
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker


Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a


Sequence

Gene sequence
ATGGCGCCGC GCGGGCCCGC GCGGACGTCG CGCGCGCCCG TCGGTGCGCG CGTGGCGCGT 
CAGGGGCGCG TCTCGCGCGG TCGAACGCGA CGCGCGGCGC GCGAGGACGC GCTCGGCGGG
CAAGATTTCA TCTACTCGCA ACGCAGCGGT GTCGAAGAAG AGCTCTTTAA AGGAAGCGTT
CTCGGCGTCG ACGCGGACGT CGCGACGGGC GAACATCGCG AGCGTGAGTT TCGAACGTTC
GCCGCGCTCG ATGGGTTTCA CGTGCCGGAA CGCTTCGCCG AGCGCGTGGC GACGCACGTG
GTGAAGAACT TGTTGAAGGA TAAGGGGGCG CTCGGCGCGA CGTCTCCGGC GTTAATTTTG
GGGATTTGGG GACACAAGGG TTGCGGCAAA ACGATGAACG TGGAGTTGGC GTGTAAGAAA
ATGGGGTTGC AGCCGATCGT AACGAGCGCG GGGGAGTTGG AGGATTCGAC GGCGGGGGAG
CCCGGGGCGA TGTTGCGGCG AAGGTATCTG ACCGCCGCGC GAGCGATGAG AGAGACGGGG
AAGTTGAGTT GTCTTATTAT CAACGACATC GACGCCGGGA TCGGTAAGTT TAAGGACGAT
CTGGGGACTG TAAATAATCA AATCACGCAC GGGACGTTGA TGAACATTTG TGACAATCCC
ACGATCGTGA GCGAGGGACT GGTTTGGAGG ACGGACTCCA AATCTACCAA CGCGCGCGTG
CCAATCATCG TCACGGGGAA TGATTTTTCT CGACTGTATG CGCCTCTAAC GAGAGACGGT
CGAATGGATC TTTGGATGTG GGAGCCGACG TCGCAAGAGT TGGTTGAGAT GATACACGCT
ATGATGAAGG ATGACGGGTT GACGACGGCG TGTTGCGAAA CGCTCGTCGC GACATTTCCG
AATCAGCCTT TAGATTTCTT CGGCGCGTTA CGCGCGCGTG TGTATGATGA CGCCGTCAGT
GATTTCGTGT TCAACGTCGG CTTAGATGGT TTAAATGACT CGCTCGTCGG TTTAGATGAA
CGTCGGAGGT TGAAATTAGG CGACGTGACG ATCACGCTGG AGCGGCTGTT GGCGTGCGGA
CGCAACGTCG TTGGCGAGCA AGAAAACGTG AATAATATTC AGCTCGCTCG AGAGTACATG
CGT
 
Protein sequence
MAPRGPARTS RAPVGARVAR QGRVSRGRTR RAAREDALGG QDFIYSQRSG VEEELFKGSV 
LGVDADVATG EHREREFRTF AALDGFHVPE RFAERVATHV VKNLLKDKGA LGATSPALIL
GIWGHKGCGK TMNVELACKK MGLQPIVTSA GELEDSTAGE PGAMLRRRYL TAARAMRETG
KLSCLIINDI DAGIGKFKDD LGTVNNQITH GTLMNICDNP TIVSEGLVWR TDSKSTNARV
PIIVTGNDFS RLYAPLTRDG RMDLWMWEPT SQELVEMIHA MMKDDGLTTA CCETLVATFP
NQPLDFFGAL RARVYDDAVS DFVFNVGLDG LNDSLVGLDE RRRLKLGDVT ITLERLLACG
RNVVGEQENV NNIQLAREYM R
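
The sequences above are printed in blocks of ten characters, so the spaces and line breaks have to be stripped before any analysis. The following sketch shows one way to do that and to compare the gene sequence with the protein sequence and the GC content. It assumes the standard genetic code, because the "Translation table" field in the record is empty. The function names `clean`, `gc_fraction` and `translate` are hypothetical helpers, not part of any database tool.

```python
from itertools import product

# Standard genetic code (an ASSUMPTION: the record leaves the table blank).
# Codons are enumerated in T, C, A, G order with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate whole codons from position 0; unknown codons become 'X'."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, usable, 3)
    )


# First row of the gene sequence as displayed above.
first_row = "ATGGCGCCGC GCGGGCCCGC GCGGACGTCG CGCGCGCCCG TCGGTGCGCG CGTGGCGCGT"
print(translate(first_row))  # prints: MAPRGPARTSRAPVGARVAR
```

The translated first row agrees with the first 20 residues of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the same comparison against the listed GC content and the complete 381-residue protein.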