Gene OSTLU_35268 details


Gene Information

Locus tag: OSTLU_35268
Symbol:
ID: 5003025
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009361
Strand:
Start bp: 119068
End bp: 120252
Gene Length: 1185 bp
Protein Length: 394 aa
Translation table:
GC content: 58%
IMG OID: 640418446
Product: predicted protein
Protein accession: XP_001418614
Protein GI: 145348351
COG category: [R] General function prediction only
COG ID: [COG4076] Predicted RNA methylase
TIGRFAM ID:
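
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS includes its stop codon; the function names are hypothetical and not part of any database tool.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming it ends in a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length = gene_length(119068, 120252)
print(length, protein_length(length))  # prints: 1185 394
```

Both values match the Gene Length and Protein Length fields of this record.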


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value: 0.0796669
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0661795
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACGCCG ACGGCGCGGG CGCGAGGGAC GACGGGGACG GGCGCACGGC GAATCTGGTG 
GTGACGAAAC CGAAGGACAT CGGCGACGCG GATTTCGCGA ATTATTTTTG CACGTACGGG
TACCTGTATC ACCAGAAGGA CATGCTGGAG GATCAGAATC GAATGACGGC GTACAGCGAC
GCGGTGCGAT TGAATCCGGA TTCGTTCAGG GGGAAGGTGG TGCTGGACGT GGGGACGGGA
TCTGGTGTGC TCGCGATGTG GGCGGCGCAA GCGGGGGCGA AGAAGGTGTA CGCGGTGGAG
GCGACGCACA TGGCTGTGCA AGCGAGGAAA ATCGTCGCGG CGAACGGGCT GAGCGACGTC
GTGGAGGTGA TACAAGGATC GATGGAGGAG GTGGAACTGC CGGAGAAGGT GGACGTGATC
ATCAGTGAAT GGATGGGGTA CTTTTTATTG CGCGAGTCGA TGTTTGATTC GGTGCTGTGC
GCGAGGGATA AGTGGATGAA GCCGGGAGGG GCGATGTTTC CGTCGCACGC GAAGATGTAT
CTGAGCGCGA TTAAGTCGAA CAAGAGTGGG CAAAAATATC AAGAGCTTCA AGAGAGCTTG
AACGTGTGGG AGGATTTCGT GCGGAACACG CACGAAAATT ATGGCATAGA CTTGTCGTGC
ATGAACGGCG AATACGAAGA TGAGCAAAAA GAGCACTATT TGAACACCGC GGCGTGGGTG
GACATTCACC CGTCGCAAGT CATGGCTAAG CCGTTTACGT TGGCGAGCTT TGACTTGAAC
ACGTGCTCGA TGGACGACAT CAAAGTTCTT CGCGACGTCG ACTTCAAGCT GCGCCTGTTC
GATGGACTCG CAGGGCCTTC GGGGGAGACG CGCGTCGGCG CGTTCGCGGG CTGGTTCGAC
GTCACGTTCG CGGGATCCAA GGAGAACCCG TGCGAGAACC CGGTCGAGCT CACGACAGCA
CCGGACGCGA ACGGCGCCAC GCACTGGGGT CAACAAGCGT TCTACATGTT TCCTGAGGTA
TACGCCAGCG ATGGTCACTT CATTTCTGGT AAGTTTGACA TGGTTCGTCG CAAGGAAAAC
CAACGCCTGT ACAGCGTTCG GATGAACTGG ACGCAGTCCG ACACGGACGC TGGGCCGCCG
AACGAGTCCA TCGGCACGCG TGGGATCGTT TGGCAGATTG AGTGA
 
Protein sequence
MDADGAGARD DGDGRTANLV VTKPKDIGDA DFANYFCTYG YLYHQKDMLE DQNRMTAYSD 
AVRLNPDSFR GKVVLDVGTG SGVLAMWAAQ AGAKKVYAVE ATHMAVQARK IVAANGLSDV
VEVIQGSMEE VELPEKVDVI ISEWMGYFLL RESMFDSVLC ARDKWMKPGG AMFPSHAKMY
LSAIKSNKSG QKYQELQESL NVWEDFVRNT HENYGIDLSC MNGEYEDEQK EHYLNTAAWV
DIHPSQVMAK PFTLASFDLN TCSMDDIKVL RDVDFKLRLF DGLAGPSGET RVGAFAGWFD
VTFAGSKENP CENPVELTTA PDANGATHWG QQAFYMFPEV YASDGHFISG KFDMVRRKEN
QRLYSVRMNW TQSDTDAGPP NESIGTRGIV WQIE
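
The sequences above are laid out in blocks of 10 characters, so they need to be joined before any analysis. The following is a minimal sketch of that step together with a GC percentage calculation of the kind reported in the GC content field; the helper names are hypothetical, and the example uses only the first line of the gene sequence rather than the full gene.

```python
def clean_sequence(blocked: str) -> str:
    """Join a sequence printed in space-separated blocks into one string."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence from this record, as displayed.
first_line = "ATGGACGCCG ACGGCGCGGG CGCGAGGGAC GACGGGGACG GGCGCACGGC GAATCTGGTG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full gene sequence through the same two functions should give a value comparable to the GC content field, subject to how the database rounds it.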