Gene OSTLU_88555 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_88555 
Symbol 
ID5004457 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009365 
Strand
Start bp144787 
End bp145818 
Gene Length1032 bp 
Protein Length343 aa 
Translation table 
GC content65% 
IMG OID640419878 
Productpredicted protein 
Protein accessionXP_001420257 
Protein GI145351813 
COG category[L] Replication, recombination and repair 
COG ID[COG3145] Alkylated DNA repair protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0488535 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGTACG CGAAATCGAC GCGCCCGACG CGGTTCTTAA GGGTGTGCCC GCTTCGCGCG 
CGCGACGAGG CGCGCGCGCT CGCCGAAGGC GCGGCGCGCG AGGCGGTGGA AGTCGACGCG
CGGGACATCG ACGCGAAACG CGCGGTGACG TTCGAGTTCG CGCGCGTGGA AGACGCGACG
GCGGCGCTCG AGCGCCTGCG CGGAGGACGC ACGCGCGCGT CTGTGTTCGG CGCGGGCGTC
GCGGACGGCG AGACGGCGCT CGCGTGTGGG TACAGCGCGC GGGCGAGCGA CTCGCGAGCG
CCGGAGCGCG AGCGAACGAA GGCGTCGCGA CGGTCGTCTG TCGAGGGACT GACGCTGATC
GAGAATTTCG TCACGGTGGA CGAGGAAAGA GCGTTGGCGA CGCTCGCGGC GACGTCGGGG
GACGAGACGC GGTTGGCGCG GCGCCGGGTG AAACATTTTG GGTACGCGTT CGATTACGGC
ACGCGAGACG CGAACTTGAA GGTTGTCGAC GAGATTCCTG AGCTGGCGAT GGAAGTGTTG
CGGAGGCTTC CGCGCGAGAC GCCTGGGTAT GAAGGCGCGA TGCGGTGCGA CCAGGTGACG
GTGAATGAGT ATCCTCGAGG CGTCGGCTTG GCTCCGCACG TGGACACGCA CTCGGCGTTT
GGCGACACAA TTTTATCGCT CTCTTTGCTC GGGGGGACGG TGATGGAATT TAGGACGAGC
GGGGAAGCGC ATCGGGCGAT TTATCTTCCG CCTCGGTCAT TGCTAGTCAT GCATGGTGAA
TCGCGGTACC GATGGCAGCA TTATATACCG CATCGGAAGT TTGACACTCT CGAGGGCGAA
GCCGCGCCGA CGCCTCGAGA CGATGTGCGG CTCTCGTACA CTTTTCGCGA GCGGAGAAGT
GGACCATGCG AGTGCGCATT CCCGTTGCAG TGCGATTCGA GGGATGGCGC GCAGTCAAAG
TGTAGCAAAC GTAAGACGGG TAAGACCCAG GCGTTCGCAG AGCTCGTGGG AGAGTCGGGA
GATAGCTTTT AG
 
Protein sequence
MSYAKSTRPT RFLRVCPLRA RDEARALAEG AAREAVEVDA RDIDAKRAVT FEFARVEDAT 
AALERLRGGR TRASVFGAGV ADGETALACG YSARASDSRA PERERTKASR RSSVEGLTLI
ENFVTVDEER ALATLAATSG DETRLARRRV KHFGYAFDYG TRDANLKVVD EIPELAMEVL
RRLPRETPGY EGAMRCDQVT VNEYPRGVGL APHVDTHSAF GDTILSLSLL GGTVMEFRTS
GEAHRAIYLP PRSLLVMHGE SRYRWQHYIP HRKFDTLEGE AAPTPRDDVR LSYTFRERRS
GPCECAFPLQ CDSRDGAQSK CSKRKTGKTQ AFAELVGESG DSF