Gene OSTLU_2530 details


Gene Information

Locus tag: OSTLU_2530
Symbol:
ID: 5002654
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009361
Strand:
Start bp: 183185
End bp: 184225
Gene Length: 1041 bp
Protein Length: 347 aa
Translation table:
GC content: 59%
IMG OID: 640418075
Product: predicted protein
Protein accession: XP_001418861
Protein GI: 145348860
COG category: [L] Replication, recombination and repair
COG ID: [COG5260] DNA polymerase sigma
TIGRFAM ID:
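
The coordinate and length fields above are related by simple arithmetic. A minimal Python sketch, assuming the start and end coordinates are 1-based and inclusive (an assumption that is consistent with the listed gene length, but not stated on this page); the variable names are illustrative only:

```python
# Coordinates as listed in the Gene Information fields.
start_bp = 183185
end_bp = 184225

# With 1-based inclusive coordinates, the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Split the span into whole codons and any leftover bases.
codons, remainder = divmod(gene_length, 3)

print(gene_length)  # 1041
print(codons)       # 347
print(remainder)    # 0
```

Under that assumption the span works out to the listed 1041 bp, which divides evenly into 347 codons, the same number as the listed protein length.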


Plasmid Coverage information

Num covering plasmid clones: 37
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATCGACTTTT CGTACCTGGA CGCGGCGCCG ATACGGAACG AACCGGGAAG CGCGAGCGCG 
ACGGCGGTCA AGGCGCTCGC GGTGGAGGAG GAGGAAGCCG AGGAGGAGTA CTTCGATCCG
AACGAGAATC CGCGATGGTG CCCGCCGGGA ACGGTGAAAC GTTTGAGGAA TTTACAGTCG
CCTCTGATCA GGCTGCACAC CGAAATCGTG GATTTCAGTA GGTATTTAGA GCCCACCGAG
GAGGAAGCGA CGTCGCGCGC CGCCGCCGTC GAACGCGTGC GAGCGGTGGT GAACGGGATC
TGGCCCGACG CTCGATTCGA AGTTCACGGT TCGTTTGCGA CCGGCATGTA CTTGCCGAGC
TCGGACATCG ATGCCGTGAT CTTGGACAGT GGTGCAAAAA ATGCGGGTCT GTGTTTGAAG
GCGCTCGCCG TCGCCTTGGC GCGACGCGGC ATGGCGATCA AGATACAACT CATAGCCAAG
GCGCGCGTGC CCATCGTGAA ATTCGAAGAA GTGGAAAGCG GACATCAGTT TGACATTAGT
TTCGACGTCG CGAACGGGCC GGCGAGCGCG GAGATCGTTC GAGAAAACAT GCGAAGGTTT
CCCGCGTTGC GTCCGCTCAC CACGGTGTTG AAGGCGTTTC TTCATCAACG CGGGCTCAAC
GAGGTGTATT CCGGTGGCAT CGGCTCTTAC GCGCTGCTTT GCATGGTGAT GGCTCATTTG
CAGTTGCACA ACACGACGTG TAAATCGACG TGGGCGGGGT CGCACGGCGC GAGCGATGCT
AGCGAAGGCT GCCTAGGAAC GCTCCTCATC GACTTTTTTG AGCTCTTCGG TCGCAGGCTC
GTCGCGGAAG AGGTTGGGAT CTCATGCGGA GGCAAAGGTC CAGGCTTTTT TAAGAAACGC
GACAAGGGCA TGTACGAAGA CTCTCGGCCG TTCTTGTGGG CGATCGAAGA CCCACAAGAC
GAAACGAATG ATCTCGGTAG GAACTCGTAC GCGTGCAGGC AGGTGAAGAG CGCGTTTGAG
CACGCGTTCA CCGTCATCAC G
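
The 10-base blocks above are a display convention, so spaces and line breaks have to be removed before the sequence can be used in a calculation. A minimal Python sketch for checking the length and GC content fields, assuming the sequence has been pasted in as a string; the helper names `clean_sequence` and `gc_percent` are hypothetical and not part of the source database:

```python
def clean_sequence(blocked: str) -> str:
    """Remove all whitespace from a blocked sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First displayed line of the gene sequence (six blocks of ten bases).
first_line = "ATCGACTTTT CGTACCTGGA CGCGGCGCCG ATACGGAACG AACCGGGAAG CGCGAGCGCG"
seq = clean_sequence(first_line)

print(len(seq))                   # 60
print(round(gc_percent(seq), 1))  # GC percentage of this first line only
```

Applying the same two helpers to all of the sequence lines would give values to compare against the Gene Length and GC content fields.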
 
Protein sequence
IDFSYLDAAP IRNEPGSASA TAVKALAVEE EEAEEEYFDP NENPRWCPPG TVKRLRNLQS 
PLIRLHTEIV DFSRYLEPTE EEATSRAAAV ERVRAVVNGI WPDARFEVHG SFATGMYLPS
SDIDAVILDS GAKNAGLCLK ALAVALARRG MAIKIQLIAK ARVPIVKFEE VESGHQFDIS
FDVANGPASA EIVRENMRRF PALRPLTTVL KAFLHQRGLN EVYSGGIGSY ALLCMVMAHL
QLHNTTCKST WAGSHGASDA SEGCLGTLLI DFFELFGRRL VAEEVGISCG GKGPGFFKKR
DKGMYEDSRP FLWAIEDPQD ETNDLGRNSY ACRQVKSAFE HAFTVIT
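
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. A minimal Python sketch, assuming the standard genetic code (NCBI translation table 1), since the Translation table field on this page is blank; the `translate` helper is hypothetical and not part of the source database:

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons enumerated in TCAG order.
# Using this table is an assumption: the record leaves "Translation table" empty.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(nucleotides: str) -> str:
    """Translate a nucleotide string in frame 1; '*' marks a stop codon."""
    seq = "".join(nucleotides.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First three blocks of the gene sequence shown above.
first_30 = "ATCGACTTTT CGTACCTGGA CGCGGCGCCG"
print(translate(first_30))  # IDFSYLDAAP
```

The output for these first 30 bases matches the first block of the protein sequence; passing the full gene sequence would allow the same comparison across all 347 residues.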