Gene OSTLU_47438 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_47438 
Symbol 
ID5005348 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009368 
Strand
Start bp189918 
End bp192088 
Gene Length2171 bp 
Protein Length650 aa 
Translation table 
GC content58% 
IMG OID640420769 
Productpredicted protein 
Protein accessionXP_001421229 
Protein GI145353884 
COG category 
COG ID 
TIGRFAM ID[TIGR00617] replication factor-a protein 1 (rpa1) 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.53124 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0322988 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CGCCGCGCGC GCCCGAGCGC ACGCGCGATG CCGCCCGCGC TGACGCCGAA CGCGATATCC 
AATATTCTCG AACAGACCCA CGGATCGCAG GACTTCAAAC CCATCGTCCA AGTGTTCGAT
CTGAAGGAAT TGAAAACCAA ACCCGACGCC GACGACGCCG CGAAGCGATT CCGCGTCCTC
GCGTCCGACG GTGGATTCGC GGCGCAGGGA TTGTTCGGGG CGGAGTTGAA CGCGATGTGC
GAGCGAGGGG AAATTACGAA ATTCACGGTG CTGCGGTTGA GAGAGTACAT CGTGAACGAT
CTGAACGGGA GACGGTGCGT GCGCGAGATG GGTTGGGATG GGCGGCGCGG CGCGGCGCGG
CGCGGCGCGA GGGGCGAGAC GAGACGCGCG AGGATGGAGA CGCGCGATCG AGGGATCGCG
ATGATCTTGG ATGGAAGCGA GGAAGGCGGC GCGATGCGCG AGGGGGGTGA CTGACGATCG
TGCGAACGGA CGAATGGTTT GAACGTAGGA TTTTGATCGT CATGGACGCC GAGGTGATGG
ATCGGTACGA CGCCGTCATC GGACAGCCGC GGGTGTGGCA GCCGGGGACT GGGACGAACG
CGTCGACGGG GATGAACGCG GGGGGGATGC AACAGCAAAG AAACGCGTAC GGAGGAGCGC
CGGCGGCGCA GGTGGAAGGC TACGGGTCGG GTGGAGGGAA CGGCGCGAAT CTGGCGACGG
AACCGCCGCG CGCGAGCGGT GGCGGGTACG GTGGCGGCGC GCCGGCGGCG CAGGGGCAGT
ATCGTCGAGA TGGTGGCGCG GTGGCGCGCA ATGAGCAGCC GAGGTCCATC ACGCCGATCC
ATGCGTTGAA CCCGTACCAG AACCGTTGGA CTATTCGCGC GCGAATCACG ACTCCGTTGG
AGTTGCGCTC GTATTCGAAT GCGAAAGGCG AAGGTAAGGT GCTCGGCTTT CAAGTGCTCG
ATGCCGACGG AACGGAGATC AAGTGCGTGT GCTTTAACGA CACCGCCGTG CGCCTCGCGG
GGGAGTTACG TCAAGGCTTG GTGTACGAAA TTTCCAAGGG AGCAATCGTC ACGCCGCGCG
ACCCGCGGTA CGCGATTTAT CAGTACGAAA TTAAGTTGGA TAACCACGCG ACGTTCGTGC
CGTGTCCAGA CGCCGAACGC GACATCAAGA AGATGGTATA CAAGTTCAAG AAGCTTTCTG
AACTCGACGC GCTCAACGCC GGAGATATGG TGGATGTCAT TGGCATCGCA TACTCTGTGG
GTGATTTGAC GACGATCATG AAGCGCGACG GTTCCGAAAC TTCGAAGCGT TCTGTGATGA
TTCGCGACGA CTCGGACACA TCCATCGAGT TCACGCTTTG GGATCCGCAC TCAGTCGAGA
TTGGCGGGCA AATCGAAAGC TTGATCGCTA GCGGCGAAAA ACCCGTCATC GCGGTGAAGA
GCTCTCGATT GGGCGAGTTC CAAGGCAAGA ACATGGGCAC CGTGAGCAGC ACGATGGTAG
AAATAAATCC CGACAGTTCC GAGGCGACGC GCATGCGCGT TTGGTTTGAT CAAGGCGGCG
CCGATAAAAC TTTCAACTCC TTGAGCGGTT CTGGCGGTGG CGGCGGCAAA GGCAGTGGTG
AATTGCTCTC GTTCTCGACT GTGAAAGAGA TCGGTGAAGA ACTCGTGGCT AAAAATGAGG
GCGTGGCGTA CCTGAGTTGC TGCGGTATCA TAAAGCACAT CAAACTCGGC GCGGAAGGTA
ACTTCTATCC CGCGTGTCCG TTGCTTAATG GTGAACGCAC GTGCCAAAAG AAGCTGCGTA
AAGATGACTC GACTGGTGAA TGGAAGTGCG AACGTCACGC CGGTGAAAAA ATCGAAGCCG
CGGATTGGCG TTACATGTTT AGCATGGTTT GCATGGATCA CAGCGATGAG TATTGGGTGA
GCGTTTTCGG TGACAAGGGT GACAAGATTT TCGGGATAAG CGCCGCTGAA ATGAAGGAAA
TCTACGACCG TGAACCGGAG CGATACGAAA ACATGATCAG TGACGCACTG TTCAACGATT
ACTCTCTACG CGTTAAGGTC GCCGTTGACA ACTACACCGA CGTACCCCGC GCCAAGGGCA
GCTTGGTTGA AATCGAGCGC GTCAACTACG TAGACATGAG CAAGAAGTTG ATCGGCAAGA
TTGCAAAGCT T
 
Protein sequence
MPPALTPNAI SNILEQTHGS QDFKPIVQVF DLKELKTKPD ADDAAKRFRV LASDGGFAAQ 
GLFGAELNAM CERGEITKFT VLRLREYIVN DLNGRRILIV MDAEVMDRYD AVIGQPRVWQ
PGTGTNASTG MNAGGMQQQR NAYGGAPAAQ VEGYGSGGGN GANLATEPPR ASGGGYGGGA
PAAQGQYRRD GGAVARNEQP RSITPIHALN PYQNRWTIRA RITTPLELRS YSNAKGEGKV
LGFQVLDADG TEIKCVCFND TAVRLAGELR QGLVYEISKG AIVTPRDPRY AIYQYEIKLD
NHATFVPCPD AERDIKKMVY KFKKLSELDA LNAGDMVDVI GIAYSVGDLT TIMKRDGSET
SKRSVMIRDD SDTSIEFTLW DPHSVEIGGQ IESLIASGEK PVIAVKSSRL GEFQGKNMGT
VSSTMVEINP DSSEATRMRV WFDQGGADKT FNSLSGSGGG GGKGSGELLS FSTVKEIGEE
LVAKNEGVAY LSCCGIIKHI KLGAEGNFYP ACPLLNGERT CQKKLRKDDS TGEWKCERHA
GEKIEAADWR YMFSMVCMDH SDEYWVSVFG DKGDKIFGIS AAEMKEIYDR EPERYENMIS
DALFNDYSLR VKVAVDNYTD VPRAKGSLVE IERVNYVDMS KKLIGKIAKL