Gene OSTLU_43365 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_43365 
Symbol 
ID5005310 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009368 
Strand
Start bp332774 
End bp334914 
Gene Length2141 bp 
Protein Length649 aa 
Translation table 
GC content58% 
IMG OID640420731 
Productpredicted protein 
Protein accessionXP_001421265 
Protein GI145353961 
COG category 
COG ID 
TIGRFAM ID[TIGR00617] replication factor-a protein 1 (rpa1) 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value0.895959 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0131975 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGCCCG CGCTGACGCC GAACGCGATA TCCAATATTC TCGAACAGAC CCACGGATCG 
CAGGACTTCA AACCCATCGT CCAAGTGTTC GATCTGAAGG AATTGAAAAC CAAACCCGAC
GCCGACGACG CCGCGAAGCG ATTCCGCGTC CTCGCGTCCG ACGGTGGATT CGCGGCGCAG
GGATTGTTCG GGGCGGAGTT GAACGCGATG TGCGAGCGAG GGGAAATTAC GAAATTCACG
GTGCTGCGGT TGAGAGAGTA CATCGTGAAC GATCTGAACG GGAGACGGTG CGTGCGCGAG
ATGGGTTGGG ATGGGCGGCG CGGCGCGGCG CGGCGCGGCG CGAGGGGCGA GACGAGACGC
GCGAGGATGG AGACGCGCGA TCGAGGGATC GCGATGATCT TGGATGGAAG CGAGGAAGGC
GGCGCGATGC GCGAGGGGGG TGACTGACGA TCGTGCGAAC GGACGAATGG TTTGAACGTA
GGATTTTGAT CGTCATGGAC GCCGAGGTGA TGGATCGGTA CGACGCCGTC ATCGGACAGC
CGCGGGTGTG GCAGCCGGGG ACTGGGACGA ACGCGTCGAC GGGGATGAAC GCGGGGGGGA
TGCAACAGCA AAGAAACGCG TACGGAGGAG CGCCGGCGGC GCAGGTGGAA GGCTACGGGT
CGGGTGGAGG GAACGGCGCG AATCTGGCGA CGGAACCGCC GCGCGCGAGC GGTGGCGGGT
ACGGTGGCGG CGCGCCGGCG GCGCAGGGGC AGTATCGTCG AGATGGTGGC GCGGTGGCGC
GCAATGAGCA GCCGAGGTCC ATCACGCCGA TCCATGCGTT GAACCCGTAC CAGAACCGTT
GGACTATTCG CGCGCGAATC ACGACTCCGT TGGAGTTGCG CTCGTATTCG AATGCGAAAG
GCGAAGGTAA GGTGCTCGGC TTTCAAGTGC TCGATGCCGA CGGAACGGAG ATCAAGTGCG
TGTGCTTTAA CGACACCGCC GTGCGCCTCG CGGGGGAGTT ACGTCAAGGC TTGGTGTACG
AAATTTCCAA GGGAGCAATC GTCACGCCGC GCGACCCGCG GTACGCGATT TATCAGTACG
AAATTAAGTT GGATAACCAC GCGACGTTCG TGCCGTGTCC AGACGCCGAA CGCGACATCA
AGAAGATGGT ATACAAGTTC AAGAAGCTTT CTGAACTCGA CGCGCTCAAC GCCGGAGATA
TGGTGGATGT CATTGGCATC GCATACTCTG TGGGTGATTT GACGACGATC ATGAAGCGCG
ACGGTTCCGA AACTTCGAAG CGTTCTGTGA TGATTCGCGA CGACTCGGAC ACATCCATCG
AGTTCACGCT TTGGGATCCG CACTCAGTCG AGATTGGCGG GCAAATCGAA AGCTTGATCG
CTAGCGGCGA AAAACCCGTC ATCGCGGTGA AGAGCTCTCG ATTGGGCGAG TTCCAAGGCA
AGAACATGGG CACCGTGAGC AGCACGATGG TAGAAATAAA TCCCGACAGT TCCGAGGCGA
CGCGCATGCG CGTTTGGTTT GATCAAGGCG GCGCCGATAA AACTTTCAAC TCCTTGAGCG
GTTCTGGCGG TGGCGGCGGC AAAGGCAGTG GTGAATTGCT CTCGTTCTCG ACTGTGAAAG
AGATCGGTGA AGAACTCGTG GCTAAAAATG AGGGCGTGGC GTACCTGAGT TGCTGCGGTA
TCATAAAGCA CATCAAACTC GGCGCGGAAG GTAACTTCTA TCCCGCGTGT CCGTTGCTTA
ATGGTGAACG CACGTGCCAA AAGAAGCTGC GTAAAGATGA CTCGACTGGT GAATGGAAGT
GCGAACGTCA CGCCGGTGAA AAAATCGAAG CCGCGGATTG GCGTTACATG TTTAGCATGG
TTTGCATGGA TCACAGCGAT GAGTATTGGG TGAGCGTTTT CGGTGACAAG GGTGACAAGA
TTTTCGGGAT AAGCGCCGCT GAAATGAAGG AAATCTACGA CCGTGAACCG GAGCGATACG
AAAACATGAT CAGTGACGCA CTGTTCAACG ATTACTCTCT ACGCGTTAAG GTCGCCGTTG
ACAACTACAC CGACGTACCC CGCGCCAAGG GCAGCTTGGT TGAAATCGAG CGCGTCAACT
ACGTAGACAT GAGCAAGAAG TTGATCGGCA AGATTGCAAA G
 
Protein sequence
MPPALTPNAI SNILEQTHGS QDFKPIVQVF DLKELKTKPD ADDAAKRFRV LASDGGFAAQ 
GLFGAELNAM CERGEITKFT VLRLREYIVN DLNGRRILIV MDAEVMDRYD AVIGQPRVWQ
PGTGTNASTG MNAGGMQQQR NAYGGAPAAQ VEGYGSGGGN GANLATEPPR ASGGGYGGGA
PAAQGQYRRD GGAVARNEQP RSITPIHALN PYQNRWTIRA RITTPLELRS YSNAKGEGKV
LGFQVLDADG TEIKCVCFND TAVRLAGELR QGLVYEISKG AIVTPRDPRY AIYQYEIKLD
NHATFVPCPD AERDIKKMVY KFKKLSELDA LNAGDMVDVI GIAYSVGDLT TIMKRDGSET
SKRSVMIRDD SDTSIEFTLW DPHSVEIGGQ IESLIASGEK PVIAVKSSRL GEFQGKNMGT
VSSTMVEINP DSSEATRMRV WFDQGGADKT FNSLSGSGGG GGKGSGELLS FSTVKEIGEE
LVAKNEGVAY LSCCGIIKHI KLGAEGNFYP ACPLLNGERT CQKKLRKDDS TGEWKCERHA
GEKIEAADWR YMFSMVCMDH SDEYWVSVFG DKGDKIFGIS AAEMKEIYDR EPERYENMIS
DALFNDYSLR VKVAVDNYTD VPRAKGSLVE IERVNYVDMS KKLIGKIAK