Gene OSTLU_17261 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_17261 
SymbolPAFE3501 
ID5004330 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009365 
Strand
Start bp160506 
End bp161953 
Gene Length1448 bp 
Protein Length447 aa 
Translation table 
GC content59% 
IMG OID640419751 
Productpredicted protein 
Protein accessionXP_001420262 
Protein GI145351823 
COG category[K] Transcription 
COG ID[COG5157] RNA polymerase II assessory factor 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value0.474377 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0107383 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACCCTT TGCGTTTGAT TAGAGATCAT ACCATCGCCG GTATAATCGC CGAGTGCGTC 
CTCGACGCGC CGCCGCTCAC GCGCTCACGC GCCGCGCGCC GCGCGGAAAC GGTTCGAACG
CGCGCGACTG ACGATGGGCG CTCGAAACGG GACGTAGGTC GACGCACGAC GATGAAAGGA
TTCACCTGAA GGATGTCGAC GTCGAGCTCC GGCGGACGAC GGAGACGAAT TACAGATCCA
AGGCTGGGAA CAAGCTGCTG AAGGTGGAGG CGATATGGTA TTTCATCAAG TATCACGTTG
CGAACCCCGA CGCGGCGCAC ACGGCTTACA TGAAGGCGGC GATCGCGGCG GGGTTCTCGA
CGCTGTCTAT GTTGGATCGG AAAGATTTGA TCGCGTACTT GACGGGGGAG CGAGCTACGA
GCGATCGAAT CGATATCACC GTGCCGGTGA TCGTGGATGA GGAGGGTGTG TCGACGGTTG
ACGCGAAACG CGCGCGCGAG GAAGACGAGG CGGAAGGCGT GCCGCGAGAG CGCGTGTTAA
GAGATAGGAA CTCAGTGCTT CGCGCGCCGA AGGATATGAC GAGCGTGTTG GATTTCTTCG
CGGCGCCGGA GGAGGAGAAG GAACGTTTAG AAGAGGAGAA GCAGCAGGCG GCTGATTTAG
CCAAGGGGAT TAAAAACCAG AGATATCGCG ACGTGAAGGA GCAGGTGTTT TGGAGAGAGC
ACGTCGGGAG CGACTTTGAT ATGATGAATT TGGACACCAA TGCGTCCTTC TTGAGTGGGC
CCAAGCCACC CGTAGACGAC GGCACCGACA TGTTGATGAC GGACGCACGC GCGATGGAAA
AACAGCCGAC CGCACCGAGC GGTCCGTCGA CTGCGAGTCG TGGAGGACCC GCTGCGGCCG
CCAAGGCTCC GCGCAAGACA TCAGGCAAAC CGGGCGGCGT CCCGATCATC ATCGTCCCCG
CCGGGTTCAA TCAAAAGGTC GTTCTCAACA TGTTCAACGC CAAGGAGTTC TTGCAGGACG
GCAAGTTTAC GCAGTGGGAT GTGGTGCAAA AAGGCGGCGC TAAAAAGTCA AGCTCCGTGT
ACATTTCGCG CACGTACAAG CGCGACGGCG CCAAGGTCAA GTACGAAGTC ACCGAAAAAG
CCCCTCACAA ACGTTCCGAA GACTGGGCCC GCGTCGCCGC GGTCTTCGTC CTCGGTGCTA
AGTGGCAATT CAAAGACTGG CCCTTCCGCG GCGTCGAAGA CGGTGATCTC GTCGAAACCT
TCACCAAGAT TCGCGGCTTT CACGCCCGCT TTGACGGCGA TCCCGAAGTC GACGTCGTCA
AGACCTGGAA CGTCAAGCCC ATCACCATCA GTCGCACCAA GCGTCACGGC GATCGCGCCG
CGTTCGAGTT CTTCTGGGAC GAGCTCGATC GTCACCTCGC CCTTCGTAGC AGCGCCTTGA
AGTATTAA
 
Protein sequence
MDPLRLIRDH TIAGIIAEST HDDERIHLKD VDVELRRTTE TNYRSKAGNK LLKVEAIWYF 
IKYHVANPDA AHTAYMKAAI AAGFSTLSML DRKDLIAYLT GERATSDRID ITVPVIVDEE
GVSTVDAKRA REEDEAEGVP RERVLRDRNS VLRAPKDMTS VLDFFAAPEE EKERLEEEKQ
QAADLAKGIK NQRYRDVKEQ VFWREHVGSD FDMMNLDTNA SFLSGPKPPV DDGTDMLMTD
ARAMEKQPTA PSGPSTASRG GPAAAAKAPR KTSGKPGGVP IIIVPAGFNQ KVVLNMFNAK
EFLQDGKFTQ WDVVQKGGAK KSSSVYISRT YKRDGAKVKY EVTEKAPHKR SEDWARVAAV
FVLGAKWQFK DWPFRGVEDG DLVETFTKIR GFHARFDGDP EVDVVKTWNV KPITISRTKR
HGDRAAFEFF WDELDRHLAL RSSALKY