Gene OSTLU_50969 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_50969 
Symbol 
ID5004775 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009366 
Strand
Start bp324889 
End bp327129 
Gene Length2241 bp 
Protein Length456 aa 
Translation table 
GC content58% 
IMG OID640420196 
Productpredicted protein 
Protein accessionXP_001420817 
Protein GI145352993 
COG category[R] General function prediction only 
COG ID[COG2319] FOG: WD40 repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.0627207 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000441065 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAGATAA AAACCATCTC GCGCGTCGAG GAGGACTACA CGCGCGAGCG AAAGTCGGAC 
GCGCTCAGGG TGCATCGAAA TCTCGCGCCC GAGCTGCGGC CGATGGGACG GGCGACGGAG
TACAAGCGCG CGCTGAACGC GACGAAGCTG GACAAGGTGT TCGCGAAGCC GTTCGCGGGA
CAGATGAGCG GACACGCGGA CGGCGTGCTG TGCATGGCGA AGTCGCCGGC GTCGCTGACG
GAATTGGTGA GCGGCGCGGC GGATGGAGAG ATACGAGTGT GGGACGTGCC GAGCCTGAAG
ACGGTGCGGG TGCTGAAGGG ACATCGAGGG GCGTGCCGAG GCGTGAGCGC GTCGAACGAC
GGCGGCGCGG TGGTGTCGTG CGGCGACGAC GCGACGATTC GGTTGTGGAC GATGCCGAAG
GCGGGAATGG GGGAGATGAA CGATCCGACG CGGAAGATTC CGGTGTTGGA GACGTCGGAG
ATGTACGTCG AGAGCAACGG TTTTAGGGAC TGCGACGCGC ACTGGGGGAA AAAGGAGTTC
GCCACCGCGG GGGCGAACGT GCAGGTGTGG AGCATGGAAC GGAGTCATGC GCTGCATACG
TTCGAGTGGG GTTCCGATAC GGTGCTTTCA GTGCGATATA ATCCGGTGGA GACGGATATT
TTTGCGTCGT GTGGGTCGGA TCGATCCATC GCGTTGTACG ACGTTCGAAT GCAGACGCCG
TTGAAGAAGA TCATCATGCA GACAAAGTCG ACCAAACTGT GCTGGAATCC GATGGAGGCG
TTTAATTTCA CCGTTGCCAA CGAGGATACC AACTTATACT CGTACGACAT GCGAAAGCTG
GATATCGCGA CGTGCGTTCA TAAGGATTTC GTGAGCGCCG TGATGGATAT CGATTACTCG
CCCACGGGTC GGGAATTCGT GGCGGGGAGT TATGACAGAA CCGTGCGCAT GTTTGATTAC
AACGCTGGAC ACTCTAAAGA TTGCTACCAC ACCAAACGCA TGCAGCGCGT GTTCTGTACG
CGCTTTTCGA TGGATGGTTC GTACGTCTTC AGCGCCTCGG ACGACATGAA CGTGAGGTGT
TGGAAGGCGG ACGCGAGCGC GCAAATGGGC ACGCTCTCGG CTCGTGAAAA GCGCAAGCAC
GCGTATAACG CGTCTTTAAA GGACCGTTTT AAGCACATGC CCGAAATCAG GCGCATTGCG
AATCACCATC ACGTACCCAA GGCTATTCAC AAGCAAACCA AGTTGCGGCG GACGATGCAA
GAAGCCGAGA CTCGCAAGGC GAAACGTCGC GTCGCGCACG CCGCGCCTGG CGCGGAGAAG
AAGGAATTCA AACCCGCGCG CAAAAAGAAG ATCCTCGCAG AAGTGGAGTA GGGGACGATA
TTAGCTTGCC GCACGAAACG AACCTATGTA ACTGGTGTAC AAACAATGAT TTTACTGCTC
ACTTGCGAAG AGCGTCGAGG CGCGCCTGTA AATCGTCGTC AAGCCCGCCA CCCCCGGTCG
CGGGCTCAGC CGAACCACCC TCTAGGACCG CGGTTGGCGC CACTCGTTCG GCATCGACGG
CGGCGCCAAC CTTACCCGCG GGCGCCGAGA TTAATTCCGC CCCGACGTTG CATCCCAGTT
CGTCCAACAC CGCGTTCACG AGTTCATCCG TCTCCTCCTC TTCATCCTCA CCCTCGAACG
CATCGTCGAT TGCGTCCCCC ATCACCTCTG TCGTCATCTC CATCTTTTCA TTCTGTCGCT
CAAACTCCTT CAATATATTT TGCAACGAGG GTAAATTCAG TTTGGTGTTC ATCGACTTCA
TCGCCGTCGT CACGCCTCGC ATCGCGTCCG CCATAGCCTG CGAGCTCTTC AACGTTTGCA
TTCGCAGCGA CACCCCTTGC AACTGGGATT TAAGCGCATA GAACTTCGTT ATCGAGTGTC
TCGTGCGCAC TAAATCCTTC GCCATCACCT ACATCGCGCG ACGCACGTTC GTTTCGTTCC
GTCAGTCGCC GTGGTCAATT CCGACCGCCG CGTCGCGCTT CCACCTTCAT CCCCTCCAAC
GCATCATCGC GCCGACACCG CGCGTTTTCG CCCTCGCGCA CGATCGTCCA CGCGCGCACC
TTCACCGCGC CCATCTGATT CGCCTTGGCC ACGCGCTTAA TCTCCGCTAT GAGCTTCTTT
TCTTGCGACA TCATGGCTGA CCGTTCGCGA TCGATTTCTC GAATGGATCT GTCGAGCATG
CGCTTGTTCT CGCGCAACAG C
 
Protein sequence
MKIKTISRVE EDYTRERKSD ALRVHRNLAP ELRPMGRATE YKRALNATKL DKVFAKPFAG 
QMSGHADGVL CMAKSPASLT ELVSGAADGE IRVWDVPSLK TVRVLKGHRG ACRGVSASND
GGAVVSCGDD ATIRLWTMPK AGMGEMNDPT RKIPVLETSE MYVESNGFRD CDAHWGKKEF
ATAGANVQVW SMERSHALHT FEWGSDTVLS VRYNPVETDI FASCGSDRSI ALYDVRMQTP
LKKIIMQTKS TKLCWNPMEA FNFTVANEDT NLYSYDMRKL DIATCVHKDF VSAVMDIDYS
PTGREFVAGS YDRTVRMFDY NAGHSKDCYH TKRMQRVFCT RFSMDGSYVF SASDDMNVRC
WKADASAQMG TLSAREKRKH AYNASLKDRF KHMPEIRRIA NHHHVPKAIH KQTKLRRTMQ
EAETRKAKRR VAHAAPGAEK KEFKPARKKK ILAEVE