Gene OSTLU_46552 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_46552 
Symbol 
ID5003826 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009363 
Strand
Start bp491391 
End bp493407 
Gene Length2017 bp 
Protein Length525 aa 
Translation table 
GC content60% 
IMG OID640419247 
Productpredicted protein 
Protein accessionXP_001419614 
Protein GI145350442 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG5064] Karyopherin (importin) alpha 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.105236 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.308215 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTCTTC GCCCGGGGTC CAAGGCGAGC GAACGGAAGA AGGGCTTCAA GAAAGCCATC 
GACGCCGATG AGGCGCGACG GAAGCGCGAA GGTGCGCGGG GTGACGCGCG CGAGCGATGA
CGCGCGCGGA CGGGCGCGGA GACGGCGCGC GGGCGCGAAG CGAGGGAGAT TAAGTCGCGG
TGGGATGATC TGGCGGCACG AAAACGGTGG ACGCGCGCGC GCGCAGCGCG CGCGCGAGGC
GACGGCGACG CGGAGACGAC GAAGGACTGA CGATGCACGC GCGCGCGCGG TGGTCGGCGA
ACGCAGATAA CATGATTCAG ATTCGTAAGG ATAAGCGCGA GGAGGCGATG ATGAAGAAGC
GAAGGTGCGC GCGAGACGAC GACGCGCGCG AGGCGGAAAA CGCGAACGCG CGGTGATTCA
TCGGACGGTG ACTGACGAAG TGGGTTTATG ATGACGCAGG GACGGTGCGA CTGGAAGCGT
GGCGTCGGAT TCGACGGCGA TGACGGGTTC GCCGGGCGGG GGATCGGTGC AGAGCAAGCT
CGCGCAGTTG CCGCAAATGC TTGAAGCGCT GAAGAACCCG GATCCGAACG TGCAACTCGA
GGCAACGATT GCGTTTCGTA AACTTCTTTC CATCGAGCGA TCGCCGCCGA TCGATCAGGT
GATTGAGACC GGCGCGACGC CGTTTTTCGT CGAGTTTTTA AAGCGTACCG ATGTTCCAAA
GTTGCAGTTT GAAGCCGCGT GGGCGCTGAC GAACATCGCC TCTGGTACGA GCGAGCACAC
GGCGATCGTG ATCGATCATG GGGCTGTGCC CATCTTTATC GCGCTGTTGG GTTCAGACAA
CCCCGACGTG CGCGAGCAAG CGGTTTGGGC GCTCGGGAAC ATCGCGGGCG ACAGTCCGCG
GTGCCGCGAC TTGGTGTTGC ACGCCAACGC GTTGCATCCG CTCCTCGCGC AACTCAACGC
CGAAGCGAAG ATCCAGATGC TTCGTAACGC GACGTGGACT TTGTCGAACT TTTGCCGCGG
TAAACCTCAG CCTGACTTTA GCGCGTTGCG AGCGGCGCTT CCGGCGCTCG CTCGCTTGGT
GCACTCGAAC GACGAGGAGG TGCTCACTGA TGCGTGCTGG GCGCTCTCGT ACCTTAGCGA
CGGTACGAAC GACAAGATCC AGGCCGTTAT CGAGGCTGGA GTGTGCCGAC GTCTCGTGGA
GCTCTTGGCG AGCAACCATC CGAGCGTGTT GATTCCGGCG CTTCGAACGG TTGGTAACAT
CGTGACCGGC GACGACTATC AGACTCAAAT CATCATCAAC TGCCACGCTC TGAAGGCGTT
GCTCGGATTA TTGGCGGGAG ACTACAAGAA GAGCATTAAA AAGGAAGCGT GCTGGACGAT
CTCGAACATC ACCGCCGGTA ACAAGGACCA AATCCAAAGC ATCATTGACG AGCAAATGGT
GCCGCCGTTG GTCGAGTTGC TCGCCAACGC CGAATTCGAT ATTAAGAAGG AGGCGGCCTG
GGCGATTTCT AACGCCACGA GCGGTGGTAC GCATCAACAG ATCAAGTATC TCGTCAGCCA
AGGGTGCATC AAGCCGCTGT GTGATCTCAT CAACTGCAGC GACGCGCGCA TCGTCACCGT
CGCGCTCGAG GGGTTGGAGA ACATTTTAAA AGTCGGCGAG GCTGATCGCG GCGACAACAT
GGAAGCCCCG AACGTTTTCG CGCAATACAT CGACGAAGCC GAGGGGTTGG AGAAGATCGA
ATCGTTACAG AATCACACCA ACGACGACAT TTACCAAAAG GCGATGCGTC TTTTGGAGAC
GTATTTCGGT TTAGAGGACG ACGACGCGCA AAACCTCATG CCCGAGGTCC AGGGCGACCA
GTTCGCCTTC GGCGCCGGCG CGCCCACGGG CGGGTTCAAC TTTTAGACTT TTTTCGCGCG
ATGGATGAAA CGACTTTTTT CGCGCGATGG ATGAAACGAC GACGACGACG ACGACGACGA
CGACGACGAC GATCACTAGC TCTTGAAATC AACGCAA
 
Protein sequence
MSLRPGSKAS ERKKGFKKAI DADEARRKRE DNMIQIRKDK REEAMMKKRS VASDSTAMTG 
SPGGGSVQSK LAQLPQMLEA LKNPDPNVQL EATIAFRKLL SIERSPPIDQ VIETGATPFF
VEFLKRTDVP KLQFEAAWAL TNIASGTSEH TAIVIDHGAV PIFIALLGSD NPDVREQAVW
ALGNIAGDSP RCRDLVLHAN ALHPLLAQLN AEAKIQMLRN ATWTLSNFCR GKPQPDFSAL
RAALPALARL VHSNDEEVLT DACWALSYLS DGTNDKIQAV IEAGVCRRLV ELLASNHPSV
LIPALRTVGN IVTGDDYQTQ IIINCHALKA LLGLLAGDYK KSIKKEACWT ISNITAGNKD
QIQSIIDEQM VPPLVELLAN AEFDIKKEAA WAISNATSGG THQQIKYLVS QGCIKPLCDL
INCSDARIVT VALEGLENIL KVGEADRGDN MEAPNVFAQY IDEAEGLEKI ESLQNHTNDD
IYQKAMRLLE TYFGLEDDDA QNLMPEVQGD QFAFGAGAPT GGFNF