Gene OSTLU_25624 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_25624 
Symbol 
ID5005767 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009369 
Strand
Start bp414920 
End bp417989 
Gene Length3070 bp 
Protein Length743 aa 
Translation table 
GC content53% 
IMG OID640421188 
Productpredicted protein 
Protein accessionXP_001421657 
Protein GI145354786 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG5656] Importin, protein involved in nuclear import 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.925958 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.511141 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCACGC GGCAGAGCGC GAGCATTTAC TTCAAGCATC TCGTCAATAA GTCGTGGACG 
CAGCGCGAAG GCGCGACGGC GACGACGGAG ACGAATCCGA TCTTGGACGA GGGCGACAAA
GCGGCGGTTC GACGCGTCGC GCTGGAGGCG ATCGCGAACA CGCCGAGCAA GGTGCGAAGT
CAGCTCGTGG AGGCGGTGCG AGTGATCGTT CATCATGATT TCCCCGGGCG TTGGCCGGAG
GTGGCGAATC AGGTGCTGGA TGGGTTGAAC GCCGCGTCGT CGAGCGAGAG CGGAAAGCTG
TGCGGGACGG TGTTGGTGTT GCACGCGCTG TGTCGAAAGT ATGAGTTCAA GGCGGTGGAT
GAGCGAGCAG ACATCGAAGA GATGATACGC GTGGTGTTTC CAAAGTTGCT GGAGATTTTG
AAGGCGTTGC TTGCGTATCA AGGCCCGCCG GACACGGAAT TGGAGGAGTT GAAGAAGGCG
ATTTGCAAGA CGTACTTGAG CGCAACGTAT CTTAACGTCG GACCGTCTTT GCGCGAGGAA
GGGACGTTTC GGGAATGGAT GGCGGCGTTT CACGCGATCA TCACCGCGCC AGTGCCGACA
GAGAATATGC CGACGGACGA TAAGACTGAG TTGAAACATT GGCCGTGGTG GAAGACGAAG
AAGTGGGCGA TGCACGTCGT CAATCGGATG TTCAATAGAT ACGGCAACTT GAAAAAGTGC
CAACCTCACG ACAAGGCTCA AGCGACGGTG TATCGTGACA AATACGCTGG ACACTTTGTG
ACAGTATACA TCCAATTACT GAGCTCCCTC GCCACGGGTG CGGTGATGCC CGACCGAGTC
GTCAATCTCG CCGTGCACCA TCTCTCCACG GCGTTGGGGG TTCCGACGAT GTACAAGCAC
ATGGAGCCGC ACCTCGATGC AATCTTCCAG CAAATCGTAT TTCCAATGCT ATGTTTCAGC
GCTGAGGACG ACGAGCTGTG GAAGGATGAT CCCCAAGAGT ACGTTCGAAA ATCACAAGAT
CTCATAGAAG ACATGTATTC GCCACGAACC GCGGCGTGCA GTTACACACA AGAATTGGTG
ATTACCGGCA GGCGTCTGAA GGAAAACTTG CCAAAGGTGT TAGGCGCGAT GGTTCAAATA
TTCACCAAAA ACTCTTCCAG CGTCAGATCC GGGCCGATGG ACGCTAGAGC GCGATACGAA
CTCGATGGCG CGCTGCTCGT CATCACCACC CTGTCGCAAC TTTTATCCAC GCACCCGGAT
TACGCAAAGG AAATCGAAGG TATGCTCATG ACGCATGTTG TTCCGGCATT TGGTTGTGTA
CATGGTCATA TTCGCGCCAA GGCGGTATCG TGCGTATCAA AGTATTCGGA TATCACGTTC
CGAGACCAGA ATAACTTTAT GCAGCTGTTT TCGAGCGTCG TAAATGCGAT GAAGGATCCC
GAAATTCCGG TGAGGTTCGA GGCAGTCGTT GGGCTCGGAG CTTTTGTGCA AGCCACAGAC
GACGTGAGCG CGCTGAAGGG TATTCTACCA CAGTTGTTAG ACGAGTTTTT CAAGCTCATG
AACGAAGTGG AGTCGGAAGA TGTTGTGTAC ACGCTTGAAA CAATCACCGA AAAGTTTGGC
GAAGACATCG CGCCCTTCGC TTTGGGCATG ACGCAGAACC TCGCCGCAGC GTTCTGGAAG
GTTGTGCAAG AAGCTGAAGG AAAGGATGAC GATGAGTACG GCATGATGGC ATGCATGGGA
TGTCTGCGCG CCATGTCGAC GATTCTCGAA TCCGTTTCTA GTTTACCGCA CATGTACCCC
GAACTTGAAG CCGCCGTGTT CCCAATTTTG CATAAAATGA TTAGCGAAGA AGGATACGAC
GTGTTTGAGG AAGTTTTGGA GATATTGTCC TACCTAACGT ATTTCACTCC GGTCGTGACA
CCACGCATGT GGGAACTCTG GCCGCTGATG ATGCGCATGA TGGATGACTG GGCGCTGCAA
TATTTCGAGA ATATGCTCAT TCCGCTCGAC AACTACATCA GTCGAGGCAC GGAGCACTTT
CTCACCCCTG GCTCGAGTTA CGTGGAAGAT ACGTATAAGC TGTGCGAAAA GGTATTAGGA
GGCGATTATC CCGAGCCCGA TTGCTTGCCA GCGCCAAAGT TGATGGAGTG CGTGATGACG
AATTGCCGCG GCCGCGTCGA CGTCGTCATC GAACCGTACG TGAACATTGC ACTTGCGTGT
CTCGCGACTG CGGAGTTGCC ACACTTTAGG GATTGGCTCA TGATGACTTA CGCGCATGCA
CTTCATTATA ACGCATCGCT TGCGCTGGCA GCGACGAACC GAACTGGCAA AACGAACGAA
GTGTTCGCGT TGTGGTCGAA CATGCTCGCC GAGCGCAGGA AGAGTGGAGA ACGTAAAAAC
TTCACCTCTG AACACGCCAA GAAAGTTTGC GCGTTGGGAC TCATGGCGCT TTTGCAAGCG
CCGGCAGAGT CGTTAACACC TGAGATTCGA GGTGCGCTCG GAGGTATTCT TGACACGCTC
ATCTCTTTAC TCGAAGATTT GCGATTGCAA ATCACGGAGC GCAAGTCGGA TGAAGCGAGC
GGCAAGAGTA GGCATCAATG GAATGGCCTT GGCCTCTTCG ACGGCGAAGA CGAATACGAG
GAACACAACT TTGACGAAGA GGACGATGAT GGAGAAATGC ACTTTGACGC GACGACGCTC
CGCGCGTTGG CGAAGCAAGC GCAAGACGCT GATCCGTACT CTCGCGCCGG CGACGTCGAC
TCAGAAGACG ACGAGCATTT CTTCTTCGAC GACGATGACG ATTCGTGCCA AAGCCCGCTA
GACGACATTG ACACGTTCAT CGTATTTTCG GAGTGCATGA ATCAGTTGCA TCGCACCGGT
AGTCTCAATC CGAGCGCTGA ATCTCATTCA AAATTGCAGG AGTTGATCAA TCACGCCGCG
ATTCGTGCCG AAGAGTTCCC TCGAGAGCGC GCGGAGGCGA AGAAGGAATC ACACGGCGTG
TCTGGTAGTT GAACGAGTAC CAGCATACCG GCGGTACGTT AGACTGATAA TTATGTGAAT
ATACTTAGTA
 
Protein sequence
MGTRQSASIY FKHLVNKSWT QREGATATTE TNPILDEGDK AAVRRVALEA IANTPSKVRS 
QLVEAVRVIV HHDFPGRWPE VANQVLDGLN AASSSESGKL CGTVLVLHAL CRKYEFKAVD
ERADIEEMIR VVFPKLLEIL KALLAYQGPP DTELEELKKA ICKTYLSATY LNVGPSLREE
GTFREWMAAF HAIITAPVPT ENMPTDDKTE LKHWPWWKTK KWAMHVVNRM FNRYGNLKKC
QPHDKAQATV YRDKYAGHFV TVYIQLLSSL ATGAVMPDRV VNLAVHHLST ALGVPTMYKH
MEPHLDAIFQ QIVFPMLCFS AEDDELWKDD PQEYVRKSQD LIEDMYSPRT AACSYTQELV
ITGRRLKENL PKVLGAMVQI FTKNSSSVRS GPMDARARYE LDGALLVITT LSQLLSTHPD
YAKEIEGMLM THVVPAFGCV HGHIRAKAVS CVSKYSDITF RDQNNFMQLF SSVVNAMKDP
EIPVRFEAVV GLGAFVQATD DVSALKGILP QLLDEFFKLM NEVESEDVVY TLETITEKFG
EDIAPFALGM TQNLAAAFWK VVQEAEGKDD DEYGMMACMG CLRAMSTILE SVSSLPHMYP
ELEAAVFPIL HKMISEEGYD VFEEVLEILS YLTYFTPVVT PRMWELWPLM MRMMDDWALQ
YFENMLIPLD NYISRGTEHF LTPGSSYVED TYKLCEKVLG GDYPEPDCLP APKLMECVMT
NCRGRVDVVI EPYVNIALAW IGS