Gene OSTLU_359 details


Gene Information

Locus tag: OSTLU_359
Symbol:
ID: 5005444
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009368
Strand:
Start bp: 686103
End bp: 688040
Gene Length: 1938 bp
Protein Length: 646 aa
Translation table:
GC content: 57%
IMG OID: 640420865
Product: predicted protein
Protein accession: XP_001421534
Protein GI: 145354526
COG category: [R] General function prediction only
COG ID: [COG3596] Predicted GTPase
TIGRFAM ID: [TIGR00993] chloroplast protein import component Toc86/159, G and M domains
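
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, which the record does not state, and the variable names are made up for this example.

```python
# Values copied from the Gene Information fields above.
start_bp = 686103
end_bp = 688040
gene_length_bp = 1938
protein_length_aa = 646

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Number of complete codons in the gene.
codons, leftover_bases = divmod(gene_length_bp, 3)

print(span_bp == gene_length_bp)    # True
print(leftover_bases == 0)          # True
print(codons == protein_length_aa)  # True
```

Under that assumption the span equals the listed gene length, and the 646 codons account for all 646 residues, which suggests the listed gene sequence does not include a stop codon.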


Plasmid Coverage information

Num covering plasmid clones: 49
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 42
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
CCAGATCCGA GCGACGAAGA TGCGACGAGG ACGTACGAGC TGCAAATGCT TCGCATCAAG 
CTCTTGCGCT TGGCTTCGAG ATTGGAACAA AGCCCGAGAA ATACGGTGGT GGCGCAGGTG
ATCTACCGTC TGGAACTCGC AGAACAGCTC AAGGCTGGGA AAGGGACGCA GAAAGATCCG
TCCAATTCGT CCTTTGATCG CGCCGTGGCG CTCGCCGAGC AAGCCGAGAA AGAGGGCTCC
GACGCGGATT TGGATTTTAC GTGCACCATC TTGCTTTTGG GTAAGAGTGG CGTGGGTAAG
TCCGCGGTGA TCAACTCTCT GTTGGGCGAA GGTTCGGCGC CGTCTGGTAC GGACGATGAG
GACGCGACGA AGAAGGTGCA ACTGATTGAG AAGAAGATTC ACGGCATGAC GCTTCGACTC
ATCGATACGC CTGGTTTGCA AGCGTCTGCG ACGGACATTC GTTACAACTC CACCATCATG
AACGATGCGA AGAAGTTTAC CAAGCAACAC AAGCCCGACA TCGTGCTTTA CTTCGATCGT
CTCGACATTC CGTCGCGATC GGACGCGGCG GATTTGCCGT TGTTGAAGCA AATCACGAAC
ACCTTTGGCC AAGCGATTTG GTTCAACGCC ATCGTCGTCT TGACGCACGC CGCCGCCGCG
CCGCCGGATG GCGCAAATGG CCAGCCGATT TCTTACGAAA TGTACGTCGC TCAGCGTTCG
CACATCGTGC AGCAAACGAT TCGCCAAGCC GCGGGCGACA TGCGTCTCAT GAACCCAGTG
GCGCTCGCGG AGAACCACCC GCTTTGCCGC ACCAACCGTG CGGGCGAGCG AGTGCTTCCG
AACGGACAAG TTTGGAAGCC GCAGTTGTTG TTGTTGTGCT TCGCGTCCAA GATCCTCACG
GAGGCGAATA CGTTGTTGAA CTTGGCCGCC GACCAACAAA AGGCTGCCAA GGCGGCGCGC
GCGGGTGGCA TGCCGGGGCA ACAAAAGGTG CCCCCGCTTC CGTTCTTGTT ATCTTCACTC
ATCACCACTC GAAAGCCTCG TCGTTTGGTG GAGTATGAAG ACGATGGATT CGAAGATTTG
GAGAACGAAA TCATCTCTGG CGAGCCGTCC CCGTACGACA TTCCCGCGGA TCAGATGGAG
CCGACGCCGA CGCCAAAGCA AGTCTCCATT CCGGCGCCCG ATCCTCAATT GCCCTTGTCT
TTCGATGGCG ACACGCAAGG TCACCATTAC CGGCAACTTG AGTCGAACCA ACAGTGGTCG
TGCCGACCGA TCGTGGACGC GCACGGCTGG GATCACGAGA CTGGCGTGGA GGGCTTCTCC
GTCGAACATC AGTTTGTTCT CAAGGACCAA GTCCCAGGTG TGGTTCAAGC GCAAATTTCC
AAGGACAAAA AGGACAGCAA CTTCGGTTTT GAAGGTGAAA TGTCTGTCCC GCACTCGCGA
ACTCTGATTT CGACGACGGG CGTCGACATT CAAACCGTGG GCAAGGATTT GGTGTACACG
GCGCGAGGGG AGACGAGGTG GAAGTTTTGC GCCGTCGACA AGATCATCGG TGGTCTTTCC
GCCTCTTTTG TCGGTGGTGT GGTGGCTCTC GGTACAAAGA TCGAGAACCG ATTCAAGGCT
CGTCCTGGAA TGAAGGTTGT CGTTAGCACG GGCGCTGTCA CGGCGCAAAA GGATGTTGCG
TACGCAGGTA ACCTCGAGAC GATTATCCGT CACAGCGAGG ACCCGTCGAA CCCGAACTCA
TCCACGCTCA GCGCGAGTTT CATGAACTGG CGCGGCGACC TCGCCCTCGG GTGCAATGGT
ATGAGTTCTA TCCAAGTCGG CAAGGACACG CAAGTCACCA GTAGTTTCAA CATCAACTCG
CGCGGCACGG GGAAAATCTC CGTCCGAGCG ACGACGAATC AACGCATGTC TCTCGGCAGC
GTCGGCTTGA TTCCGATT
 
Protein sequence
PDPSDEDATR TYELQMLRIK LLRLASRLEQ SPRNTVVAQV IYRLELAEQL KAGKGTQKDP 
SNSSFDRAVA LAEQAEKEGS DADLDFTCTI LLLGKSGVGK SAVINSLLGE GSAPSGTDDE
DATKKVQLIE KKIHGMTLRL IDTPGLQASA TDIRYNSTIM NDAKKFTKQH KPDIVLYFDR
LDIPSRSDAA DLPLLKQITN TFGQAIWFNA IVVLTHAAAA PPDGANGQPI SYEMYVAQRS
HIVQQTIRQA AGDMRLMNPV ALAENHPLCR TNRAGERVLP NGQVWKPQLL LLCFASKILT
EANTLLNLAA DQQKAAKAAR AGGMPGQQKV PPLPFLLSSL ITTRKPRRLV EYEDDGFEDL
ENEIISGEPS PYDIPADQME PTPTPKQVSI PAPDPQLPLS FDGDTQGHHY RQLESNQQWS
CRPIVDAHGW DHETGVEGFS VEHQFVLKDQ VPGVVQAQIS KDKKDSNFGF EGEMSVPHSR
TLISTTGVDI QTVGKDLVYT ARGETRWKFC AVDKIIGGLS ASFVGGVVAL GTKIENRFKA
RPGMKVVVST GAVTAQKDVA YAGNLETIIR HSEDPSNPNS STLSASFMNW RGDLALGCNG
MSSIQVGKDT QVTSSFNINS RGTGKISVRA TTNQRMSLGS VGLIPI
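
The relationship between the two sequences above can be illustrated by translating the start of the gene sequence and comparing it with the start of the protein sequence. This is a minimal sketch, not the database's own method: it assumes the standard genetic code (the record's translation table field is empty) and that the gene sequence is shown in coding orientation starting in frame 1. The helper names `strip_groups`, `translate`, and `gc_fraction` are hypothetical.

```python
# First 60 bases of the gene sequence, copied as displayed (groups of 10).
gene_excerpt = "CCAGATCCGA GCGACGAAGA TGCGACGAGG ACGTACGAGC TGCAAATGCT TCGCATCAAG"
# First 20 residues of the protein sequence, copied as displayed.
protein_excerpt = "PDPSDEDATR TYELQMLRIK"

# Standard genetic code in TCAG order (an assumption; the record does not
# state which translation table applies).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def strip_groups(text):
    """Remove the spaces and line breaks used for display grouping."""
    return "".join(text.split())


def translate(dna):
    """Translate complete codons in frame 1; a trailing partial codon is ignored."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


dna = strip_groups(gene_excerpt)
peptide = translate(dna)
print(peptide)                                   # PDPSDEDATRTYELQMLRIK
print(peptide == strip_groups(protein_excerpt))  # True
print(round(gc_fraction(dna), 2))                # GC fraction of the excerpt only
```

Running the same functions on the full gene sequence would allow a comparison with the complete protein sequence and with the GC content of 57% reported in the record; the excerpt's GC fraction need not match the whole-gene value.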