Gene OSTLU_118781 details

Gene Information

Locus tag: OSTLU_118781
Symbol: CIF2
ID: 5000999
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009357
Strand:
Start bp: 346555
End bp: 349787
Gene Length: 3233 bp
Protein Length: 1053 aa
Translation table:
GC content: 61%
IMG OID: 640416420
Product: chloroplast translation initiation factor 2
Protein accession: XP_001416631
Protein GI: 145344213
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0532] Translation initiation factor 2 (IF-2; GTPase)
TIGRFAM ID: [TIGR00231] small GTP-binding protein domain
            [TIGR00487] translation initiation factor IF-2
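
The length fields in this record can be cross-checked against the coordinates and the sequence blocks. The sketch below is a minimal illustration, not part of the record: it assumes the start and end positions are 1-based and inclusive, and the helper names `span_length` and `count_residues` are hypothetical.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def count_residues(blocked_sequence: str) -> int:
    """Count residues in a sequence printed in space-separated blocks.

    All whitespace (spaces and line breaks) is ignored.
    """
    return len("".join(blocked_sequence.split()))


# Coordinates taken from the record above.
gene_length = span_length(346555, 349787)
assert gene_length == 3233  # matches the reported gene length in bp

# The final line of the protein sequence block, as printed in the record.
last_line = "VEEYQAGEAE REAAREKAKE GYQRQQRRAA PSK"
print(gene_length, count_residues(last_line))
```

Applying `count_residues` to the full protein block should reproduce the reported protein length if the record is internally consistent.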


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.106372
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGTCCA TCGCCGTCAG CGACGCGCAC CGCACTTTCG TCCTCCCCCC CCCCGTGGCG 
ACGTCATCGA GCGTCACCGA CCCACCGCTC GTCTCCGACG TCGCCTCGTT CTCGTCCTCC
CCCGCCGTCG TCGCGTCGCC TTCGATCGCC CTCGCCTTCG CCTCCCACTT CCCACCCCCG
TCCTCGCTCG GTTTCGCCCC GATGTGCGTC ATCGCGTCTC GTTTCAAATC ATCCCTTCGC
GTCGACGCGT ACCGCATCCT CGTCCGTCGC GCCGTCGTCG CGGTGTGGCC GTCCAGCCGT
CGCCCGTCCG TCCGTCTCGT CCCGTCTCGG GACGCGAATC GCCGGATCGG CGAATCGCCG
GATCGCGCGA TCGGCGGATC GGCGGATCGG CGATCGCGCG ATCGATCGTC CGTCGATCGC
GCGATCGCGT CGATCGATCG ATCGTCGATC GACGTCCGTC GACGGACGTC GATCGTCGAT
CGTCCCGCCC GGCGCGTCGA TCGTCCCGCC CGCACAATGA CGTCGTCCGC GTCGGCGATG
TTTGCGCACG CGACGACGGG CGCGCGCGGC GCGTCGACGG TGCGCGCGCG CGCGAGGACG
TCGGTGACGA CGACGGGCGC GCGCGCGACG ATCGCGCGCG GCGCCGGGGC GCGGCGAAGC
GCGCGCGCGA CGGCCGAGGA CGGGCGCCGC GATGGAGGGT TTCGTCTCGG GACGGCGACG
ACGGACGGAT GGGCGCGAGC GCGCGCGATG CGCGGCGAGG GACGAGAGGA CAGAGCGGTG
ATGACGCGCG CGGTGGCGGA CGCGGAGGCG GCGCGGGAGG ACGACGATAA AAAGTCGCGC
GCGAAGCTCG TGCGAGGGGC GGACGGGAAG TTTTACCGCG AGGGCTCGGA GGGCGGCGGT
CGCGGCGGCC GCGGCGGTCG CGGCGGTCGC GGCGGTGGAC GCGGTGGTGG ACGCGAAGGT
GGACGAGGTG GATTCGGGCG CGGTGCGCGC GGTGGTGCAG GCGGCCGCGG CGATGGGGGT
CAACGAAACT TTAGAAACGC GGCCAAGCCG AGCGGCGGCG GTCGCGCGGG TCGCGGCGGC
CGCGGTGGAC GACAAGACTT TCGATTCCAA GACGGTCGCA AACCGGGACG CGGCGGTAGA
CCAAAGATGA ATATGAGTGG TGCGAGTGGT AGCGATCAAG TGAAACAGCG ACGAGGAAGC
AAAGCGGCAA AGAGCGCGCA GCGCAAAAAG GCGCTCGAGG AAAACAGAGC CGTGGCTGTG
GAGATTCTCG AAGTTCCCAC GGATGGCATG GCGATCGAAG ATCTCACCGA ACTTCTCGCC
ACGACGCAAG CGCAAATTAT CAAAACGCTT TTCATGAAGG GTATCGCGGT TCAAATGGGC
CAGTTGCTCG ATAAGGAAGC TGTGATTGCC GTGGCTGAGG ATATGGAAGT GGAATGGATA
GACGAGGCTG AACAGGGCGT CGCGACGGCG GCAAAGAAGG TGACGCAGTT CTTGAGCGAA
GACGACTTCG ACTACTTGGT ACCGCGAGCA CCCGTAGTTA CCATCATGGG TCACGTCGAT
CACGGTAAAA CATCGCTGTT GGATTACATT CACAAGTCCA AGGTTGCCGC GGGCGAGTCC
GGCGGCATCA CGCAAGGCAT CGGCGCGTAC CAAGTCAGCA CCATGGTTGG TGATGAAGAG
AAGGATATTA CCTTCCTCGA CACTCCGGGT CACGAAGCGT TCAGTGCGAT GCGTGCTCGT
GGTGCGCGAG TGACGGATAT TGCCATAATT ATCGTCGCGG CCGACGACGG CGTTCGACCG
CAAACGGAAG AGGCTGTATC TCACGCCCGA GCCGCCGACG TTCCGATCAT CGTCGCCGTC
AATAAAATTG ACAAAGAAGG CGCGAACGTC GACCGTGTGC GTGATGAACT ATCGCGCATC
GGTATCATCA GCGAAGAGTG GGGCGGGGAC GTTCCGTTCC TGCCCATCAG CGCCAAGTCT
GGCGAAGGTA TCGACGAGTT GCTCGAAACT ATTTCTCTCA CCGCCGAGCT CGCCGAGCTC
GTCGCGAATC CCGATCGCGA AGCTCAAGGT ACGGTCATCG AGGCATTCCT CGACAAACAA
CGTGGTCCGA TGGCGACTGT TCTCGTCCAA GCCGGCACTT TGCGCATCGG CGATGCGATT
CAAATTGGTG GCGCGTATGG CAAAGTGCGC GCAATGGACG ACGCAGATGG CACCAAAGTC
GAAGAGGCTG GACCATCTAT GCCTATTCAA ATCATGGGCC TCAACGGCGT TCCAGCCGCG
GGTGAAGAAT TTACCGTCTA CGCGTCCGAT ACCGCGGCTC GTGACAAGGC TGAGCAAGTC
CAAAATGAGA TTCGCAACAA CCGCTTGATT GAGGGCAACG TTGTTTCGCT GAGCAACTTG
CCGGGCGATG AGGATGGATT ACAAAAGATC AATCTCATCG TCAAAACAGA CGTTAGTGGT
TCTTGCGAAG CAGTCAAGGC TGCTCTCAGT GCGTTGCCTC AAGACCGCGT CCAGCTTCGT
TTCCTCATGG CCTCTGCAGG CGAAGTGAGC GAGAGCGACG TCGATTTGGC TTCTGCCTCC
GAAGGGATCA TCTTGGCTTT CAATACGCCT TGCAGCGACC GAATCGGCGA GATTGCGAAA
AAGAGGAAGG TTGAAGTACG TACTTACGAC GTTATCTACG ATCTGGTCGA TGAAGTTCGT
GCGGCAATGG AAGGCATGTT GAGTTCGATC AAGGAAGAAA TACCGTTTGG CAAGGCGACG
TGTAAGGCTG TCTTTGGCGG CGGTAAGGCG AAGGTTGCCG GTTGCGAGGT GACTGACGGT
TACTTCCAGT CCAAAAAGTA CTTGAGAGTC ACTAGAAGGG GCAAAGAAGT GTTCTTCGGT
AAGGTCGGCT CACTTCGTCG CGTCAAGGAT ATCGTCAAGA AGGTTGAGGC TGGCTTAGAA
TGCGGTATCG GCGCCGATCC CGAGTGGGAC GGCTTCAAAG CGGGCGACGA ACTGGAGTGC
TTGGACTTGG TCGATAAGAT TCAAACACTC GAAACTGCGA GTGAAATTTT GGCAGAACGT
GTTGAAGAAT ACCAAGCTGG TGAGGCCGAG CGTGAAGCCG CGCGAGAAAA GGCCAAGGAA
GGTTACCAGC GACAACAACG ACGGGCCGCT CCGTCAAAGT AGAGTGCATG TGCGACATAG
ATAAAATCAA GTTGAACGAA TGTGTATAAC TAAAACCAAC CATTTTTTAC AAC
 
Protein sequence
MMSIAVSDAH RTFVLPPPVA TSSSVTDPPL VSDVASFSSS PAVVASPSIA LAFASHFPPP 
SSLGFAPMCV IASRFKSSLR VDAYRILVRR AVVAVWPSSR RPSVRLVPSR DANRRIGESP
DRAIGGSADR RSRDRSSVDR AIASIDRSSI DVRRRTSIVD RPARRVDRPA RTMTSSASAM
FAHATTGARG ASTVRARART SVTTTGARAT IARGAGARRS ARATAEDGRR DGGFRLGTAT
TDGWARARAM RGEGREDRAV MTRAVADAEA AREDDDKKSR AKLVRGADGK FYREGSEGGG
RGGRGGRGGR GGGRGGGREG GRGGFGRGAR GGAGGRGDGG QRNFRNAAKP SGGGRAGRGG
RGGRQDFRFQ DGRKPGRGGR PKMNMSGASG SDQVKQRRGS KAAKSAQRKK ALEENRAVAV
EILEVPTDGM AIEDLTELLA TTQAQIIKTL FMKGIAVQMG QLLDKEAVIA VAEDMEVEWI
DEAEQGVATA AKKVTQFLSE DDFDYLVPRA PVVTIMGHVD HGKTSLLDYI HKSKVAAGES
GGITQGIGAY QVSTMVGDEE KDITFLDTPG HEAFSAMRAR GARVTDIAII IVAADDGVRP
QTEEAVSHAR AADVPIIVAV NKIDKEGANV DRVRDELSRI GIISEEWGGD VPFLPISAKS
GEGIDELLET ISLTAELAEL VANPDREAQG TVIEAFLDKQ RGPMATVLVQ AGTLRIGDAI
QIGGAYGKVR AMDDADGTKV EEAGPSMPIQ IMGLNGVPAA GEEFTVYASD TAARDKAEQV
QNEIRNNRLI EGNVVSLSNL PGDEDGLQKI NLIVKTDVSG SCEAVKAALS ALPQDRVQLR
FLMASAGEVS ESDVDLASAS EGIILAFNTP CSDRIGEIAK KRKVEVRTYD VIYDLVDEVR
AAMEGMLSSI KEEIPFGKAT CKAVFGGGKA KVAGCEVTDG YFQSKKYLRV TRRGKEVFFG
KVGSLRRVKD IVKKVEAGLE CGIGADPEWD GFKAGDELEC LDLVDKIQTL ETASEILAER
VEEYQAGEAE REAAREKAKE GYQRQQRRAA PSK