Gene OSTLU_37471 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_37471 
Symbol 
ID5001303 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009358 
Strand
Start bp467307 
End bp468512 
Gene Length1206 bp 
Protein Length401 aa 
Translation table 
GC content56% 
IMG OID640416724 
Productpredicted protein 
Protein accessionXP_001417253 
Protein GI145345515 
COG category[R] General function prediction only 
COG ID[COG1161] Predicted GTPases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value0.0357188 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TGTTACGGGT GCGGCGTCGC GCTGCAGACG AAGGATGAAT CGATCGCGGG GTACGTCGAT 
GCGGCGGAGT ACGCGACGAA GGCGGTGCAC AGACATTACG ACATGATGCT CTGCGCGCGG
TGCGCGGCGC TGAGCAACGG GAAATTCGTA AACGCGGTCG AGGGGCAGGG CGGGTTGAAA
GCGGCGCCCG GATTGATCAC GCCGAAGCAG CTCAGAGATC AACTGAAACC GATCCGGGAG
AAAAAGGCGC TGGTGGTGAA GGTGGTGGAT GTGACAGATT TTCATGGGAG CTTTCTGAAA
AAGGTGAGAG ACGTCGTCGG CGGGAATCCG ATTCTCCTCG TAGTGACAAA GGTTGATTTG
TTAGATTCGA AAACGGACCT CGATGCACTC GTGGAGTGGG TCGGGCGCGA AGCCGAGACG
CGACGGCTTT CACTGGCGGG AATCGCGCTC GTGAGTTCTA GGAAAGGATC TGGGATGCGC
GACGCCGTAC TACAGATGAT GCGCGAGCGA AACGGTCGCG ATGTCTACGT CCTCGGCGCC
GCGAATGTTG GCAAAAGCTC ATTCATTCGG GCCGCGATGG ATGAGCTGCG ATCGGCTGGT
AATTATTTCG CACCTTCTAA GCGACTTCCC GTGGCGAGTG CGATGCCAGG AACGACGCTC
GGAGTGATAC CGTTGAAGGC GTTTGAGGGT AAAGGCATAT TGTTCGACAC ACCTGGTTTG
TTCTTACATC ACAGACTGAA CTCTTTGCTC GGGCCTGATG ATCTTTCGAC GATGAAACTC
GGCGCGTCAT TGAAAAAGTT CGTGCCAAAG ACGCCTGAAT GCGCCGAGCC GCCTGGGTTT
GATTCTTTTC AAGGGTACTC GTTGTGTTGG GGTTCGTTCG TGCGCGTGGA CGTCGTGCGG
TGTCCACCGA ACGTAGCTTT TTCGTTCTAC GGACCCAAAT CGCAGCGTGT GGATATCATC
AAGACCTCGG ACGTTCCACC GACGACGCCT GGACAAGAAG AAGCGGCATT GCGCGTGGTG
AATGAGATCG ACTTTGTACC GCCGACGAAC GTAGTCGGCC CTTTGGTCGA TCTTTCGGTG
TCTGGTCTAG GGGGCTGGAT TCGCGTCGAA AAAACGGATA GTAGAGGCGA TGGTGCCATT
CTGGCTCGTG TGTATGGTGT TCGTGGCTTA GAGGTTTTCG CTCGCGACGT CATGCCGACG
CCTTGA
 
Protein sequence
CYGCGVALQT KDESIAGYVD AAEYATKAVH RHYDMMLCAR CAALSNGKFV NAVEGQGGLK 
AAPGLITPKQ LRDQLKPIRE KKALVVKVVD VTDFHGSFLK KVRDVVGGNP ILLVVTKVDL
LDSKTDLDAL VEWVGREAET RRLSLAGIAL VSSRKGSGMR DAVLQMMRER NGRDVYVLGA
ANVGKSSFIR AAMDELRSAG NYFAPSKRLP VASAMPGTTL GVIPLKAFEG KGILFDTPGL
FLHHRLNSLL GPDDLSTMKL GASLKKFVPK TPECAEPPGF DSFQGYSLCW GSFVRVDVVR
CPPNVAFSFY GPKSQRVDII KTSDVPPTTP GQEEAALRVV NEIDFVPPTN VVGPLVDLSV
SGLGGWIRVE KTDSRGDGAI LARVYGVRGL EVFARDVMPT P