Gene OSTLU_29431 details


Gene Information

Locus tag: OSTLU_29431
Symbol:
ID: 5006744
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009374
Strand:
Start bp: 220881
End bp: 222371
Gene Length: 1491 bp
Protein Length: 496 aa
Translation table:
GC content: 61%
IMG OID: 640422165
Product: predicted protein
Protein accession: XP_001422687
Protein GI: 145356952
COG category:
COG ID:
TIGRFAM ID:
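
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_length: int, has_stop_codon: bool = True) -> int:
    """Protein length implied by an unspliced CDS of the given length."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(220881, 222371)
print(length)                              # 1491
print(expected_protein_length_aa(length))  # 496
```

Both values agree with the Gene Length and Protein Length fields of this record.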


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value: 0.386445
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.32597
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGGGCG CGTCGATCGG GAATAAGTTG TTTGACGTCG TGAGCGTTAG GCACGCGGCG 
AGGGATCACG AGCGGGAGGC GACGTCGGGG TTGAGCGGCG TGCGGCGATT GACGGCGGAG
AAGATGGAAT TGGACATGGA GCGATATCCA AAGGCGAGGT GCTTGGATGG GACTCCTGGG
GCGTATTACG TCAATCTCGC GCCCATACGC GTGCGAAACG TAGACGACGA GTATCCGTCG
GCGAAGCGGG GGAGCGTCGC GCGCGCGGGA GATTCGAGCG GGGGATCGGG AAGCGCGCGC
GAGTTCGCGA CGTCTAAGAC GTGGGTCGTG ATGTTGCAAG GCGGTGGCGA GTGCACGAAC
GCCCCAGAGT GCTCCGAACG CTCTGGGACG GAAAGAGGAT CGAGTGAACT GTTGCCGGAC
GAGATCGTGT TTGATCGAGG CATCCAGGCG GTGACGGCGG ACGACGACGG CGAAGATTTG
CCGTTTTCGC GAGCCAACAT GGTCACCGTG GGGTATTGCT CGGGTGATGT GTACATGGGG
CGATCTGACG AGGCTGATGC GAGTGGGATG TGGCACTCGG GCGCACACAT CGTCGAGGCT
GTTTTACAAG AGCTCGTCCG GGCGTACAAC ATAGAGGACG CGGACGTCAT CGTCTTGGCG
GGCCGAAGCG CGGGGGGGAT CGGTTTGATC GCGCAAGTGG ACCAGTGGGC GGAACTACTT
CGCACAAAGT TCAGCGCCAT AGCGCGGAGC ACGGTGAAAA TCGTCGGTGC GCCGTTTGCT
GGGTTTCATT ACTTTCATAA CGATACGGAG GGCGCCGCCG ATGATTCGCT CAAGTACGTA
CCGTGGGACG AGGCTTCGTT CAAGCAGTAC GTAGACTATT GGCACGCGAG CGAGAGCCTT
CCCAAGGCGT GCGTCGAGGT GAATCAGGAC GCACCGTGGA GATGTATGGT GGCGGACTAT
TCCTTCCCTC ACACGCGAAC GCCCTTATTC TTTTCGCAAG CGCTTCTAGA TTCCGTCGTA
ATGCGGTTGC ACGACAATTT TGGCGGCGAC TTTACGCGAC ACAAGCAAGT CACGTTCGCG
CACGAATGGC AGTCGCAGAT GCGTCGCGTT CTCGAACCTG CGATGTCACA CGCCACCGCC
GGCGTGTTCG CGCCGTCGTG CTACATGCAC ACCGATTTCG ATGGCATCGT CATCGACGGT
ATCTCCCATC ACAGGGCGCT CGCCGAGTGG GTGTTCGAGA ACAAACCGAT CCGTCTCATC
GACGATTGCC GGGAACTGAT GTGCAACCCG ACGTGCAGAT CGCGCGATAA GTCGAGCACG
CTCTCCAACG ATTTAGACGA CGGCGCGCTC GGACACGCGT TCGATCGCAA GCGCCGGAAG
GACGAAGACG AGCTCTCCGC CGAAAAAGTC GCCGCCGAGC GCAAGACGGA CGACGCGCGC
GCGCGACGCA AAAGCAACCG CCGTCGCGCT CGGCACCGTC CATCGGATTG A
 
Protein sequence
MAGASIGNKL FDVVSVRHAA RDHEREATSG LSGVRRLTAE KMELDMERYP KARCLDGTPG 
AYYVNLAPIR VRNVDDEYPS AKRGSVARAG DSSGGSGSAR EFATSKTWVV MLQGGGECTN
APECSERSGT ERGSSELLPD EIVFDRGIQA VTADDDGEDL PFSRANMVTV GYCSGDVYMG
RSDEADASGM WHSGAHIVEA VLQELVRAYN IEDADVIVLA GRSAGGIGLI AQVDQWAELL
RTKFSAIARS TVKIVGAPFA GFHYFHNDTE GAADDSLKYV PWDEASFKQY VDYWHASESL
PKACVEVNQD APWRCMVADY SFPHTRTPLF FSQALLDSVV MRLHDNFGGD FTRHKQVTFA
HEWQSQMRRV LEPAMSHATA GVFAPSCYMH TDFDGIVIDG ISHHRALAEW VFENKPIRLI
DDCRELMCNP TCRSRDKSST LSNDLDDGAL GHAFDRKRRK DEDELSAEKV AAERKTDDAR
ARRKSNRRRA RHRPSD
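
The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such a listing could be cleaned and translated follows. The Translation table field of this record is blank, so the standard genetic code (NCBI table 1) is an assumption here, and the helper names are hypothetical.

```python
BASES = "TCAG"
# Standard genetic code (NCBI translation table 1), codons ordered TTT, TTC, TTA, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((a, b, c) for a in BASES for b in BASES for c in BASES), AMINO_ACIDS
    )
}


def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    dna = clean_sequence(dna)
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean_sequence(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence listed above.
print(translate("ATGGCGGGCG CGTCGATCGG GAATAAGTTG"))  # MAGASIGNKL
```

The translated fragment matches the first ten residues of the listed protein sequence, which is consistent with the standard-code assumption for this gene.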