Gene OSTLU_16948 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_16948 
Symbol 
ID5004229 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009364 
Strand
Start bp29470 
End bp31608 
Gene Length2139 bp 
Protein Length712 aa 
Translation table 
GC content50% 
IMG OID640419650 
Productpredicted protein 
Protein accessionXP_001419880 
Protein GI145351008 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCGCG TGGTGGCCGT CGGTGCAATT CTCTTCTCCG TGACCCTGCT CTGTTCTCTC 
GTCGCGAGCG TGGGTGTATG TTCCAACGAC CTCACAGGTG CATCCCTTCT CAATGCTTTG
AGCGATATGA GGATAGGTAC AGATACCGAC ATGAGTTTTG TATTCTTTGC CGCACTCGTA
GACGTCGGTC TGCAGTCGTA TTGCGGGAAT ACGCTGCGAA ATTTGGGTAG TGCGTCTGCT
CAGACTCTCA TTCTCGTTCC CGACAGTGCC TTTGAGAGGG CGTCGATAGT CATGGATAAA
GGATTAGAAG AAATAACAAG TGACAGAGTC CTCGTCTGTG AAATTTTGGA GTCATCCATC
GTGCAAGCAG CATTGAACAG CCGCGATGTT CGAGACGGAG ACGAGCTCTC TACCGCGTCA
ATGCGTATGC CCAAGCTTCG CGTAGACGTC GTTCAAGACA TAAACGTTCG CAAATTACGA
ATTTTCACTG ATGTTGGAAC AACTATCGCT GATTTATTCG CGACAGATGT CGCTCTGTGT
CCTCGCTTGA GTGCCGTGTT CGCCTCAGAG CTCTTGATAC CATTAGTCGA GTCAAACGCT
TTTCCCAGTG TGTGGAAGAT ATCAAAATCA TTACCAGAGC TTAGCACGAG TGCTGAAGCA
TTTCGAATCA CTGATTCGTT GCGCGTGATA TTCGAATCGA GCGAATACGA TGGCTACGAT
GGGACTGTGG GGAGGTCAAG TTTGCAGACA TGTGTCTTTG ATGGAACAAA TCATGGATTT
CGCGCTTATT TCTTACCATC GAATCGCGCC TGGAGACGGT TCTTTCGCAA AACTGAGCTC
ACAAAATCTG CTCTTTTCTC AGATTACTCC CTTCTTCTCA GTATCCTGCT TTACACTGAA
GGACAGCTAA CGTCGACACA GCAAAGTATT GACTTGAAGT CGTACTTGTC GACGAGCCTC
GTCCCAGGAC AATTGCTACG CCCCGTCGGG GCGTCTACAA TTTTGAGTGC CTTTCCAGGA
CAAGTGGAGA TTCCAGCTTT GCGAGTGGAC GTGGACATAT CTAAGGTCAG TTCGCGATAC
GTTAGAATTT CTGGTGGGAG CACGAGCAGT CGCTTTGATA ACACCGCCGT CGTCGTCGCC
GCAGATATTG TTTCATGCAC TGGCATCGTC CATATCATCG ATGATGTGCT AGTTCCACCA
ATGTTGACAG CATTTCGTCA GCTCTCATTG CGTAGCGAGC TGTCTATGTT TACAGATCTG
TTACGAGCGC CCGCAATGCA TGAGTTGATG CTCGAACTCG ATACACCATC CGAAACGAAT
ATATTTGCTG ACAGCGTCGT TTTCGCGCCA ACAAATGTAG CGATTCGACT AACTTTGAAT
TACATCGGTT GGACGTTCGA AGATTTGTTC CAGAGGGACG CACTGTTGCG ACAATTTGTC
ACGTACCACG TCATAAGTAC TCGAAGCGCC GAAGGAGCCC CTGAAAGACT CAAATTTCGT
CTAGGACTCG GTCCCGAGGA ACAGCAGTTT GCCACGCGCT TGGCATATCC AATCTTAGCG
AAGAGCTACG CTAACACGTT CCAAGGCACG CCCATCGCTA AGACAGTGAA AGTACTCTCA
ACTCGCCGCG GCGTGAAAAA GTTCTTCACA CCCCGCCATG TGCTTCGAGG ACGTTTAAAC
TCTGCACGGT TCATCAAAAG TGACATTCCA TCGACGAATG GCGGTATCGC TATTATCGAT
GCGGCGCTCA TTCCACCGAC GGCGGAATTC GGAGTGACTC TTTACGATAG AATTCTTCGC
ACACCGACGC TGCGTGTTTT TCGTGAACTT ACAAGCGTTC TCGGCCTCCA GCGCGAGTTT
GACACGCAAG GTTTTGGAAA CGGTGACTGT ACCGTATTTG CACCGACGGA TTCCGCATGG
GTTACCTTAC TTGCCGATCT CATGACGACG CAGGAGGAAA TCGCAAACCG CGCCACAGCG
ATGCTGTACG ATATTATCAT GCTCATGCTC GCCCCTAGAA CGGAAGAATT TCAAGAGATT
GAACGCGAAC TTTATGACCC ATGGCTGTCA AAGGACGCAC TGGATGGTGA CGAGCTCGTG
ACCGCGCTCT CGGGGTATAC GACGGTGAGA GTACTTTGA
 
Protein sequence
MTRVVAVGAI LFSVTLLCSL VASVGVCSND LTGASLLNAL SDMRIGTDTD MSFVFFAALV 
DVGLQSYCGN TLRNLGSASA QTLILVPDSA FERASIVMDK GLEEITSDRV LVCEILESSI
VQAALNSRDV RDGDELSTAS MRMPKLRVDV VQDINVRKLR IFTDVGTTIA DLFATDVALC
PRLSAVFASE LLIPLVESNA FPSVWKISKS LPELSTSAEA FRITDSLRVI FESSEYDGYD
GTVGRSSLQT CVFDGTNHGF RAYFLPSNRA WRRFFRKTEL TKSALFSDYS LLLSILLYTE
GQLTSTQQSI DLKSYLSTSL VPGQLLRPVG ASTILSAFPG QVEIPALRVD VDISKVSSRY
VRISGGSTSS RFDNTAVVVA ADIVSCTGIV HIIDDVLVPP MLTAFRQLSL RSELSMFTDL
LRAPAMHELM LELDTPSETN IFADSVVFAP TNVAIRLTLN YIGWTFEDLF QRDALLRQFV
TYHVISTRSA EGAPERLKFR LGLGPEEQQF ATRLAYPILA KSYANTFQGT PIAKTVKVLS
TRRGVKKFFT PRHVLRGRLN SARFIKSDIP STNGGIAIID AALIPPTAEF GVTLYDRILR
TPTLRVFREL TSVLGLQREF DTQGFGNGDC TVFAPTDSAW VTLLADLMTT QEEIANRATA
MLYDIIMLML APRTEEFQEI ERELYDPWLS KDALDGDELV TALSGYTTVR VL