Gene OSTLU_19236 details

Gene Information

Locus tag: OSTLU_19236
Symbol:
ID: 5006977
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009375
Strand:
Start bp: 196138
End bp: 197259
Gene Length: 1122 bp
Protein Length: 373 aa
Translation table:
GC content: 64%
IMG OID: 640422398
Product: predicted protein
Protein accession: XP_001422919
Protein GI: 145357424
COG category:
COG ID:
TIGRFAM ID:
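
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the gene length counts the terminal stop codon. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(196138, 197259)
print(length, protein_length(length))
```

Under these assumptions the sketch reproduces the reported values of 1122 bp and 373 aa.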


Plasmid Coverage information

Num covering plasmid clones: 54
Plasmid unclonability p-value: 0.906456
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.171238
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGCGCG ACGCGCCGAA GAGCGCGATC CCGGCGATGT GCGCGCACGT CGCGTTCGGC 
GTGTGTTACT TTCTCACGGC GCAGCGCGTG TTGGACGCGC GATTGGTGTC GCCGGAGGCG
CAGCCGTTGA CGGGCATGAT TTACGCGAGC GACGCCGCGA GCGCGAGCGC GGGCGTCCTG
ATGGCGCGAT TACGCGCGAG AAGAGACGGG CGAGTGACGG CGACGGCGAG GCGCGCGGGA
AAGTTCCCGC GCGCGGCGCT GTTGCTGCCG GTGTTTGATT TGTTTGGGCT GACGTGCGCG
TTCGAGGCGA TGCGAGCGCT CGGGGGACCG CTGTACCAAA CCATCTCTGG GTTGCTCATT
CCGCTCTCGG CGTTGCTGTC GAAGGTGGTG CTGAAACGCA CGTTCACTAA GGGGCAGATT
GGGGCGATCG CGGTGGTGAT TTGTGGGCTG GCGGTGAAGG CGAAGGACGT GGCGGACGAG
GCGGCGAGGC GCGGGACGGC GATCGACGCG AGGGGGATCT TGATCGCGAA CGCGGCGACG
GTGAGTTATG GGTTTCGGGG ATTGGTCATG GAATACCTGA GCGCGTCGAA ATCGAGTCTG
AGCGGGAACG CGCAGACGAT GCTGATGGGA ACGTGCGGGT TGGCGGCGTT TGCGATTTAC
ACGCTCGCGA GGACGGCGCG CGATATGGAC GGGATGGTAT GGGCGTATTA CAACGCCTCG
CCGCGAGATG TGTCATCGAT TTTAAAGGTG CACCTAGGAA ACATGCTCAG TCGGGCGTTC
ATGGTGAAGA TGATGATGGC TGTCGTCGCC AGAGCGGGCG CCACGCAGTT GGCGCTCTCG
AACGCGATTC GCAGCGTCGG CGTCATCGCG TTTTCGCACG TCTTATTTTG CTCGGACGAC
GCGAGACAGT GTTTGAGTTA CAATGGCGCC ATCAGCGCCG TCATGGTTGT CACCGGCGGT
CTCGCGTACG CGATGAGTGG GAAGCCGAAA ACCGCCGCCG CCGCCGCGCC CAAAACGGCT
CGCGCGAGAA AAACCACAGT CGCCGTCGCG CGCGCCGACG CAAAACCGAC AGCGTCACCC
TCTTCGACCG TCCGTCGCCG CTCCGCGCGT CGAGGATCGT GA
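
The gene sequence is printed in blocks of 10 bases, 60 per line, so it has to be joined before any calculation. The following is a minimal Python sketch of that cleanup and of a GC-percentage calculation that could be compared with the GC content field. The helper names are hypothetical, and only the first line of the sequence is used as a short demonstration.

```python
def clean_sequence(text: str) -> str:
    """Join a sequence printed in space-separated blocks over several lines."""
    return "".join(text.split()).upper()


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not sequence:
        raise ValueError("empty sequence")
    gc = sum(1 for base in sequence if base in "GC")
    return 100.0 * gc / len(sequence)


# First line of the gene sequence, used only as a short demonstration.
first_line = clean_sequence(
    "ATGGCGCGCG ACGCGCCGAA GAGCGCGATC CCGGCGATGT GCGCGCACGT CGCGTTCGGC"
)
print(len(first_line), round(gc_percent(first_line), 1))
```

Passing the full sequence text to `clean_sequence` and then to `gc_percent` gives the figure for the whole gene.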
 
Protein sequence
MARDAPKSAI PAMCAHVAFG VCYFLTAQRV LDARLVSPEA QPLTGMIYAS DAASASAGVL 
MARLRARRDG RVTATARRAG KFPRAALLLP VFDLFGLTCA FEAMRALGGP LYQTISGLLI
PLSALLSKVV LKRTFTKGQI GAIAVVICGL AVKAKDVADE AARRGTAIDA RGILIANAAT
VSYGFRGLVM EYLSASKSSL SGNAQTMLMG TCGLAAFAIY TLARTARDMD GMVWAYYNAS
PRDVSSILKV HLGNMLSRAF MVKMMMAVVA RAGATQLALS NAIRSVGVIA FSHVLFCSDD
ARQCLSYNGA ISAVMVVTGG LAYAMSGKPK TAAAAAPKTA RARKTTVAVA RADAKPTASP
SSTVRRRSAR RGS
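
The protein sequence can be checked against the gene sequence by translating it codon by codon. The following is a minimal Python sketch. Because the Translation table field of this record is empty, it assumes the standard genetic code (NCBI translation table 1). The `translate` helper is illustrative, and unknown codons are reported as `X`.

```python
from itertools import product

BASES = "TCAG"
# Amino acids of the standard genetic code (assumed), listed in the codon
# order TTT, TTC, TTA, TTG, TCT, ... with the third base varying fastest.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace and trailing stop codons."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop an incomplete final codon
    codons = (seq[i:i + 3] for i in range(0, usable, 3))
    protein = "".join(CODON_TABLE.get(codon, "X") for codon in codons)
    return protein.rstrip("*")


# First 30 bases and last 12 bases of the gene sequence in this record.
print(translate("ATGGCGCGCG ACGCGCCGAA GAGCGCGATC"))  # MARDAPKSAI
print(translate("CGAGGATCGT GA"))                     # RGS
```

The two fragments translate to the first ten and last three residues of the protein sequence listed above.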