Gene OSTLU_29736 details

Gene Information

Locus tag: OSTLU_29736
Symbol:
ID: 5006897
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009375
Strand:
Start bp: 205050
End bp: 206538
Gene Length: 1489 bp
Protein Length: 474 aa
Translation table:
GC content: 59%
IMG OID: 640422318
Product: predicted protein
Protein accession: XP_001422921
Protein GI: 145357428
COG category:
COG ID:
TIGRFAM ID:
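
The gene length above follows from the start and end coordinates, which are inclusive. A minimal Python sketch of that arithmetic is given here; the function name span_length is hypothetical and not part of the database. It also compares the span with the number of bases needed to encode the listed protein, assuming a single unspliced open reading frame that ends in one stop codon.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


# Values copied from the record above.
gene_len = span_length(205050, 206538)

# Assumption: one stop codon follows the 474 codons of the protein.
coding_bp = (474 + 1) * 3

# Bases in the gene span that are not needed to encode the protein.
flanking_bp = gene_len - coding_bp

print(gene_len, coding_bp, flanking_bp)  # 1489 1425 64
```

Under these assumptions the listed span is 64 bp longer than the coding region alone.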


Plasmid Coverage information

Num covering plasmid clones: 50
Plasmid unclonability p-value: 0.692446
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.0742895
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
CGCGCCATGG CGCCCGCGCG CTCGAACGCG CGCGACGTCG AGTCGCAGGA ATCGTGCGCG 
GCGACGCACT GCGCCGGGAA CGGGCACCCC GCGGTCGCGG CGCTCGTCGC GCCGTTCGTC
CTCGCGTGGA ACGCGATCGA TGCGTATCTG ACGCCGTGCC TCGGCGCGTA CGCGCGGCTG
GGCGCGCGCG GCGCGATGGG ATCGCTGTGC TGTTGCCTGC TGGAATGCTT TCGGTACGAG
GATAAGGTCT GGGCGGGCGA CGCGGCGCTC GGGGTGGATT GCGAATTTCG CGGGTGCGAT
TGGGCGCGCG TCGACGACCT GAGCGCGGGG AGCGAGGACA AGCCGATGGT GCTGTATCAG
GGAATCATCG AACCTCGGGA CTGCGTGCAG GGCCAGCTCG GGGATTGTTG GTTGGTGAGC
GCGCTGGCGT GCCTGGCGGA ACACCCGGGA GCGATCAAAC GGTTGATATT GAACGGGGAA
AAGTCGCTTC GCGGCAAGTA TCGCGTGCGG TTTTACGACG GCAAGGAGAA GAGGTGGGTC
ACCGTGACGG TGGACGATCT CATTCCTTGT TACAAGGGGA CGAAGAATCC GATATTTATG
CAACCGCACA ACAACGAGTT TTGGCCTTTG ATCGTGGAGA AGGCGATGGC TAAGTTTATG
GGGAGCTACG CCGCGCTGGA CGGCGGGTTC GGCACGTGGG CCACGCACGC GCTCACGGGC
GATAACGTCT TCTTGCTCAA GAAGCGCATG GACGTCGAAC GCACGTGGCG GCGACACAAC
ATGAAGTTTA TCGGTAAGCC CGGTGACGGC GGTAAGAAGG ATCGCATCTA TCACGAAGAA
GTCGAGGAAA ACATCGTGCG CGATAAATTG TTCAACATCC TGACCCAGTA CGACAGCATC
AAGTCCCTGA TCGCCGTGTC GAGGATGACT AAAAATGGCG AGAGCAAAGA CGAAACCACC
GGCTTGGTGT CCGGTCACTT GTTCTCCGTC ATCTCCGTGC GTTGGGCTGG ACGCTCTTGG
GGCGTCGGTG GAAAGCGTTT CATCAAGCTT CGCAATCCGT GGTCGACGTT TGAATGGAAG
GGCGCTTGGG CTGATGGATC GAAAGAATGG GACAAACACC CGGCCATCGC GAAGGAGCTC
GCGTACGTGA ACGATCATCA CGACGGCGTG TTTTGGATGG AGTTTGACGA TTTTTGCGAG
TACTTCAACC AAATCGCGGT GTGTGACCGA ACGACAAAGC GCGACTTTTC GCTCCGGTAC
GATCACGACA ATAAATATTG TGGTCCATTG ATGGGCTGCG TGAGCGGTTG TGCGTGCTTC
TGGTGCGGAT GCCAAGGTCC GTACAAACTT TATTGCGGAC ACCAATCGAC GACCGAAACG
CGTCAGGCGA CCAAGTGCTG CGGCACGATG AAAGTCGCGA ACGACGCGTG AATAAATTAA
CGACTACTTT ACGCGAGTCG GCTTGTAATG AATTTGAGAG CTCTGTCTC
 
Protein sequence
MAPARSNARD VESQESCAAT HCAGNGHPAV AALVAPFVLA WNAIDAYLTP CLGAYARLGA 
RGAMGSLCCC LLECFRYEDK VWAGDAALGV DCEFRGCDWA RVDDLSAGSE DKPMVLYQGI
IEPRDCVQGQ LGDCWLVSAL ACLAEHPGAI KRLILNGEKS LRGKYRVRFY DGKEKRWVTV
TVDDLIPCYK GTKNPIFMQP HNNEFWPLIV EKAMAKFMGS YAALDGGFGT WATHALTGDN
VFLLKKRMDV ERTWRRHNMK FIGKPGDGGK KDRIYHEEVE ENIVRDKLFN ILTQYDSIKS
LIAVSRMTKN GESKDETTGL VSGHLFSVIS VRWAGRSWGV GGKRFIKLRN PWSTFEWKGA
WADGSKEWDK HPAIAKELAY VNDHHDGVFW MEFDDFCEYF NQIAVCDRTT KRDFSLRYDH
DNKYCGPLMG CVSGCACFWC GCQGPYKLYC GHQSTTETRQ ATKCCGTMKV ANDA
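
The GC content and the relationship between the gene sequence and the protein sequence can be checked with a short script. The following is a minimal Python sketch, not the database's own method. It assumes the standard genetic code (the Translation table field of this record is empty), and the helper names clean, gc_content and translate_from_first_atg are hypothetical. Only the first line of the gene sequence is used in the example to keep it short.

```python
from itertools import product

# Standard genetic code (assumed), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the listing; upper-case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return sum(base in "GC" for base in s) / len(s)


def translate_from_first_atg(seq: str) -> str:
    """Translate from the first ATG until a stop codon or the end of input."""
    s = clean(seq)
    start = s.find("ATG")
    if start == -1:
        return ""
    protein = []
    for i in range(start, len(s) - 2, 3):
        amino = CODON_TABLE.get(s[i:i + 3], "X")  # X marks an unknown codon
        if amino == "*":
            break
        protein.append(amino)
    return "".join(protein)


# First line of the gene sequence listed above.
first_line = ("CGCGCCATGG CGCCCGCGCG CTCGAACGCG CGCGACGTCG "
              "AGTCGCAGGA ATCGTGCGCG")
print(translate_from_first_atg(first_line))  # MAPARSNARDVESQESCA
print(round(gc_content(first_line), 2))
```

The translated fragment matches the first 18 residues of the protein sequence, which suggests that the coding region begins at the first ATG, six bases into the listed gene sequence. Running gc_content on the full gene sequence gives a value that can be compared with the 59% listed in the record.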