Gene OSTLU_29872 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_29872 
Symbol 
ID5000093 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009356 
Strand
Start bp103988 
End bp105245 
Gene Length1258 bp 
Protein Length411 aa 
Translation table 
GC content60% 
IMG OID640415514 
Productpredicted protein 
Protein accessionXP_001416072 
Protein GI145341970 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CCTCAACCGC CGTCGACATG CGAGTCTCGG TGCACGTCGC CGAACTCGAC CTCATCGTCA 
ACGTCGACGT CTCTCCGGAC TGTACGGTCG CAGACCTCAA AGCTGCCGTC GCCTGTGAGT
TCCCAGACGG CGATCCGCGC GATATAACGC GCGCCAAAGT TTTGAAAGAT GCCAAGCAGA
TTCCAGATGC GTCAAGTTTA CGCGACGCCG GCGTCGTGCA GGACGACCTG TTGATCGTGT
CGCTGGGTGC ATCCGGTGGG GAAAGCATCG CGTCGGCGGA CGCGCGCGCG CTTGCGGCGG
ATGGTAGCGC AGTGGACGCG CAGGCGATGA TGGAGAGTTT TCGCGGCAAC GCCGGCACGC
TCGAGGCGTT GCGTCGGCAG GGAGGAAGCG AACTCGTCGA TTGCATCGAA GCGAACGACG
TGGAGGGGTT TCAGAGAATG ATGCGGGAGA TGCGGAAACG GATGTTAGCG GCGAGGGAAC
AGGAGGCGGA GGAGATGGCG TTGATGACGA GCGACCCGTT CGACGTCGAG GCGCAGCGGA
AGATTGAGGA ACGCATCCGA CAAGAACAGG TGTTAGGCAA TTTTGCGACC GCGATGGAGG
AAACGCCCGA GGCATTCGCT CAGGTGGTGA TGCTATACGT CGATCTGGAG GTGAATGGAG
TGGCGCTGAA GGCGTTTGTC GATAGTGGGG CGCAGATGTC GATCATGTCG GTGACGTGCG
CGCGACAGTG CGGACTGGAA AGGCTCATCG ACAAGCGGTT TAGCGGCATC GCGAAAGGCG
TGGGGACGCA GAACATCATT GGACGCGTGC ACCAGGCACC GATGAAGGTG GGTGAACACT
TCTTGCCGTG CGCGATTACG GTTTTGGAGA AGGAACAAGA CATGGACTTC ATCTTTGGTT
TAGACATGCT GCGCAGACAC GCGTGCTCCA TCGACTTGAG GAAGAACGCC CTCGTTATCG
GCTCGGTCGA CGTGGAGTTG CCGTTTTTGA GCGAGAGCGA AATTGGAAAG ACAGCACAGG
AAGCGTTTCA AGGCAAAGCG CCCGAGGCGG CGATCCCGAC CCCCTCGGCG GCGGTCCCGA
CCCCCTCGCC GGCGGTCCCG ACGCCCTCGA CGACGCCTTC GTCTTCGTCG GCGCACGACG
AAGAAAAAAT TGCTCGATTA ACCGCGCTCG GCTTTTCTCG GCAGCAAGTT ATCGACGCCT
TGAACGCGAC GAGTGGTAAC GAAGAGTTTG CGGGTGCGCT ATTGTTCGGT TAAACGCT
 
Protein sequence
MRVSVHVAEL DLIVNVDVSP DCTVADLKAA VACEFPDGDP RDITRAKVLK DAKQIPDASS 
LRDAGVVQDD LLIVSLGASG GESIASADAR ALAADGSAVD AQAMMESFRG NAGTLEALRR
QGGSELVDCI EANDVEGFQR MMREMRKRML AAREQEAEEM ALMTSDPFDV EAQRKIEERI
RQEQVLGNFA TAMEETPEAF AQVVMLYVDL EVNGVALKAF VDSGAQMSIM SVTCARQCGL
ERLIDKRFSG IAKGVGTQNI IGRVHQAPMK VGEHFLPCAI TVLEKEQDMD FIFGLDMLRR
HACSIDLRKN ALVIGSVDVE LPFLSESEIG KTAQEAFQGK APEAAIPTPS AAVPTPSPAV
PTPSTTPSSS SAHDEEKIAR LTALGFSRQQ VIDALNATSG NEEFAGALLF G