Gene OSTLU_29975 details


Gene Information

Locus tag: OSTLU_29975
Symbol:
ID: 5000536
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009356
Strand:
Start bp: 308075
End bp: 309166
Gene Length: 1092 bp
Protein Length: 346 aa
Translation table:
GC content: 64%
IMG OID: 640415957
Product: predicted protein
Protein accession: XP_001416125
Protein GI: 145342084
COG category: [R] General function prediction only
COG ID: [COG2070] Dioxygenases related to 2-nitropropane dioxygenase
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.448266
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CGTTCGCATC GCATCGCGAC GCATCGGATG GCGCGACGCG CGCGCCTGCG CACGCGGCTC 
ACCGATCTCG TTCCCGGACT CGAGCATCCC GTGACGCAGG GCGGCATGCA CCACGTCGCG
TACGCCGCGC TCGTCGCCGC GGTGTCGAAC GCCGGCGCGC TCGGCACGCT CACGGCGCTG
ACGCAGCCGA CGCCCGAGGA TCTGCGACGA GAGATCGCGC GAACGCGCGC GATGATCACG
CGGAGGAGCG AAAAGAGTAA GAGCGGATAC GCGCCGTTCG CGGTGAACTT CACGCTGCTG
CCGGCGCTGC GACCGCCGGA TTACGAATCG TACGCGAGGG TGATTTGCGA GAGCGACGTC
GAGGTGGTGG AGACGGCGGG AGCGAATCCG GGGAAATTCA TCGAGATGTT CAAGAAAAAG
GGGATAATAG TGATACATAA GTGCACGACG CTGCGACACG CGCTGGCGGC GGAGCGGTTG
GGGGTGGACG CGGTGAGCGT GGACGGGTTC GAGTGCGCGG GACATCCGGG GACGAACGAC
GTGGGGGCGA TGGTTTTGTT GGCCAAGGCG CGAGACGTGC TGACGGTACC GTTTCTAGCG
TGCGGGGGGA TAGGAACTGG GAGGCAACTC GCGGCGGCGC TGGCTTTGGG CGCGGATGGG
GTGTGCATGG GGACGAGATT TATGGCGACG CGCGAGGCGC CGATTAAGGA TGGCATCAAA
CGCGCGTTAA TCGCCGCCGA CGAGAACCAA ACCACGCTCG TCATGACGAC GGTGAAGAAT
CACGAGCGGG TGTATAAGAA TAAAGTCGCC GAAGAAGTGC GCGCGATCGA GGCGGTGAAG
CCCGGAGACT TTGGCGCGAT TCACCATTTA GTGCGCGGGG AAAACTATCG CGTATCGTTT
CAGGAAACCG GCGACGCCGA ATCGAGCGTC TGGAGCGCCG GATGCGTCAT GGGTCTCATC
GACGACGCCC CATCGTGCGA CGAACTCCTC ACGCGCATCA TCGACGAGGC TGTGGACGTG
ATGACGACGC GACTACATCG CATGATTGTC GTAGACGCCG CGCTCTGAGC CGCGTTTCGC
CGTAGAAGCT CC
 
Protein sequence
MARRARLRTR LTDLVPGLEH PVTQGGMHHV AYAALVAAVS NAGALGTLTA LTQPTPEDLR 
REIARTRAMI TRRSEKSKSG YAPFAVNFTL LPALRPPDYE SYARVICESD VEVVETAGAN
PGKFIEMFKK KGIIVIHKCT TLRHALAAER LGVDAVSVDG FECAGHPGTN DVGAMVLLAK
ARDVLTVPFL ACGGIGTGRQ LAAALALGAD GVCMGTRFMA TREAPIKDGI KRALIAADEN
QTTLVMTTVK NHERVYKNKV AEEVRAIEAV KPGDFGAIHH LVRGENYRVS FQETGDAESS
VWSAGCVMGL IDDAPSCDEL LTRIIDEAVD VMTTRLHRMI VVDAAL
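
The fields of this record can be cross-checked against each other. The following is a minimal Python sketch, not part of the original record, showing one way to do that: it computes the gene length from the start and end coordinates, the GC percentage of a sequence, and a translation that begins at the first ATG. The helper names (`gene_length`, `gc_percent`, `translate_from_first_atg`) are hypothetical, and because the Translation table field is blank in this record, the standard genetic code is assumed.

```python
# Sketch for sanity-checking a gene record. Helper names are illustrative,
# and the standard genetic code is an assumption (the record leaves the
# translation table field blank).

BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase."""
    return "".join(seq.split()).upper()


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        return 0.0
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate_from_first_atg(seq: str) -> str:
    """Translate from the first ATG up to the first stop codon or the end."""
    seq = clean(seq)
    start = seq.find("ATG")
    if start < 0:
        return ""
    protein = []
    for i in range(start, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # Coordinates from the Gene Information table.
    print(gene_length(308075, 309166))
    # First line of the gene sequence shown above.
    first_line = ("CGTTCGCATC GCATCGCGAC GCATCGGATG "
                  "GCGCGACGCG CGCGCCTGCG CACGCGGCTC")
    print(translate_from_first_atg(first_line))
```

Pasting the full gene sequence into `translate_from_first_atg` and `gc_percent` would allow the reported protein sequence and GC content to be compared in the same way.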