Gene OSTLU_49687 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_49687 
Symbol 
ID5002143 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009359 
Strand
Start bp596753 
End bp597991 
Gene Length1239 bp 
Protein Length369 aa 
Translation table 
GC content55% 
IMG OID640417564 
Productpredicted protein 
Protein accessionXP_001418058 
Protein GI145347191 
COG category[R] General function prediction only 
COG ID[COG0820] Predicted Fe-S-cluster redox enzyme 
TIGRFAM ID[TIGR00048] radical SAM enzyme, Cfr family 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.622489 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGGCGG CCACCGCGCG CGCCGTCGCG AAGGATTTGC TCGGAATGTC CGCGCGCGCG 
CTTAAATCGA TCGTCGTCGA CGAGTGCGGG CAACCTTTGT ATCGAGCGAC GCAGATTCGC
GAACACCTGT ACGGCGCGCG GCGGTGTCGA AGGATTGAAG ATTTCTCGCT GATACCGCGT
GAAATGCGGG ACGCGCTCGT CGCGGGGGGG TATCGAACCG GAAGATTGGC GGTGGAGTCG
GCGAGCGTGA GTGGGTGCGG CACTGGGAAG GTCTCCTTGC GCGTGGGCGA GCGAGAGGTG
ATCGAGGCGG TGGGGATTCC AGACGCGAGT TGTTGGCGCG CGAGCGCGGA GGCTGAGGTG
GAGGTAGAGA ACGCATCTGA GGTGTTCAAA AGCGTGCAGG GATGGGATAA GAATCGGTTG
ACGGCGTGTG TGAGCAGTCA AGTCGGATGC GCGATGAAGT GCACGTTTTG CGCGACAGGG
ATGCAAGGAT ACAAGCGAAA TTTGACGCCG GCGGAAATCA CGGCTCAGGT GATCGAACTC
GAGGAGCTGT ACGGTAAACG CGTCTCGCAG GTGGTCTTTA TGGGCATGGG TGAACCGATG
CTGAATATCA AATCCGTGGT TCAAGCGATA AGGTGCCTGA ACGAAGATGT TGGGATTGGT
GGACGGCACA TAACAGTTTC AACCGTTGGC ATCCCGAATT CGTTGAAGAA ACTGGCGAAG
GAAAAGCTCG CAATCACGCT TGCAATCTCT TTGCACGCCC CTGATCAACA CACGCGGGCA
AAAATAGTGC CATCCGCGAA GTACTATCCA ATGGAGGACT TATTGAATGA CGCACGCGCT
TACTTTAAGG AGACAGGGAG ACGCGTGACG TTCGAGTACA CCTTGCTCGC CGGCGTCAAC
GATTCCCCAT CTCAAGCAAA AGCGCTGAGC CGAATGTTAA AACGGAAGTT TGGTACCGGC
GCACACGTCA ACATCATTCC TTGGAATAAC ATCGATGGTA TTAATCACAC AAGGCCATCC
GGAAACGCCA TTCATCGATT CTGTGCGCAG TTAGAGGGCG GTGTGACGCA TACCATACGA
CGCACGCGTG GCTTGGACAC AAACGCCGCA TGCGGAATGC TCACTGGAGC GTTCGAAAGA
CGAACGCTTC GTGCCAACGC GTGAGCATAC AGCAAAAAAA AGATGTACAG GATGTACGGT
AACGCTAGTA ACTGTAGCCT ATATTCTAGA AAAGTCTAG
 
Protein sequence
MAAATARAVA KDLLGMSARA LKSIVVDECG QPLYRATQIR EHLYGARRCR RIEDFSLIPR 
EMRDALVAGG YRTGRLAVES ASVSGCGTGK VSLRVGEREV IEAVGIPDAS CWRASAEAEN
RLTACVSSQV GCAMKCTFCA TGMQGYKRNL TPAEITAQVI ELEELYGKRV SQVVFMGMGE
PMLNIKSVVQ AIRCLNEDVG IGGRHITVST VGIPNSLKKL AKEKLAITLA ISLHAPDQHT
RAKIVPSAKY YPMEDLLNDA RAYFKETGRR VTFEYTLLAG VNDSPSQAKA LSRMLKRKFG
TGAHVNIIPW NNIDGINHTR PSGNAIHRFC AQLEGGVTHT IRRTRGLDTN AACGMLTGAF
ERRTLRANA