Gene OSTLU_31512 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_31512 
Symbol 
ID5002074 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009359 
Strand
Start bp72347 
End bp73963 
Gene Length1617 bp 
Protein Length538 aa 
Translation table 
GC content58% 
IMG OID640417495 
Productpredicted protein 
Protein accessionXP_001417665 
Protein GI145346376 
COG category[R] General function prediction only 
COG ID[COG2319] FOG: WD40 repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.268234 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCTCCA GCCTCGCCTG GGTGCCTCCC GGCGCCGCGA GCGCGTCGCC CAAGTACGCG 
GACGTCCCCG AAGAGGAGCT CGAGCGGCTC GCGCATCGCG CGCGCGACGT CGCGCGTCGC
CGCGACTCGC GACGGCGCGA CTCCGACGCC TCGGACGCGT CGGAAGACGA GGATGGGATG
ACGGACGATG ATTCGGACGC GTCGAGCGGC GACGCGTCGG ACGAGAGCGA CTGGGAAGAG
GTCGAGGCGG ACGCGGAGGA GGATTTGGAC GAAATGGCGG CGGACGAAGA AGACGCGAAA
GAAACGACGA CCAAGGCGGT GGCGAAGGCG AAGGCGGTGG CGAAAGCGGC GAAAGGAATG
GTTGATGATC TCGCGGAACT GAACATGGAC GCGTACGACG ATGAAGAGGA GGATGAACGC
GCGGCGGCGG GACGGTTGTT CGGAAGCGGA CGCATGACGC ACTACGACGG AAACGAGGAC
GATCCGTACA TGACGATTAA GGATAGCGAT GACGACGAGG ATGAGATGCC GGACGATATG
ACGATGGCGG AGACGGATTT GGTGATATTG GCGGCGCGAA CGGACGAGGA TGTGTCGCAT
TTGGAGGTGT GGGTGTACGA GGAGGCGGGA GTGACGGGGA ACGCGGAGAC GAATTTGTAC
GTGCATCACG ACGTGCTTTT ACCGGCGTTT CCGTTGAGTG TGGCGTGGAT GAATTGCGCA
CCCAAGAGCG GGACGAATGA AGTCAACTGT GCGGCGATCG GGACGATGTA TCCAGGGATC
GAGATTTGGG ATTTGGATTG CGTGGACGCC GTCGAGCCGG TGACGACGCT GGGGGGATAT
TCAGACGAGG CGATCAAGGC TGCGAGTAAA AAGGGTAAGA AGGGCGGCAA GAAAGAGTCG
AAAGCGTTGA AAGGCGGCTC ACACGAAGAC GCCGTCATGG GATTGTCGTG GAATCGCGAG
TTTAGAAACG TCCTGGCGTC GGCGAGCGCC GACACGACGG TTAAGATTTG GGACATCGCG
ACGGAAACCG CCTCGCAAAC GCTGAATCAT CACAAAGGGA AAGTGCAGGC GTGCGAATGG
AACCCAGCTG AACCTACTGT GCTTCTCACA GGATCTTACG ATAAAACGGC TCAAGTTGTA
GACGTCCGCG CGCCCGATAA TGCATCACTT ACGTGGAAAG TCGGCGCCGA CGTCGAGAGC
GCAATTTGGC ACGTCGGATC GCCGACGCAG TTTTTAGTAT CGAACGAAGA TGGGCTCGTG
ATGTGCTTCG ATACACGCAT GGGATCAAAG TCGGACTGTG TTTTCAAGCT CCAGGCGCAC
GACAAGGCCA CAACAGGGCT GAGCATGGCG TCTGGTGCGC CCAACCTATT GACGACGTGC
TCCACGGACA AGTCGATCAA ATTGTGGGAT TTGAACGATG GTAAACCGTC CTTACTGTGT
CAGCACTCTC CTCAAGTGGG AGCTATTTTT GCGTGTGGAT TTTCGCCTTC GGTGCCGTAT
TTGATAGCCG CCGCTGGCTC CAAGGGCACC GTGGCGGTTT GGGACATCCT GTCGGAAGCC
GCAGTCAAGC AAACTCACGG AAAAACTCTC GAACAATACT ATCGCGTGTC AAAGTAA
 
Protein sequence
MISSLAWVPP GAASASPKYA DVPEEELERL AHRARDVARR RDSRRRDSDA SDASEDEDGM 
TDDDSDASSG DASDESDWEE VEADAEEDLD EMAADEEDAK ETTTKAVAKA KAVAKAAKGM
VDDLAELNMD AYDDEEEDER AAAGRLFGSG RMTHYDGNED DPYMTIKDSD DDEDEMPDDM
TMAETDLVIL AARTDEDVSH LEVWVYEEAG VTGNAETNLY VHHDVLLPAF PLSVAWMNCA
PKSGTNEVNC AAIGTMYPGI EIWDLDCVDA VEPVTTLGGY SDEAIKAASK KGKKGGKKES
KALKGGSHED AVMGLSWNRE FRNVLASASA DTTVKIWDIA TETASQTLNH HKGKVQACEW
NPAEPTVLLT GSYDKTAQVV DVRAPDNASL TWKVGADVES AIWHVGSPTQ FLVSNEDGLV
MCFDTRMGSK SDCVFKLQAH DKATTGLSMA SGAPNLLTTC STDKSIKLWD LNDGKPSLLC
QHSPQVGAIF ACGFSPSVPY LIAAAGSKGT VAVWDILSEA AVKQTHGKTL EQYYRVSK