Gene OSTLU_49932 details

Gene Information

Locus tag: OSTLU_49932
Symbol:
ID: 5002203
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009360
Strand:
Start bp: 530680
End bp: 532686
Gene Length: 2007 bp
Protein Length: 668 aa
Translation table:
GC content: 62%
IMG OID: 640417624
Product: predicted protein
Protein accession: XP_001418514
Protein GI: 145348140
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0476] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2
TIGRFAM ID: [TIGR01381] E1-like protein-activating enzyme Gsa7p/Apg7p
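
The start and end coordinates, gene length, and protein length in this record can be checked against each other with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding one terminal stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record: Start bp 530680, End bp 532686.
length_bp = gene_length(530680, 532686)
print(length_bp, protein_length(length_bp))  # 2007 668
```

Under these assumptions the result agrees with the reported Gene Length (2007 bp) and Protein Length (668 aa).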


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.0565286
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGGACG CGACGACGCC GCTGATGTTC GAACCGCCGT GCAGCGCGCC GGACGGGGGG 
TTCTGGCGCG AGGCGGCGCG CGTGAAGCTG CACGAGGCGA AACTGGATGA AACGCCCATC
GACGTCCGCG CGCGCGTGTG CTGCGCGCAG AACGCCGAGG TGTCGAGCGC GGTGTCGTTG
GATGCGCTCG CGTTCGACGA CGCGACGAGC GAGGGCGAGG AGGCGGCGGG AGGACGAGGG
ACGTGGACGA CGCGCGGGCG GTTGACGTGC GCGAACACGC GCGAGGCGTT GGCGACGTTC
GATCGCGACG GTGCGATGCG CGCGATGGGA CGAGAGATGC TCGAGAGCGT GATGAATGGC
GACGCCGAAC GCGAGCCCGA ACGATTGAGG GCGTTCGCGG TGGTGGCGTA CGCGTGTTTG
AAGAGTTGGT CGTTTACGTA TTGGTTCGCG TTTCCGGCGC TCGCGAGCGC GGAGTTTAAG
ATTATGTCGT CAGCCGTCAC GGGAATGACG AACGAGGGCG TGGATGGCGA CATCGCGGCG
ACGTGTGAGC GATGGATCGC GAGCGGGGGG GCGTCTGCGT GGTTGGTGAG CGAGGACGGC
CGCGAGGCGT ACGCTTTGAC GGAGTACGAG GCGAGGACGC GAGCTGGTGC GAAGCCGCGG
CTCGCGTTCG CGGATGCGTG CTGCGCGATG ACGCATCCGG GCTGGACTTT GCGCAACTTG
GCAGTGTTGG CGTCAGCACG TTGGGGTGCG AGCGCGTTGG ACGTGGTATG CGTACGAGCG
CGCAAAGGGC GGGTCGCCGC CGAGGCATGT GTGAAGTTTA CGATGTCTTT TCCAAAGTTC
GATGTGGAGA CGATGAAAGT GGTTGGCTGG GAGCGTAACG CACGCGGAAA GATGGGCCCG
CGCACGGTCG ATCTCGGGGC GAGCATGGAT CCAAATCAAC TCGCGTCGCA GGCAGTGGAT
CTGAATCTAA AACTCATGCG ATGGCGTTTG CTTCCGGAAC TGGACCAAGA AAAACTCGCA
GCCACTAAGT GCTTGCTCAT CGGCGCGGGA ACGCTAGGGT GCGCAGTCGC TCGCACGCTC
ATGGGCTGGG GTGTCAAACA CATCACGTTC GTCGACTCGG GGCGAGTTTC GTATTCTAAC
CCAGTTCGAC AGTCGCTGTT TGAGTTCGAA GACTGTCTCG ACGGCGGCGC GCCCAAGGCT
GCGGCGGCGG CGAAAAAGCT CACAGAAATT TTCCCGGGCA TGTTTGCCAA AGGAGTTCTG
ATGAGTATTC CCATGCCCGG ACACAGCGTC AGCGAGAAAC TGAAAGCATC GGTGTTCAAA
GACGTCGACG ATATCGAGGC GTTGATCGAC GCGCACGACG TCGTGTACGT GCTCACGGAC
ACACGCGAAT CGAGATGGCT TCCAACGTTG ATTTGCGCCG ATAAGGGGAA GCTTTGCATT
AATACCGCGC TCGGTTTCAA CACTTATCTC GTTATGCGGC ACGGATGCGG CGTCGACCAT
GCGTCGTCAT CGCGGCTTGG ATGCTACTTC TGCAACGACG TCATGGCTCC GGCGAATAGC
ACAAAGGACC GCACGCTAGA TCAGCAATGC ACGGTGACAC GCCCAGGTCT TGCTCCCATC
GCAAGTGCGC TCGCCGCCGA ACTCATGGTG GCCTTACTTC ACGCCGAAAA TGGCGTCACG
ACGTCGCCTC CGACTCGCGA ACAAGACGTG AGCGCGGAGC GCGAAGCCGA CTCCTCGCCC
TTGGGCGTCG TCCCGCATCA AATTCGTGGC AGCGTCGCCG GGTTTACTCA AACCCTATTT
GACGCCCCGT GCTTCCCTCG ATGCACCGCG TGCTCCACCG CTGTCGTCGC GAAATACCGC
GACGATCGCG ACGGATTTCT CACCGCCGTC TTCGACGATC CCAAAACGCT CGAAGACGCC
ACTGGTCTTA CCGACCTCCT CGGCGCCGTC GACGCCGACG ACGCCGAGTG GCTCGACGAC
GACTCCGACG ACGCGTTCGA CGTTTAG
 
Protein sequence
MADATTPLMF EPPCSAPDGG FWREAARVKL HEAKLDETPI DVRARVCCAQ NAEVSSAVSL 
DALAFDDATS EGEEAAGGRG TWTTRGRLTC ANTREALATF DRDGAMRAMG REMLESVMNG
DAEREPERLR AFAVVAYACL KSWSFTYWFA FPALASAEFK IMSSAVTGMT NEGVDGDIAA
TCERWIASGG ASAWLVSEDG REAYALTEYE ARTRAGAKPR LAFADACCAM THPGWTLRNL
AVLASARWGA SALDVVCVRA RKGRVAAEAC VKFTMSFPKF DVETMKVVGW ERNARGKMGP
RTVDLGASMD PNQLASQAVD LNLKLMRWRL LPELDQEKLA ATKCLLIGAG TLGCAVARTL
MGWGVKHITF VDSGRVSYSN PVRQSLFEFE DCLDGGAPKA AAAAKKLTEI FPGMFAKGVL
MSIPMPGHSV SEKLKASVFK DVDDIEALID AHDVVYVLTD TRESRWLPTL ICADKGKLCI
NTALGFNTYL VMRHGCGVDH ASSSRLGCYF CNDVMAPANS TKDRTLDQQC TVTRPGLAPI
ASALAAELMV ALLHAENGVT TSPPTREQDV SAEREADSSP LGVVPHQIRG SVAGFTQTLF
DAPCFPRCTA CSTAVVAKYR DDRDGFLTAV FDDPKTLEDA TGLTDLLGAV DADDAEWLDD
DSDDAFDV
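
The blocked gene and protein sequences above can be converted back to plain strings and cross-checked. The following is a minimal sketch, assuming the standard genetic code (the Translation table field in this record is empty, so this is an assumption) and using hypothetical helper names. It is applied only to the first and last lines of the gene sequence, not to the whole record.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code in TCAG order. Assumed, since the record lists no translation table.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def unblock(text: str) -> str:
    """Remove the spaces and line breaks that group residues in blocks of 10."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_line = unblock("ATGGCGGACG CGACGACGCC GCTGATGTTC")
last_line = unblock("GACTCCGACG ACGCGTTCGA CGTTTAG")
print(translate(first_line))  # MADATTPLMF
print(translate(last_line))   # DSDDAFDV
```

Both fragments match the start and end of the listed protein sequence. Applied to the full gene sequence, the same helpers could be used to check the reported 62% GC content and 668 aa protein length.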