Gene OSTLU_31007 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_31007 
Symbol 
ID5001377 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009358 
Strand
Start bp143842 
End bp145809 
Gene Length1968 bp 
Protein Length655 aa 
Translation table 
GC content57% 
IMG OID640416798 
Productpredicted protein 
Protein accessionXP_001417434 
Protein GI145345894 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0144] tRNA and rRNA cytosine-C5-methylases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value0.906185 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACGACG GGCATCGCGA GAAGCGCGCG AAGCTCGAGC CGCCGGCGCT CGCGAACGAA 
AGGTTCGAGG CGTATTACGC CGCGCAGGGC GTGTGCGAAG ACGCGGGCGA CTTTCGAGCG
ATGATGGAGG CGTTTCGAAG ACCGCTGCCG CTGACGTTTC GACTGAACGC GTCGACGGCG
CTCGTGGAGG CGGTGCGGAG GAAGCTGGAG GGCGACGTGT TGCCGGCGCT GAAGCGCGAG
GCGTCGATGA AGCCGCCGAA GTGCGTCGCG TGGTATCCGG ATCGTTTGAG CTGGCAGATA
GATATATCGC AGAGCGCGTC GTTGAAACAG AGTGAATCGC GCGGCGTGCT CGGGTTGCAC
GCGTTTTTGA AGAGCGCCGG GGAGACGGGG GCGCTGACGA GGCAAGAGTT GGTTTCGATG
ATTCCGCCGT TGTTTTTAGA GGTGAAACCG AAGCATCGCG TGATCGATAT GTGCGCCGCG
CCGGGAAGTA AGACGTCACA GTTATTGGAG ATGTTGCACG GCGCGACGAA CGCTGGGGAG
ACGCCTCGAG GGGTGGTGGT AGCGAACGAC GCGTCTTTAC AGCGAGCGAA TTTGCTCACG
CATCAGTGCA AGCGAAGCAA CTCCCCGGCA CTCGTCGTGA CGAATCATCA GGCGCAGTTG
TTTCCAATTT TACACGACGC CAAGGGAAAG AAAATTAGGT TCGATAGAAT TTTAGCCGAC
GTCCCGTGCA GCGGTGATGG GACGCTGCGG AAATCGCCCG ACCTTTGGAA GAAGTGGAAC
GCCTCGAGCG GGGTGGATTT ACACACGCTT CAGCTTGAGA TCGCCACCCA CGCCTTGCGC
TTGCTCGAAG TGGGTGGGCG TTTAGTCTAC TCGACGTGCA GTTTGAACCC ATTGGAGAAT
GAGTCCGTCG TCGCGGCGCT GTTGAAGCGC GCGAAAGGCT CTGTGGAGCT CGTCGACGTC
TCGAAGAGCT TGCCCGAGCT TAAGCGACGA CCGGGGATGA AGAGATGGAA AGTCGGTGAT
ATATACGGAT GGCACGACTC TTTCGAAGAG ACTGGGAAGA AACGCATGAA AACCGTGGCG
AAGACGATGT TTTGGAACCG AGAGTATGAC GCGATGCCGC TCGAGAGATG CGTTCGAGTA
TTTCCCCATC TCGACGATAC GGGTGGATTT TTCATCACCG CGTTGAAGAA GACGGCCGAG
TTGCCTCCGG AGATGGAGCA AACGCCGCAA ATGGACGCAA ACAAGACTTA TAGAATGGAG
CGTGCGAATG AGCAGTGGAA CGAAAAGAAA CGTGTGGCGC CGGTGATGAA GGTTGAAGAC
AGATCCATCG TGAAGAGCAT CAATAAACAT TACGGCGTGC AAGACGCGCT CGATCTGGAC
GACGCGTTGA TGACGCGTCA GCACTCCGAC CTTCCGGGTG TCACTCCAAA GCGTCTATAT
TACTTGTCCG ACGGCGCACG AAAAGTGTTG ACGGCGCGGG GAAAGGATGG CAAGAACGCT
GGTTTGCAGG TCGTTGCGTG CGGCGTTCGT GCGTTTGAAC GTCAAATCGT CGACGGCGTC
GAGTGCGCTT ATAGAATCAC GCAAGAAGGC CTCGACACCG CACTGCCATG CTTAAAGAAG
CAAATCGTCC GCGTTCGTGC GAGCGAGCTG GAAATCATTC TCGCGCGGCA GCAAGACGAA
AACGTGGGCG CGAATTCGTC GTCGCGATCG AGCTCGGACG ACGTCCCCGA GGAAATCACG
AATGCGAAAT CCATCGAACA TCTAAAAAAG GTGTCCGATG GGTGCGTTAT TTTAGTCCCT
AAAGCAAGAG ACGACGACAC AGAAACCGAG GCCAAGGCTC TGGCCGTCGC CGCGTGGCTC
GGCCGCGGTA AAAAAGGGAA ATCGATCTCG GTGTTGGCCA GCAAAGCGAG CGGGGGACAG
TTATTGTATC AACTTCGCGA TTGTATGTCG CGCGGCACGG TGGTGTGA
 
Protein sequence
MDDGHREKRA KLEPPALANE RFEAYYAAQG VCEDAGDFRA MMEAFRRPLP LTFRLNASTA 
LVEAVRRKLE GDVLPALKRE ASMKPPKCVA WYPDRLSWQI DISQSASLKQ SESRGVLGLH
AFLKSAGETG ALTRQELVSM IPPLFLEVKP KHRVIDMCAA PGSKTSQLLE MLHGATNAGE
TPRGVVVAND ASLQRANLLT HQCKRSNSPA LVVTNHQAQL FPILHDAKGK KIRFDRILAD
VPCSGDGTLR KSPDLWKKWN ASSGVDLHTL QLEIATHALR LLEVGGRLVY STCSLNPLEN
ESVVAALLKR AKGSVELVDV SKSLPELKRR PGMKRWKVGD IYGWHDSFEE TGKKRMKTVA
KTMFWNREYD AMPLERCVRV FPHLDDTGGF FITALKKTAE LPPEMEQTPQ MDANKTYRME
RANEQWNEKK RVAPVMKVED RSIVKSINKH YGVQDALDLD DALMTRQHSD LPGVTPKRLY
YLSDGARKVL TARGKDGKNA GLQVVACGVR AFERQIVDGV ECAYRITQEG LDTALPCLKK
QIVRVRASEL EIILARQQDE NVGANSSSRS SSDDVPEEIT NAKSIEHLKK VSDGCVILVP
KARDDDTETE AKALAVAAWL GRGKKGKSIS VLASKASGGQ LLYQLRDCMS RGTVV