Gene OSTLU_38572 details

Gene Information

Locus tag: OSTLU_38572
Symbol:
ID: 5002107
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009359
Strand:
Start bp: 503374
End bp: 504777
Gene Length: 1404 bp
Protein Length: 410 aa
Translation table:
GC content: 57%
IMG OID: 640417528
Product: predicted protein
Protein accession: XP_001417794
Protein GI: 145346642
COG category: [R] General function prediction only
COG ID: [COG2520] Predicted methyltransferase
TIGRFAM ID:
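The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, which is an assumption about this record's conventions that happens to reproduce the listed gene length. The function names are illustrative and do not come from any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def max_unspliced_residues(gene_length_bp: int) -> int:
    """Residues encoded if every base were coding, excluding the stop codon."""
    return gene_length_bp // 3 - 1


gene_length = span_length(503374, 504777)
print(gene_length)                          # 1404
print(max_unspliced_residues(gene_length))  # 467
```

The listed protein length (410 aa) is shorter than the 467 residues an uninterrupted 1404 bp reading frame would encode, which is consistent with the record flagging the gene as spliced.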


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00890076
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGTCGCGA CCGCGCTGAA CGGACCGTTA TCGCCGCCTC CGGCGGTGCA CGTCGATCGC 
GAGCAGTTCA AAACATCTCT GCGCGTGCGC GCGATTCGAG TGCCCACGCG CGAGACGGAT
GCGGTGCTGA AAGCGCTGCG AGGATTCGCG CTGGACTTGC CCAGGGTGAA GGCGGTGACG
CGAGACGCCG AGGGCGAACG CGACGGGAAT TTAATCCTCC TCAACGATAA AGTCGTCGAC
GACGATCTTC GCGCGGTGGT ACCGGAAGAA AGGTTAAGCG TCGTGCGCGA GCGCGTCGGA
GGTGAGATCG AGGTGACGGA GTACGACGTG CCGTTGACGT ACGAATATTT CAACGCCGCG
CAAGTGCTGA GGAAATTGTT GCCTTCGGCG GTGGAGGTGC CGAGCTCGTT CGAGACGGTC
GGGCACATCG CGCACATGAA TTTGAGGGAT GAGCACGAAT CGCACAAGTA TTTGATAGGG
AAGGTGATTT TGGAGAAGAA CGAGCGGTTG CGGACGGTGG TGAATAAAGT CGGGAGCATC
GAGAGCGAGT TTCGCGTGCC GGAGTGGGAG TTGCTGGCGG GCGAACCGAG TCTCGTGACG
GAAGTGAAGC AGCACGGGAT GACGTTTAAG CTAGATTTCG GAAGCGTGTA TTGGAATTCA
AGGTTAGAGA CGGAGCATAA ACGGTTGGTG GACTCGTTCA AGGCGAATGA AGTCATTTGC
GACGCGACGA GCGGCGTCGG GCCGTTTTCG GTACCGGCGG CGCAAAAGGG TATACGTTGC
TACGCCAGCG ACTTGAATCC GGATTGCGCC AAATATTTGA AAATCAACGC CAAAGAGAAT
CGAGTGAAGA ATCTCGTCAA GTGCTACAAC ATGGATGCGC GCGCGTTCAT CAAGGCACTT
TTAGCGGCGC CAGAGAACCG TGACGTCGAC GTCGAGCGAG AGTGGACGGC CACCAAAGCG
ACGTACGACG CGGAACTCGC CGATTATAAC GCCAAGAAGC GAGACGCGAA GGCCAAAAAG
ATTAATTATC GCGAGGCGAG GCCCAAGCTC ACGTGGGCGG CGGCTGATGA CGACGGCGCG
CCCCCCGCAG GGGCGACGTT CGATCATTTG GTTACGAATT TGCCGGCGTC TGGGATAGAA
TTTTTAGATT GTCTGAGGGG TTCGTTTGAT CGAAAGGTTT GGGAGCACAG AGAGTTACCT
ATGATACACT GTTACACGTT CAAAGGGGCG GACGAGACCG ACGCGGACGT CATTAAACGT
GGTGCGGGCC ATCTCGGCGC GGAAATCGTC GACGCCGCGG TGAGCGAAGT TCGCGACGTG
TCTCCGAATA AGCTCATGGT TTTGTTGTCT TTCCGAATCA GCGCGGAAGC AGCGTTCTGT
ACAAAAAGAC AGTGCACGAC GTGA
 
Protein sequence
MVATALNGPL SPPPAVHVDR EQFKTSLRVR AIRVPTRETD AVLKALRGFA LDLPRVKAVT 
RDAEGERDGN LILLNDKVVD DDLRAVVPEE RLSVVRERVG GEIEVTEYDV PLTYEYFNAA
QVLRKLLPSA VEVPSSFETV GHIAHMNLRD EHESHKYLIG KVILEKNERL RTVVNKVGSI
ESEFRVPEWE LLAGEPSLVT EVKQHGMTFK LDFGSVYWNS RLETEHKRLV DSFKANEVIC
DATSGVGPFS VPAAQKGIRC YASDLNPDCA KYLKINAKEN RVKNLVKCYN MDARAFIKAL
LAAPENRATF DHLVTNLPAS GIEFLDCLRG SFDRKVWEHR ELPMIHCYTF KGADETDADV
IKRGAGHLGA EIVDAAVSEV RDVSPNKLMV LLSFRISAEA AFCTKRQCTT
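
The sequences above are printed in space-separated groups of ten characters, so they need light cleanup before any computation. The following is a minimal sketch of how the blocks might be joined and how a GC fraction could be computed from the gene sequence. The helper names are hypothetical, and the GC calculation is the plain (G + C) / length definition, which may differ from how the database derived its "GC content" field.

```python
def clean_sequence(block: str) -> str:
    """Join a whitespace-grouped sequence block into one uppercase string."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (simple definition, no ambiguity codes)."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGGTCGCGA CCGCGCTGAA CGGACCGTTA TCGCCGCCTC CGGCGGTGCA CGTCGATCGC"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_fraction(seq), 2))
```

Applied to the full gene block, `clean_sequence` should yield a string whose length matches the listed gene length of 1404 bp, and the protein block should yield 410 characters, provided the record is internally consistent.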