Gene OSTLU_33119 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_33119 
Symbol 
ID5003215 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009362 
Strand
Start bp420582 
End bp422195 
Gene Length1614 bp 
Protein Length537 aa 
Translation table 
GC content57% 
IMG OID640418636 
Productpredicted protein 
Protein accessionXP_001419157 
Protein GI145349473 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.375322 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.152711 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGGTGG CGGCGACGAT GGGGGGGGAG GGAGGGGGGG GGACGGAGCT GCGGGCGTTC 
GTGGCGGCGT GCGCGAACGC GCTGGGGCGA AGGGACGAGG ACGTGGGGGC GAGGATCGCG
GCGACGCTGG AGGCGAATTG GTTCACGACG CCGATGGATT TGGCGAGATT GAGCGTGGAG
GAGGCGCGAG CGATGGCGGT GCCGATGAAA CTGGTGGATG AGTTTAAGCG CGCGCTGGAC
GGCGCGCGCG GGTTCGCGGC GAGCGGCGCG AGCGCGACGA CGACGACGAC GGAGAAAAAC
GACGAGGGAG AAACGGCGCT CGATTTAGGA CTCGAGCCGT TGGCGTCGGA GACGGCGGCG
ACTACGATGA AGACGATGGT GTCGTCGAGC GGACGCATGG CGTTGAAGAT GCGCACCGGT
TTATCCGCCG AGTCGACGCG AGTGACGAGT CGACCTGTCG GATTACCGAA TTATCGCATA
CCGTTGGAGG AATGCGGAAG CGATTTGAAG AAACAGTTCA AGGCTTTGCG CAAATTCCTT
ACCGTCCGTA GGCTTGGGCC TCAAGAATGC ATTCTCGCTG GGGTGACGGC GGAAAAGTAC
GAAGACGTTT TGCGAGGCGC CTTGGGATGG TTGTGCGCGG AGAAAAACAT GAAACCTTCG
AAAGTCACGC TCCTCGACTT ATTCCCGAGT ATTGACGCGG AGAGTGCCAA TGGAGCTTTT
GAATACGTCA CTTGGCTGAA CGATGAGCGA CAGACGTCGG CAAATTACGA GCTTCTCGTC
ACTCGCTCGT GCATCGCGGC GGTGAAATTC TTGTACGGCA ATTTGAGCAA GGCGCAACCG
GGCGAGGGCG AAGCGAAACC GTACCATGAC TTGCCGGTGA TGAAAGAATT ACGTCGTATG
GCGAAAGACG CTAAAGCTCG ATCAGCCAAG GCGCCTAGCG TGAGCGATGA GCGTCTGAAA
TGGCTCGAAT GGGATGAGTA CTTGACGCTC GTGCAACGAT TGAAGAGCGA GTGCACGGCG
AAGAATTGTC TGGGACAATC TCGCTCGGCG AGCGCCGTGG CGTGGAGTGT GCAGAAGTAT
TTAATTTTTG GCGTTCTGTC GTGTGTGCCC GATCGTCAGC GAACTCTACG CGAGCTACGA
GTCGGGAAGA CGCTGTTCAA AGAGGGCGAT AAGTGGGTCA TTCGCCACAA AGAAACCGAT
TATAAAACGG GCAAGGACTA CGGGGTGCGA CCACCGCTCG TAATCGCTCC GCATCTATAT
CCGACGCTCG AGACATTCAT CGAAACGCAC CGCAAAGAGC TCAATCCGAA CCACGATTTC
CTTTTCACTC GCAAGAATGG TGAGCAGTTT GACGGACAGG GGATTTATCG GCTCTTTACG
ACGACGGCGA TGCGTCTCAC CGGCAAGCGG ACGAACCCGC ACTTGATTCG AGACATGGTG
GTGACGCACT TACGAGGGAC GGACGCGTCC GAGCGACAGT TAGAGGCGCT AGCAATCTAT
ATGGGTCACT CACTTCAGAT GCAAAAGTCG ACGTACGACA GACGATCTGT CGAGCAAAAA
GTTGCGCCGG CGGTGGATCT GTTAGATTCG CTCAACGCAA AGATGAGACT CTAG
 
Protein sequence
MAVAATMGGE GGGGTELRAF VAACANALGR RDEDVGARIA ATLEANWFTT PMDLARLSVE 
EARAMAVPMK LVDEFKRALD GARGFAASGA SATTTTTEKN DEGETALDLG LEPLASETAA
TTMKTMVSSS GRMALKMRTG LSAESTRVTS RPVGLPNYRI PLEECGSDLK KQFKALRKFL
TVRRLGPQEC ILAGVTAEKY EDVLRGALGW LCAEKNMKPS KVTLLDLFPS IDAESANGAF
EYVTWLNDER QTSANYELLV TRSCIAAVKF LYGNLSKAQP GEGEAKPYHD LPVMKELRRM
AKDAKARSAK APSVSDERLK WLEWDEYLTL VQRLKSECTA KNCLGQSRSA SAVAWSVQKY
LIFGVLSCVP DRQRTLRELR VGKTLFKEGD KWVIRHKETD YKTGKDYGVR PPLVIAPHLY
PTLETFIETH RKELNPNHDF LFTRKNGEQF DGQGIYRLFT TTAMRLTGKR TNPHLIRDMV
VTHLRGTDAS ERQLEALAIY MGHSLQMQKS TYDRRSVEQK VAPAVDLLDS LNAKMRL