Gene OSTLU_119526 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_119526 
SymbolArbK 
ID5000257 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009356 
Strand
Start bp487033 
End bp488863 
Gene Length1831 bp 
Protein Length477 aa 
Translation table 
GC content43% 
IMG OID640415678 
Productarmadillo/beta-catenin repeat protein 
Protein accessionXP_001416407 
Protein GI145343604 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.225141 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTCTCT CACAGGTCGG TCATGAGCCT GTTTTTGGGT ATTTTGTTCA ACCGACCTAC 
AGGAAGCATA CGATGAAGCT GTTACTTCGT ACGTAGAAAA CTTCGGAATG GATGCGACTA
CAGCCCAGAT TGAAGCTAGA GAGGAGTTTG CCTTGCAGGG CTTCGATATT TCAAGCATTG
CGACAAGCTT ATCACAAGTC AACTGCAATA CTTTCACAGC AGTTGTGCAA ACTTTTCTGA
CTTCGATCTG TCCTCTTGCT GAGATAGAAG ATGAAACCTG GGGAAATAGT GAAGAAGACA
AGCTCACGTC TGCGATCGTC TCATTCGAAT CATATTTGTC AGATCAGTCT GTTTTCCGTC
CAAACATCGT CGACTTCGCT TGCACTGCTG GTCAAAGAGA AATCATAGAA ATTCTTCTAA
AATTTTGCTG CAAAACAACT TTGGCTCAAA CACTACGCGA CCGCGCTCTA AATTGTTTAA
CAGGTGCGTG TAAAAGTTTT TTTCGTATTT TTTGTCTCAT TCGTACCTAC ACAGCGATGA
TGATCATACA CGAGAACCGA CGTCTAGTCT TGAAAAGTGC CGCATTCCTC ACTTTACTGG
AATGCCGAAC GTTGTCGACT TCCTTGTGCT CTTTCTTAGC AAGTGTCACG AAGGAGCACG
GTGATGCGAA AATAATTTTG ATGAAGACCG TACGTTACTT TTCGCTCTTC ATTGAACAAG
ACTCAAGAGA TGCCAGGAGC GCACGCTCAC CATCCTCACT GAAGCATTTG GTGCCTCTGA
TGTAGTCTTG ATGCGGGCGA TTTGTGAACT CTTCTCCGCG ATTTTAATCG ACGATGACAG
AGAGAGTTCA ACAAGCAATA AATATAGACA TGCGCTTATA TTACACGAGC ATGGATTGTT
GGCGAAAGTT AATGACGGTA TGTAGTAATT TCCGTCCCTG TGAAATCAGC AATCTGAGGA
ACCGCAGCTA TTCTCCAACA AGTACAGTCG AGTCAGCTTG AGACATCTGT GGTCTTATTC
AAGCTAGTAA GTAAAGCTTC CCTATGCAAG AATCTTGTTT CACAGCATAT GCAGTTTTCT
CACCTGTTGA CAAGCGAGAA GATATGTAGG AGCGTGCCAG AAGATGTTTT AAGTAGTTTA
CTGGTATGTG AATTTGATGT TGTGTCGATG TCAAATTTTG TCATGTCCAC ACAGGCAATA
ATATCTCGTC GAAGGGACGC AGTATTACTG AACCATTGCT GCAGATGCGT TCGGCGCCTT
ATATTGTCCG ACTCGCGCAA AGAAATCCTG CTAAAGTTCG ACGTTATCAA AACTTTTGTC
TCCTTAATGA CGTGCGAAGG TGAGCGGTTT ACTACTCTAT CGCATTGTAT CTGAGTGTAA
ATTACCCCAG TAAGTTCAGT GCGTGAGAAC TCGATCAGTA TTTTGGCGGC TTTGGCGCTT
CGGAATGTAG AAGTTTCTAA TGAATTGAGA GCAAATGGTG GGGTCGATCA GCTGTTAGAG
TTGATGCCAC AGGAAACTTG TGGTAAAGTT CTTCGTCAAT GCTGTATTCT CATCAGGAAT
ATTGCGGTTA GAAATCAGGA TACCAGGGTA TGAACATCAT TGTTTTACCT TCTTATTTTT
TAAAAGTAGC ATATACTAGG ATTATCTCGT GGCCAATAAT GTGACAGCTC AACTCCACGA
CCTGAAGGCT TTGCATCCGA GGTCATGTAT TGATGTTGGA AGTGCGGCAC TGCGCGATTT
AGGTTGCGAT GACTACGGTC GTGGCTGGGT GCCAAGAACA TTGGTCATGG GCACAGATGG
ATCGATTATA AATTCTAGTG ACTTGAACTA G
 
Protein sequence
MRLSQEAYDE AVTSYVENFG MDATTAQIEA REEFALQGFD ISSIATSLSQ VNCNTFTAVV 
QTFLTSICPL AEIEDETWGN SEEDKLTSAI VSFESYLSDQ SVFRPNIVDF ACTAGQREII
EILLKFCCKT TLAQTLRDRA LNCLTAMMII HENRRLVLKS AAFLTLLECR TLSTSLCSFL
ASVTKEHGDA KIILMKTERT LTILTEAFGA SDVVLMRAIC ELFSAILIDD DRESSTSNKY
RHALILHEHG LLAKVNDAIL QQVQSSQLET SVVLFKLFSH LLTSEKICRS VPEDVLSSLL
AIISRRRDAV LLNHCCRCVR RLILSDSRKE ILLKFDVIKT FVSLMTCEVS SVRENSISIL
AALALRNVEV SNELRANGGV DQLLELMPQE TCGKVLRQCC ILIRNIAVRN QDTRDYLVAN
NVTAQLHDLK ALHPRSCIDV GSAALRDLGC DDYGRGWVPR TLVMGTDGSI INSSDLN