Gene OSTLU_29609 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_29609 
SymbolARP3504 
ID5006676 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009374 
Strand
Start bp521006 
End bp522346 
Gene Length1341 bp 
Protein Length407 aa 
Translation table 
GC content59% 
IMG OID640422097 
Productpredicted protein 
Protein accessionXP_001422774 
Protein GI145357127 
COG category[Z] Cytoskeleton 
COG ID[COG5277] Actin and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.00127301 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CATCGCGCCG CTCCGCGCGA GTGACGGCCC GAAGCGCGCC CCGCGCGACG CGACGGCGCC 
GGCGAACGAA CTCCAATTCC GAATCCAAAG AATCGCACCG GCCATCGATT CATCGACATG
ACGGACGCGA GCGGCGCGCT CCCGGCGCGG TTCGCGACGT CGGTGGGACT GACGAACCGT
GAGAACGTCG TGGTGTGCGA TACGGGCACG GGATTCGTCA AGGCTGGGTA CGCGGGGGAT
GAAGAACCGC GAACGCTGTT CCCGTGCATG GTGGGACGAC CGACGCTGCG GTACGAGGAA
GACGCGTTCG ACGATGAAGC GATGAAAGAC GTGTACGTCG GGGACGAGGC GGCGCGAAAA
CGCGCGAATT TAGAAATTTC GTACCCGGTG TCGAACGGGG TGGTGCGAGA CTGGGAAGAT
ATGGGATTGG TGTGGGACAG GGCGTTTGAG AGTTTGGGAT GCGATACGCG CGAGTGCAAG
GTGATGCTCA CGGATCCGCC GTTGAACCCG AAATCGAATC GCGAGCGCAT GATGTCGACG
ATGTTTGAGA CGTATGGGTT TCGAGGGGCG TACGTCCAAG TGCAGGCGGT GTTGACGCTG
TACGCGCAAG GATTGATGAC GGGAGTCGTC GTGGACTCGG GCGACGGCGT CACGCACGTC
GTGCCGGTGG TGGATGGATA TTCGTTCCCA CATCTCACCA AGCGATTGAA CGTGGCGGGA
AGGCACGTGA CGACGCGAAT GATTGATTTG TTAACGCGTC GAGGGTACCC GCTGAATCGA
ACGTCGGACG TGGAAACGGC GCGTTTGATC AAGGAGGAGT TGTGTTACGT CGCGTACGAC
TACAAGCGCG ATTTGCAGTT GGCGCGAGAG ACGACGGCGA CGAACGCGTC GTACACGCTG
CCGGACGGGC GAGTCATCAA ATTCGGCCCG GAACGGTTCA TGGGTCCCGA GTGTTTGTTC
CAGCCCGATC TCATCGACGT CGAAAGCGAC GGGATCTCCG ACCTCGTCTT CAAGTGCATC
CAAGAAAACG AAATCGACAA TCGACGGTCG CTGTATCAAC ACATCGTCTT GTCCGGCGGG
AACTCCATGT ACGCCGGCTT ACCCTCGCGA CTCGAGCGCG ACATCAAGCG CCTTTACCTG
AAAAACGTCT TGAACGGCGA CAAAGAGGCG ATGAAAAAGT TCAAGATGAA AGTCGAGGCG
CCGGCGCACC GCAAGCACAT GGTCTTCGTC GGCGGCGCCG TCTTAGCGGA CATCATGCGA
AGCAAGGACG AGTTCTGGAT CTCCAAGCAA GAGTACGAGG AGCAAGGCAT CGAACGAGCG
TTGAAAAAAT GCGGCATGTG A
 
Protein sequence
MTDASGALPA RFATSVGLTN RENVVVCDTG TGFVKAGYAG DEEPRTLFPC MVGRPTLRYE 
EDAFDDEAMK DVYVGDEAAR KRANLEISYP VSNGVVRDWE DMGLVWDRAF ESLGCDTREC
KVMLTDPPLN PKSNRERMMS TMFETYGFRG AYVQVQAVLT LYAQGLMTGV VVDSGDGVTH
VVPVVDGYSF PHLTKRLNVA GRHVTTRMID LLTRRGYPLN RTSDVETARL IKEELCYVAY
DYKRDLQLAR ETTATNASYT LPDGRVIKFG PERFMGPECL FQPDLIDVES DGISDLVFKC
IQENEIDNRR SLYQHIVLSG GNSMYAGLPS RLERDIKRLY LKNVLNGDKE AMKKFKMKVE
APAHRKHMVF VGGAVLADIM RSKDEFWISK QEYEEQGIER ALKKCGM