Gene OSTLU_41017 details

Gene Information

Locus tag: OSTLU_41017
Symbol: ARP3502
ID: 5002492
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009360
Strand:
Start bp: 407678
End bp: 408754
Gene Length: 1077 bp
Protein Length: 358 aa
Translation table:
GC content: 64%
IMG OID: 640417913
Product: predicted protein
Protein accession: XP_001418472
Protein GI: 145348055
COG category: [Z] Cytoskeleton
COG ID: [COG5277] Actin and related proteins
TIGRFAM ID:
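
The gene and protein lengths listed above can be cross-checked against the start and end coordinates. The sketch below is only an illustration: it assumes the coordinates are 1-based and inclusive, and that the last codon is a stop codon that is not counted in the protein. Neither assumption is stated in the record itself.

```python
# Values copied from the record above.
start_bp = 407678
end_bp = 408754

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1077 358
```

Under these assumptions the result agrees with the reported 1077 bp and 358 aa.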


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.000637788
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.733987
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGAGCG ACGCGCCGGT GACGGTGCTG GACGTCGTGG ACGGCGATCT GCGGGGCGGG 
TACGCGCTGG ACGGCGTCGC GCGCGCGCCG AGCGCGACGC GACGCGGTCG AGTGCGCGCG
AAAAGCGCTG GACGCGGGAA CGGAAGCGAC GACGGCGCGT TCGACGCGAT CGCGCGGTCG
CGCGTCGAAG ACGTGGACGC GTACGAGTGC GTCGTGCGCG CGATGACGTA CGGCGACCTG
GGATGGGAGC GAGGGAGCGA GGGGATGGTG GTGGCGTGCG AGGCGAGCGG GACGTCGAAC
AGGACGCGCG AGCGAACGGC GAGGATGTTC TTTGAAGAGT TCAACGTCGG AGGGTTGGCG
TTTTTAGATA AGGCGGTGTG CGCGATGTAT GCGTGTGGAC GCGCGAGCGG AGTGGCGATC
GATGTGGGAG AGCAAGGGGT GGAGTGCGCG TGCGTGGTGG AGGGGGCGAC GGCGCACTCG
ACGACGAGGC GGAACGACGA CGCGGGAGGA CGAGCGATGG ATCGCGCGCT GGTGCAGGCG
GTGAGGAAGA AACAAGGGAT TGCGTTAGAT TTGACGACGG CGAGTGATAT TCGTCGGAAG
TTGGGGAAAT GTGCGGCGAC GCGGGAGGAG TACGAGGCGT TGGCGCGAGG GTGCGCCACC
GTGGAGTGCG AGCAAGAGAC ATTCGCCATG CCGGATGGAA GCGCGCTAAA GCTGACGAAC
GAATTGTACG AGTGTGGAGA GGCGGTGATG CCGATCGTGG ACGACGTGTG CGAGTGCGTG
CAAAAGTGCT CGTCAGAATT GAGACGGTTC GTGTTGGACA GCGTCTTCGT GCACGGCGTG
GCCAGCAAAG TCTCTGGGCT TGATGCTCGC TTGTTTCACG AGCTCACGTC GAGTTTGCCG
CCCTCGTTGA CGCCAACGAT GGTAAACATT CCGGAGTACA TGCCAGAAAC CACGTGGTCG
CACGCGCCTT GGACGGGCGC CGCGATGGCG GCGAAAACCA TCTTCGCTTC GAACCAGTAC
ATTTCGAAGA GCGATTATAC CGATAACGGA CCACCGATCG CGCATCGCGG GCGTTAG
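
A GC content figure like the one reported for this gene could be recomputed from the sequence with a small helper such as the one below. This is a minimal sketch: the function name `gc_content` is hypothetical, and the database's exact method (for example, how it rounds) is not documented here. The helper ignores the spaces and line breaks used to group the sequence into blocks of ten.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    # Keep only nucleotide letters; this drops spaces and line breaks.
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First block of the gene sequence above: 10 bases, 7 of them G or C.
print(gc_content("ATGGCGAGCG"))  # prints: 70.0
```

Passing the full gene sequence as one string gives the value for the whole gene, which can then be compared with the reported percentage.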
 
Protein sequence
MASDAPVTVL DVVDGDLRGG YALDGVARAP SATRRGRVRA KSAGRGNGSD DGAFDAIARS 
RVEDVDAYEC VVRAMTYGDL GWERGSEGMV VACEASGTSN RTRERTARMF FEEFNVGGLA
FLDKAVCAMY ACGRASGVAI DVGEQGVECA CVVEGATAHS TTRRNDDAGG RAMDRALVQA
VRKKQGIALD LTTASDIRRK LGKCAATREE YEALARGCAT VECEQETFAM PDGSALKLTN
ELYECGEAVM PIVDDVCECV QKCSSELRRF VLDSVFVHGV ASKVSGLDAR LFHELTSSLP
PSLTPTMVNI PEYMPETTWS HAPWTGAAMA AKTIFASNQY ISKSDYTDNG PPIAHRGR