Gene OSTLU_39896 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_39896 
SymbolARP3501 
ID4999699 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009355 
Strand
Start bp377282 
End bp378532 
Gene Length1251 bp 
Protein Length416 aa 
Translation table 
GC content59% 
IMG OID640415120 
Productpredicted protein 
Protein accessionXP_001415475 
Protein GI145340736 
COG category[Z] Cytoskeleton 
COG ID[COG5277] Actin and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.213761 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGAGCG ACGCGCACAC GCGCCCGGCC GTGGTGATCG ACAACGGCAC CGGGTACACG 
AAGATGGGAT TCGCCAAAAA TGTCAATCCG ACGCACGTGA TCCCGACCTG CGTCGCCGAG
AACGCGCCCG CGAGCGCGTC GAAGCGGCGG GGCGCGATGG ATGACTTAGA CTTCGCGATC
GGTGACGAAG CGATGGCGCT GAGTGGGTCG AGAGACGTGC GATGGCCGAT CAGACACGGA
CAAGTGGAGA ATTGGGAACA CATGGAAAAG TTTTGGGAGG CGTCGATTTG TCGATACCTG
AGGTGCGATC CCGAGGATCA CTACTTTTTG TTGACCGAAC CGCCGTTGAA TCCGCCGGAG
AATCGAGAGT ACACGGCGGA GATCATGTTT GAATCGTTCA ACGCGCCGGG GATGTACATC
GGCGTGCAGG CGGTGTTGGC GCTCGCGGCG AGCATAGCGA GCAAGAAGCA GAGTCAGTAC
GCGTCGGCGT TGACGGGGAC GGTGATTGAT ATCGGGGATG GGGTGACGCA CGTGATACCG
GTGAGTGATG GGTACGTGTT GGGGAGCTCG ATCAAGAGCG TGCCGTTGGC GGGGAGAGAT
TTGACGACGT TTGTACAATA TCTGATGCGA GAGCGCGGCG AACGCGTGCC GCCGGAGGAC
GCGATGGAGG TGGCGAGAAA GGTTAAGGAG GATTACTGCT ACGTGTGCAA AGATGTTGTG
AAAGAGTTCT TGCAGCACGA GCGCATGCCG GGCGAGTACG TGGTGCAAAT ACACGGCGTG
CGGGGGAAAA CCGGCGACAC GTGGACGGCG GATGTCGGTT ACGAACGATT TCTCGCCCCA
GAGGTCTTCT TCGAGCCCGA GATATACTCG TCGGACTACA TCACCCCGTT ACCAGAGCTA
GTGCACCAGG CGATTGCGTC GAGCCCGATC GATACCAGAC GTAATCTGTA CGGTAACATT
GTGCTCTCGG GCGGAAGCAC GATGTTTAAA GGATTCGGCA AGCGCATCAA ACGCGACGTC
AAAAGGCTCG TGGACGGGCG AATAGCGGCG ACGACGAAAG GCGCCACGTT CGAGTCGAAA
GAAGTCGAAG TCGAGGTCGT GACGCACAAC TTCCAGCGCA CCGCAGTTTG GTTCGGTGGA
AGCGTACTCG CGTCCACGCC CGGTTTTTAC TCGAGCTGCG TCACCAAAGC CGAGTACGAG
GAAAAGGGCG CGAGCGTCGT TCGGCAGAAT CCCGTGTTTC GAGGTATCTA A
 
Protein sequence
MSSDAHTRPA VVIDNGTGYT KMGFAKNVNP THVIPTCVAE NAPASASKRR GAMDDLDFAI 
GDEAMALSGS RDVRWPIRHG QVENWEHMEK FWEASICRYL RCDPEDHYFL LTEPPLNPPE
NREYTAEIMF ESFNAPGMYI GVQAVLALAA SIASKKQSQY ASALTGTVID IGDGVTHVIP
VSDGYVLGSS IKSVPLAGRD LTTFVQYLMR ERGERVPPED AMEVARKVKE DYCYVCKDVV
KEFLQHERMP GEYVVQIHGV RGKTGDTWTA DVGYERFLAP EVFFEPEIYS SDYITPLPEL
VHQAIASSPI DTRRNLYGNI VLSGGSTMFK GFGKRIKRDV KRLVDGRIAA TTKGATFESK
EVEVEVVTHN FQRTAVWFGG SVLASTPGFY SSCVTKAEYE EKGASVVRQN PVFRGI