Gene OSTLU_24370 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_24370 
Symbol 
ID5001214 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009358 
Strand
Start bp219574 
End bp221846 
Gene Length2273 bp 
Protein Length732 aa 
Translation table 
GC content59% 
IMG OID640416635 
Productpredicted protein 
Protein accessionXP_001417182 
Protein GI145345361 
COG category[R] General function prediction only 
COG ID[COG5271] AAA ATPase containing von Willebrand factor type A (vWA) domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value0.29712 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CGCGCGCGGC GCGCGGGCGA GCGTTGACGA CGACGATCGC AACGACGACG ATGACGACCG 
TGGACGCATC GATGCCCGCC GCGATGCCGA GCGGCGCCGC GACGACCACG GAAACGATCG
ACGTTCCGGA CGTTGGCGCG GTCGACGACG ACGCGCGACG ACGCGCGACG ACGCGCGACG
AGAGCGCGAG CGAGGACGAA GATGAGGAAT ATCTCGGACG GATCCGTCGA CGCGCGCGCA
CGCGCGAGGA TGAGAGCGAG AGCGAGGAGG ACACGCGAGA GGGCGACGGG TCAGAGGACG
GCGAAGGCGA GGCGCCGCGA GGGAAAGGCG AAGGACGACG ACGACGATTG AAACGCGCGA
ATGGTGAACG CGTCGAACGA GACGTTGATG GAAGTGACTC GAGCGATTCG AGCGAGAGCG
AATTGGATGA GGAGATGATG ATGGAACGTA AGTACGCCGA GACGGGGCGG TTGACGGATG
AAGACGTCGA GGACGAGGCA GAGGAGACCG TTCGTGGAGC GTATGACGAA GTGGGTGTGG
GATATTCGTC CGATTCGTCG CTTGGATCAC CCGTAGCGGG CGATGATGTG CCGGCGAGCG
GTAGTCGGCC GAGGAAAATG ACGAAGAAGC AGATGAAGAA GGAACGAGAC GCGTTGGCCA
AGGAACAGGA ACGCATGATG AAACGAGCGC AAAGACGGGC GAAATTCCCG GGCTGGGACG
CCGAAGTGGT GCGCGTGTCT TATTTACCGT TGATTGACAA GCTTCGCGCT GCGGTGGCGC
ACATCAAGCA CGACGGACTA GTTATGGGAG AAGACTCGCC GGCAAAAGAG GCCGACAAAG
CCACAAAGGC GGACACGCCC ACCGTCGCCC CGCTTGCAGA CTCCGAAGAT GACGAAGCGG
ACGACGGTGA ACCTAAAGCA AAGACAGCCG AAGTCGTAGA AATCGATCTC GATGACGACG
AAGAAGAAGA TGACGATGCA TTACTCAAGG AAATCTTGGC AAAGAAAGCA GTCGCGGTGG
CGCAAAAGCC ATCCGAAACG GTGATGACTG AGGAACCAAC GCTTGTCGAA CAAGAGGACG
GAGAGGACGC GGATGACGAG TCCGAAGAAG ACAGCGAAGA TGATTTATCC GAGGAGGAAG
ATATGACGGA AGAAGAGCGT CGAATGCAGC GTAAGGCGGC GAAGAGATTC ATCAAGGCGG
ATAGAAGATC GCACAGAGCC GCCGCCACCA CGGGCGACGT CTTTGAAGAT GAAGCGGAGA
TGTCCGAAGA TGGTGGGCAC ACCGACGACG ATGATGATGA TGATATTCAG GATGACGTTG
ACGATGTCGC CGACGCTATC GATTTCCGCG AAGAACAGCC CGAAGACGAG CGCCGTGCCG
CGGCTCGTGC GCGCGCCTTC GCAAAGGAGC AGCAAGCCCA AGACGACGAC GAGCTCGAAA
AGATGAAGCA GATGGTTGGT AACGGTTTCA AACGCAAAAA GAATGGTCTG TTCGACTCTG
AAGACGCGTG GCAGCGTAAG AGACGCAATG CTAACGGCGA AGAGGAATCT GATTCGGATG
ACGTCGACTA TGGTCCCGTC ATCGAGCGTC CCGAAGAGGC CGTCGAATTA TCGGACGACG
ACGACGGTGA GTGGCGCGAA CAAGCCAAGC GTCGTCGTGC CCTGCACGAA TCCGGTACAC
AAGAGTCTCT TGAGCTACCA AATGCGTTCG AAGGTAACGT CAGTCAAGAA GTCTACGCCG
CCATCAAGGC TCCTCGCATG AATTCGTTCC ACAGCGAGTC GCAAGACACC ACGCAAGCGG
AAGGATGGGA AGCCCCGCTC CCTCGAGCGC AATCTATGCC CGCGGCTTTG GTCCGTACGT
CGAGCCATCT CGGCAGCGGA AGTTCTGCAA TGCTTGCCCG TCAATCATCG AAAACGTTCT
TGGGTAAGAA ACGACAAGTC ACCAAGGCGA CTGGCGCGCT GCTCGGCAAC AGCCAGGCTT
CACGCTCATA CGTCTTCGGC CGAACGGATA GCCAAAGTCA ATGGGGAGGC GACGATTCGG
GTCCCGCGAC CACGTTCAAG GAAATCGGTC GCGATGAAGA TGCGCGCGCT TTCGGTTCGA
CGAACATGGG ACCAACGAGA CCGAACGCGT CCGAACCGAA GAAGAAGCCA TCCTTATTCG
CGATGGTGAG CCAGACCGCG GGCGAGAACA ACGCCCGCCC TCGAGCGGAG GACGTTCAAA
AAGCGATGAA GGCAGCGTCG GGGAAATGAT TTATGTAACG AAGATAAGCG CAC
 
Protein sequence
MTTVDASMPA AMPSGAATTT ETIDVPDVGA VDDDARRRAT TRDESASEDE DEEYLGRIRR 
RARTREDESE SEEDTREGDG SEDGEGEAPR GKGEGRRRRL KRANGERVER DVDGSDSSDS
SESELDEEMM MERKYAETGR LTDEDVEDEA EETVRGAYDE VGVGYSSDSS LGSPVAGDDV
PASGSRPRKM TKKQMKKERD ALAKEQERMM KRAQRRAKFP GWDAEVVRVS YLPLIDKLRA
AVAHIKHDGL VMGEDSPAKE ADKATKADTP TVAPLADSED DEADDGEPKA KTAEVVEIDL
DDDEEEDDDA LLKEILAKKA VAVAQKPSET VMTEEPTLVE QEDGEDADDE SEEDSEDDLS
EEEDMTEEER RMQRKAAKRF IKADRRSHRA AATTGDVFED EAEMSEDGGH TDDDDDDDIQ
DDVDDVADAI DFREEQPEDE RRAAARARAF AKEQQAQDDD ELEKMKQMVG NGFKRKKNGL
FDSEDAWQRK RRNANGEEES DSDDVDYGPV IERPEEAVEL SDDDDGEWRE QAKRRRALHE
SGTQESLELP NAFEGNVSQE VYAAIKAPRM NSFHSESQDT TQAEGWEAPL PRAQSMPAAL
VRTSSHLGSG SSAMLARQSS KTFLGKKRQV TKATGALLGN SQASRSYVFG RTDSQSQWGG
DDSGPATTFK EIGRDEDARA FGSTNMGPTR PNASEPKKKP SLFAMVSQTA GENNARPRAE
DVQKAMKAAS GK