Gene OSTLU_40957 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_40957 
Symbol 
ID5002213 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009360 
Strand
Start bp565769 
End bp566845 
Gene Length1077 bp 
Protein Length358 aa 
Translation table 
GC content56% 
IMG OID640417634 
Productpredicted protein 
Protein accessionXP_001418524 
Protein GI145348161 
COG category[R] General function prediction only 
COG ID[COG5271] AAA ATPase containing von Willebrand factor type A (vWA) domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.0367798 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.330846 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGAGC GAGAGCTCGT CGCCGGCGGT GAAGGCGAAA CTGGTGAAGA GTCGACGATC 
GCTTTACAGC TCGAACACAC TTCTCTAGGT GGAGCAGGTG AAGTTGTGCA CGAGCTCACG
CCGGAAGACG TCGAACTCGC GCGCATCGAG GCTGAGCAAA GTCTTACGCA ATGGCGTGAA
AAGTCAGCCA AAGAGCGGGG CAACGAGGCC AACGCGCAAA ACTTGTGGCG GAAGTTGGAG
CAGCTCACCA CCGCGCTTTC GGCTGAACTT GCAGAGAAAC TTCGACTGAT TCTCGAGCCA
ACGCTTGCAA GTCGACTTCA AGGGGATTAC AAAACCGGCA AGCGACTAAA CATGCGGAAG
ATTATTCCTT ACATTGCCAG CGATTTCAGA AAGGACAAAA TTTGGCTCAG ACGTTCGAGA
CCTTCGGCTC GCAAGTATCA AGTGATGCTT GCCATCGATG ACTCGCGATC TATGGCAGAA
AACCACTGCG GCCACATCGC TCTTGAGTCA ATGGTTCTCC TGGCGCGCGC CATGGCTCGT
TTAGAGGTCG GAGAGATCGG TGTCGTAGGC TTCGGCGCCG GGGCCAACGC GGTGCGCACG
CTGCATCCGC TCGGCGCGCC GTTTACCGAC CAAGAGGGCC CATCGCTCGT GAGCAAGCTG
TCGTTCGCGG AAGACAACAC CTTGGCCGAT CGACCAATGG TTCAACTCCT CGAAACCATG
GCTGCGCGAC TGAGCGTCGC GCGAGAACAA ATTCGCGGCG GCGGCACGAA GCTCCAGCAG
CTCGTGCTCA TCATCGCCGA CGGTCGATTT CACGAAAAGG AAGCCTTGCG AAGGTGCATG
CGAGAAGTTG GTGCGCAGCG CGGGTTACTC GTCGCGTTCA TCGTGCTCGA TAATCCCCAA
AACAGTCTAC TGAACATGCA AAGTGTGTCA TTCACGGGAG GCAAACCGGT GATGAAAAAG
TATTTGGACT CGTTTCCGTT TCCATTCTAC GTCCTCGTGC AAGACGTGTC GCAGCTCCCG
GCGACGATAA GCGATTTGTT GCGACAGTGG TTCGAAATGG CGACGAGTCA CGATTAA
 
Protein sequence
MTERELVAGG EGETGEESTI ALQLEHTSLG GAGEVVHELT PEDVELARIE AEQSLTQWRE 
KSAKERGNEA NAQNLWRKLE QLTTALSAEL AEKLRLILEP TLASRLQGDY KTGKRLNMRK
IIPYIASDFR KDKIWLRRSR PSARKYQVML AIDDSRSMAE NHCGHIALES MVLLARAMAR
LEVGEIGVVG FGAGANAVRT LHPLGAPFTD QEGPSLVSKL SFAEDNTLAD RPMVQLLETM
AARLSVAREQ IRGGGTKLQQ LVLIIADGRF HEKEALRRCM REVGAQRGLL VAFIVLDNPQ
NSLLNMQSVS FTGGKPVMKK YLDSFPFPFY VLVQDVSQLP ATISDLLRQW FEMATSHD