Gene OSTLU_37931 details


Gene Information

Locus tag: OSTLU_37931
Symbol:
ID: 5004207
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009364
Strand:
Start bp: 73488
End bp: 74948
Gene Length: 1461 bp
Protein Length: 486 aa
Translation table:
GC content: 55%
IMG OID: 640419628
Product: predicted protein
Protein accession: XP_001419893
Protein GI: 145351036
COG category: [L] Replication, recombination and repair
COG ID: [COG1112] Superfamily I DNA and RNA helicases and helicase subunits
TIGRFAM ID:
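
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and not part of the database record. It assumes the start and end coordinates are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The variable names are hypothetical.

```python
# Values copied from the record above.
start_bp = 73488
end_bp = 74948
reported_gene_length = 1461      # bp
reported_protein_length = 486    # aa

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: unspliced CDS whose final codon is a stop codon,
# so the protein has one residue fewer than the number of codons.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

assert remainder == 0, "CDS length should be a multiple of 3"
assert gene_length == reported_gene_length
assert protein_length == reported_protein_length
print(gene_length, protein_length)  # prints "1461 486"
```

Under these assumptions the reported values are mutually consistent: 1461 bp gives 487 codons, of which 486 encode amino acids.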


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value: 0.310178
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCGCGA GGACTCGTGG AAACGTTCTC GCGTTGTTTA CCCAAACAGA ACGCGCCGCT 
ATCATTCGTC GACGCATCAT CGATTTGGAG CGACCCAAGT TCGCTCCAAT TGGTGACCAT
AAGAGGAAAA TCGAAGTCGC ACTCAAAGAT CTCGGCTTCC CTTTGAACGA CGAGCAGCTT
TTGGCAGTCG AAAAGATCGT TACGGCTGAA GATTACGTCC TCGTGCAGGG CTTTCCGGGT
GCCGGCAAAA CAGCCATGCT CGTCGCCGCC GTGAAAGCTC TTCGAGCGCA AGGAAAATCG
GTGTTGATCA CGTCCCACAC GCACAGCGCC ATCGATAACG TCTTCTCTAG GCTTCCAGGA
GTTGGAGTTC ACGAGTTTAT GCGCATCGGC GACGAAGTGA AAGTCGTCGA CGCCGTGCAA
GAATACAGGC TTGGATCAAA GCGATGGCCG TGCTCGAATT CGGATGACTT GCGTAAGGTA
TCAGAGCGCG CTATGGTCGT CGGTGCAACG TGCCACGCCA TGGGTCACGC GTATTTCCAG
CGTAAAATGT TTGACGTTGT TCTCATCGAT GAGTCGGGGC AGATTACGCT TCCGAGCATT
CTCCCTCCGC TCTTTGCGGC AAAGACGTTC GTGCTCGTCG GCGATCACCA TCAGCTCCCG
CCGCTCGTGA AGTCAAAACA GGCAATTGCC GGTGGACTTG GTCGATCTCT TCTTGCCATG
CTGTGTGACG CACATCCAGA CATGGTGACG AAATTGTCTT CGCAGTACCG CATGGCAGAG
CCGCTGACTC GATTACCAAA CATTTTGACG TACGACGGAA AGTTGCGCTG TGGTACGGAG
TCGGTCGCGA AACAGTTACT TTCGCTCGCG CCTCTCAAAG ATGCTTTCTC CTGCGCTCCG
CAGTGGTTAG CGCACGTCAT GAATCCCGCG AATCACACCG TTTTCTTGGA TACGAGCGCG
CTCGGCGCCG CGGCGCGCGA AACGCCTAAG CCGTATATTA ATGAGGCAGA GATGGACTTG
GTGCTGACGA CAGTCAGTGC GCTCACAACT CACGGCGCCA CGAGTGTGTG CGCATTATCA
CCATTTAACG CCCAGGTCGA CGCCATCAAA GCTCGCTTGA ACGGATTCAA AGCGCTCAGC
GGCGACGACG CGCCTCGAGC GCTCACAATT GACAAAGCGC AAGGCCAGGA TATGGACGCG
GTGTGCATCT CCTTCGTCTG TTCGAACGAT GAAGCCAAGG TGAACGTTTT GCTGAATGAC
ACCAGTCGCT TTAACGTTGC GATCACGCGT GCAAAGAAAA AGCTGATCCT CATCGGTAAC
GCCGAAACGT TGCGATCCTC TCCGGTGCTC GCGCGAGCGC TCGAATTTTA TCGCGCCGAG
GGGTGGATCG TGCCGCTCAC CATCGAAGCG CTGGATTTCA CGAACTTCGT CCTCGGGACG
TTCGAACCGA GCGTTCTCTA G
 
Protein sequence
MAARTRGNVL ALFTQTERAA IIRRRIIDLE RPKFAPIGDH KRKIEVALKD LGFPLNDEQL 
LAVEKIVTAE DYVLVQGFPG AGKTAMLVAA VKALRAQGKS VLITSHTHSA IDNVFSRLPG
VGVHEFMRIG DEVKVVDAVQ EYRLGSKRWP CSNSDDLRKV SERAMVVGAT CHAMGHAYFQ
RKMFDVVLID ESGQITLPSI LPPLFAAKTF VLVGDHHQLP PLVKSKQAIA GGLGRSLLAM
LCDAHPDMVT KLSSQYRMAE PLTRLPNILT YDGKLRCGTE SVAKQLLSLA PLKDAFSCAP
QWLAHVMNPA NHTVFLDTSA LGAAARETPK PYINEAEMDL VLTTVSALTT HGATSVCALS
PFNAQVDAIK ARLNGFKALS GDDAPRALTI DKAQGQDMDA VCISFVCSND EAKVNVLLND
TSRFNVAITR AKKKLILIGN AETLRSSPVL ARALEFYRAE GWIVPLTIEA LDFTNFVLGT
FEPSVL
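
The relationship between the gene sequence and the protein sequence listed above can be reproduced with a short translation routine. This is a minimal sketch, not the database's own method. The record leaves the translation table field empty, so the standard genetic code (NCBI translation table 1) is an assumption here. The helper names `translate` and `gc_fraction` are hypothetical.

```python
from itertools import product

# Assumption: standard genetic code (NCBI translation table 1);
# the record itself does not state which table applies.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(dna: str) -> str:
    """Remove the spaces and line breaks used in the listing; upper-case."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(dna)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq) if seq else 0.0


# First 30 bases and last 21 bases of the gene sequence above.
print(translate("ATGGCCGCGA GGACTCGTGG AAACGTTCTC"))  # prints "MAARTRGNVL"
print(translate("TTCGAACCGA GCGTTCTCTA G"))            # prints "FEPSVL"
```

Passing the full gene sequence to `translate` should yield the protein sequence if the standard-code assumption holds, and `gc_fraction` on the same input can be compared against the GC content field.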