Gene OSTLU_40777 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_40777 
Symbol 
ID5002546 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009360 
Strand
Start bp463366 
End bp466738 
Gene Length3373 bp 
Protein Length1041 aa 
Translation table 
GC content55% 
IMG OID640417967 
Productpredicted protein 
Protein accessionXP_001418490 
Protein GI145348092 
COG category[L] Replication, recombination and repair 
COG ID[COG1643] HrpA-like helicases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.105626 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0398224 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACACCCA AAAATATGCT CGCACAGTTG GCGAAGCGTG AAGGATGGCT CGTACCACGA 
TTCGAACGGG ACAAGAGCGC CGAAGGCATC CGATACGTCG TTTCTGTCGA ACGTTCAACC
GGGCCAAAGT ACAAACGCAA GCCGACGCTT GTGTGTTCTA CGCACGACGA GGACGAACCT
ACAGCTGGCG GTTGGTTGAG CATTAACGAC GCGCAAAATG GTGCGGCTAC GCGTGCGTTG
TTTGAAGCGT CATTTAGCAA GGAAAACCAA ATCGTGCCGT TGGAATTGTC CGAGACGTAT
CGCGATTTGT GGGTTCGGTG GGCCATCGCG TTCATGAATT CAGACGTCGC GGATGTCGGT
GAGAACGCGA GAGACGATTT TATAGATGCG CTCATTAAGA CAATAGGCAC CGATGGCTTG
AACGTGCCCG ATGCTAGTAA CGGGGGTACA TATGTAGATG AATTGACGGC AAAAGCTCGC
GAGTTAAAAC TTAAAGAAGA GAATGATCGT TTGCGCACGG CCAGGGCAAA CATGAACAAG
CATGCGGAGG TGAGTCGACG ACTGAAAGAG CACCTCGAGG CGACAAAGCA GGATCCGCAG
TGGAAGAAAC TGTTTGCGAA GCGCTCGACG CTCCCTATTT GTGCGCTGGC TGACGAGCTT
CTCGATCGGC TTCGTTCTCA CGATGCTGTC GTCGTATGTG GCGAGACAGG TTGCGGTAAG
ACGACTCAGG TACCACAATT TCTTCTCGAC GACGCCATTG AACGAGAGCA GGGCGGTGCT
TGCAATATCG TGTGCACGCA ACCACGTCGT GTTGCGGCTA CAAGTATCGC CGAGCGCGTA
TCTGCCGAAC GATGCGAAAA AAACGGCGTC GGGGGTAATG GTTCGCTTGT CGGGCACCAC
GTCCGATTGG ATGCGAAAAT AACCAGCGCG ACACGTCTCA CTTTTTGCAC CACTGGTATT
CTGTTGCGGA GGTTGCAAGG CGATAGAATG CTTACCGATG TCACGCATGT CGTCGTCGAT
GAAGTCCACG AGCGCTCTCT AGACGGTGAC TTCTTGTTGA CGCTTCTCAG AGATCTTCCA
AGACGTCGCC GCGAAGCTGG TTTGCCTCCT GTGAAACTCG TGCTCATGTC TGCCACTTTG
AACGCAGCTT TATTCAGCGA GTATCTCGGT GGCTCGCCGG TTATATCAGC GCCCGGGCGC
TCGTTCCCCG TAGATACGAT TCATCTGGAA CATATTTATG ACACATTGGA TTATGTCATC
GACCCGGATA ATCGTTCATG TCGACGACCA AAGGGCAAAG CCGAAGATGC GATGAAAGCC
ATCAAGGCGG GCGGCGGGGG CGATAGGCGG CGCCAGAACG AGCTCTTGGG GTCTTGGGGC
GAAGACGCGG CGTCGGAGTT CGGTGGTGAA GAGAACCCAG AAAATCCAGA TTATGACTCG
AGCAAGTACG AGTATTGCAA ACGGAATACA CGATTGTCGC TGTCTCGCCT GGATGAGTCC
GTCATCGATT ACGACTTGAT CGAAGAGCTT CTCGCGTACG TCGACGACGT CACCGACGAT
GGGGCAGTTT TGGTATTCTT ACCCGGTATC GGCGAGGTGA CAGGGCTTTT GGATCGTCTC
GCCAGCTCAC CGCGATTTAA AGATGCGGTG CTCACGCCAT TACACTCCGC GTTGACGAAC
GCCGAGCAGC GCGAGGCGTT CAGGGTGCCG AAACCCGGCG TGCGCAAAAT TGTGGTGGCT
ACGAATGTGG CAGAAACGTC GGTGACGATC GAAGACATCG TCGTCGTCAT CGATTCCGGT
CGCGTGAAGG AGCGACAGTG GGATCCTCGA CGAGGTATGG CTTCGCTTGA GGAGGGATGG
GTCAGCCGCG CAGCGGCGAA ACAGCGCGCC GGTCGCGCCG GTCGAGTTCG AGCGGGAACA
TGCTATGCGC TCTTCACCTC GCATCGCGCA AACGGCGCGA TGCGGCCATT CCAAGTTCCT
GAAATGCACC GCGCGCCGCT CACGGAAGTC GTACTGCAGA TCGCAAGTCT CGATTTGCAC
AGCGACGCCG CAGTGGTTCT CGGGAACGCA CCCGAGCCTC CAAAAGAGGA AGCTGTTGCC
GCGGCGAAGA AGACGCTCAC CGAGATTGGC GCTTTCGACG AGCTAGGTCG ACTTACTGCC
TTAGGTCGCC ACCTCGCTGC ACTGCCAGTA GATGCGAGGG TTGCGAAAAT GCTATTATTC
GGAGTGATTC TACGGTGTCT CTCTCCGATT CTAACCATAG CCGCAACGTT GAGCTACAAG
TCGCCCTTTC AATCGTCCAA GGCGTCGAAC AGTCAAGTCG AAGCGGCGAT GCGTGCGTTC
GCGCAACCAG CCGCGGTGTC CTTGGCCGCC GGGCAGCAAA GCGATCACCT AGTGGTCGTC
GCCGCTTACG ACGGCTATAT CGAAGCATCG AAAGAAAGTC GCAACGCCGG ACGAAGATTT
GCGCAGAAAA ATGCGCTCGA CGTGGACACG ATGAGACAGA TTTCAGAAAT GCGCACGCAG
TACGCCGCGC TTCTCGCGGA CATGGGCGTC ATTCGAGTCC CCGCCGGTTA CTCACTTCGA
GGTAGAAACA CAAATTGGTT GGATGATCCT AAAGCGGTAC GTTAACACTT GACTTCACGA
TTTCTCGTAC TCGGAGATGG GTGCTTCACT CGAAGCGCCC CGGGCGTGTG CGCATGCAGC
CACCTGCGCG CGCGTAAAGT TATTTCCGTA CATCGACGAT GAGGCTTTTC GATGACGCGT
ATAGAAATAC GCGCACATCC CAGCAGCTTC GCAACATTTA AGTTTCATAG CTCGTCTCGT
TGGATGATTA TCCCTGACAT TTCGCATTTT TATTCGCACG CAGGCTTGGA ACAAAGACGC
GCGCCGCGTA CAAATGATCA AGGCTGTGCT CACGGCGGGC CTGTACGCAA ACGTCGCCGT
CGGCGATGAG GCATCGGATC AAGACTACGC GCAGTACACG TGGAAGGACG CAACGTCAGA
GGTGCGCGTG CACCCGTCGA GCGTGAACAA AGGGATCGGA ATCGACCGCA AGCCCGCGTA
TCCGTTCATG GTGTATCACG AAAAGATGCG AACGGCACGC GTGTACTTGC GAGACTGTAC
TGTCGTCGCA CCCGAGGCGT TATTACTTTT CGGTGGAAAC CTCGAGGTGC AGCACGCGAA
CGCACGCGTG ATCATGGATA ACTGGATCAA GTTCAAGTGT GACGCACCGG TGGCGGTGTT
ATTCAAGTAC CTTCGCCTCG CGCTCGACGA AGATTTCGCC AAACGAATCC GAAACGCGGG
CAAGTCGTCC TGGAGCGACG ACGACGACGA AATCATAGTC ACAATCAGAC GAATCCTCGA
CGACGTGCAA TAG
 
Protein sequence
MTPKNMLAQL AKREGWLVPR FERDKSAEGI RYVVSVERST GPKYKRKPTL VCSTHDEDEP 
TAGGWLSIND AQNGAATRAL FEASFSKENQ IVPLELSETY RDLWVRWAIA FMNSDVADVG
ENARDDFIDA LIKTIGTDGL NVPDASNGGT YVDELTAKAR ELKLKEENDR LRTARANMNK
HAEVSRRLKE HLEATKQDPQ WKKLFAKRST LPICALADEL LDRLRSHDAV VVCGETGCGK
TTQVPQFLLD DAIEREQGGA CNIVCTQPRR VAATSIAERV SAERCEKNGV GGNGSLVGHH
VRLDAKITSA TRLTFCTTGI LLRRLQGDRM LTDVTHVVVD EVHERSLDGD FLLTLLRDLP
RRRREAGLPP VKLVLMSATL NAALFSEYLG GSPVISAPGR SFPVDTIHLE HIYDTLDYVI
DPDNRSCRRP KGKAEDAMKA IKAGGGGDRR RQNELLGSWG EDAASEFGGE ENPENPDYDS
SKYEYCKRNT RLSLSRLDES VIDYDLIEEL LAYVDDVTDD GAVLVFLPGI GEVTGLLDRL
ASSPRFKDAV LTPLHSALTN AEQREAFRVP KPGVRKIVVA TNVAETSVTI EDIVVVIDSG
RVKERQWDPR RGMASLEEGW VSRAAAKQRA GRAGRVRAGT CYALFTSHRA NGAMRPFQVP
EMHRAPLTEV VLQIASLDLH SDAAVVLGNA PEPPKEEAVA AAKKTLTEIG AFDELGRLTA
LGRHLAALPV DARVAKMLLF GVILRCLSPI LTIAATLSYK SPFQSSKASN SQVEAAMRAF
AQPAAVSLAA GQQSDHLVVV AAYDGYIEAS KESRNAGRRF AQKNALDVDT MRQISEMRTQ
YAALLADMGV IRVPAGYSLR GRNTNWLDDP KAAWNKDARR VQMIKAVLTA GLYANVAVGD
EASDQDYAQY TWKDATSEVR VHPSSVNKGI GIDRKPAYPF MVYHEKMRTA RVYLRDCTVV
APEALLLFGG NLEVQHANAR VIMDNWIKFK CDAPVAVLFK YLRLALDEDF AKRIRNAGKS
SWSDDDDEII VTIRRILDDV Q