Gene OSTLU_51201 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_51201 
Symbol 
ID5005156 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009367 
Strand
Start bp258262 
End bp259970 
Gene Length1709 bp 
Protein Length503 aa 
Translation table 
GC content59% 
IMG OID640420577 
Productpredicted protein 
Protein accessionXP_001421099 
Protein GI145353608 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0017] Aspartyl/asparaginyl-tRNA synthetases 
TIGRFAM ID[TIGR00457] asparaginyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones47 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.037074 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGGCGA GAGCGACGCC GGGAGCGATC AAACACGCGC GGAGCGCGCG CGCGGTGGCG 
ACGCGGGCGA CGTACGGACG ACCCGAGCGC ATCGCGCGCG TGAAGGGAAA CGACGGGGGG
GCGTCGCGCG TGGGCGAGGC GTTGGAACTG CGAGGGTGGG CGCGATCGGT GCGGACGCAG
AAGGGAATGG CGTTCATCGA CCTGAACGAT GGGAGCGCGA TCTCGGGGAT GCAAGCGGTG
GTGAACGAGG GAAGCGCGGC GTGGGACGCG CTCGACGCGG GCGGCGTCTC CACGGGGGCG
GCGCTCAGGG TGAAGGGGAA ACTCGTGGCG AGTCCGGGAG GGAAGCAAGC GGCGGAATTG
GCGGTGGAGG AGATTGATGT CATCGGAACG GCGGATCCGG AGACGTATCC GCTGCAAAAG
AAGCGACACA CGCTGGAATA TCTTCGAAGC ATCGCGCACT TGCGACCTCG CACAAACACC
ATCGGCGCCG TGGCTCGCGT GCGCAATCAG TTGGCGTACG CGACGCACAC GTTCTTTCAA
GAGCATGGGT TTTTATACGC CAACACGCCG ATCATCACCG CGTCGGATTG CGAAGGCGCT
GGGGAACAGT TTCAAGTGAC GACGCTGTTG AACGGCTTCG GCGCCGGCGA GTGGACGACG
CCGGCCTCGG TCGTGGACGA GCAAGAAGCC ATGGTGAAGG CACAGGGTGA CGCGGTGAAG
GCCCTGAAGG AGGCGAAGAA AGCGGGAGAC GCGACGAAAG AGCAAGTTGA TGAGGCCGTG
GCGAAGCTTT TAGATCTCAA GGCGAGCGTC GAGGCGCTGA AGAACAATCG CCCGTCGTCC
GACTTGCCGA AGAATAAGGA TGGTTCCATC GACTATTCTC AAGACTTCTT CGGCAAACCG
TCTTATTTGA CCGTGTCTGG GCAATTGAAC GGTGAAATCA TGGCTTGCGC GGTCAACGAC
ATCTACACCT TCGGTCCGAC GTTCCGCGCG GAGAATAGCA ACACGTCGCG CCACCTGGCT
GAGTTTTGGA TGGTAGAGCC CGAACTCGCG TTCGCGGATT TGAACGACGA CATGGATTGC
GCAGAAGCGT ACTTGAAGTA TTGCCTGAAC CACGTCCTCG AGCACTGCGA CGAAGATCTT
GAATTCTTCG AGAAGAACAT CTCCAAAGAC AACCTGAGAG AGCGACTTCG AAACGTCGCG
TCGCAAGAGT TTGCGCGCAT CACGTACACC GAAGCCGTCG AGCACGTGTT GAACGCGAAG
AAGAAGTTTG AGTTCCCAAT CGAATGGGGA TCGGATCTTC AGAGCGAGCA CGAGCGGTAC
ATCTCAGAAG AAGTCTTCAA AGATCGCCCC GTGATCGTGC GCGATTATCC GAAAGATATC
AAGGCGTTCT ACATGCGTCT CAACGACGAC AACAAAACCG TCGCCGCAAT GGACGTCTTG
GTGCCCCGCG TTGGTGAGTT GATGGGTGGT AGCCAAAGAG AAGAACGCCT TGATGTCTTG
GAGCGCCGAA TCGAAGAGGT TGGTTTGGAA AAGGAGTCGT ACTGGTGGTA CTTAGATTTG
AGACGATACG GTTCGCAGCC GCACGCCGGT TTCGGCCTCG GTTTTGAACG CCTCGTTCAG
TACGTCACAG GTGTGGAGAA CATTCGTGAC GCCATTCCTT TCCCCCGTTA CCCGGGTAGC
GCTGAGTTCT AGATTTAGTC TAACGCGGC
 
Protein sequence
MRARATPGAI KHARSARAVA TRATYGRPER IARVKGNDGG ASRVGEALEL RGWARSVRTQ 
KGMAFIDLND GSAISGMQAV VNEGSAAWDA LDAGGVSTGA ALRVKGKLVA SPGGKQAAEL
AVEEIDVIGT ADPETYPLQK KRHTLEYLRS IAHLRPRTNT IGAVARVRNQ LAYATHTFFQ
EHGFLYANTP IITASDCEGA GEQFQVTTLV EALKNNRPSS DLPKNKDGSI DYSQDFFGKP
SYLTVSGQLN GEIMACAVND IYTFGPTFRA ENSNTSRHLA EFWMVEPELA FADLNDDMDC
AEAYLKYCLN HVLEHCDEDL EFFEKNISKD NLRERLRNVA SQEFARITYT EAVEHVLNAK
KKFEFPIEWG SDLQSEHERY ISEEVFKDRP VIVRDYPKDI KAFYMRLNDD NKTVAAMDVL
VPRVGELMGG SQREERLDVL ERRIEEVGLE KESYWWYLDL RRYGSQPHAG FGLGFERLVQ
YVTGVENIRD AIPFPRYPGS AEF