Gene OSTLU_37270 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_37270 
Symbol 
ID5001269 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009358 
Strand
Start bp798752 
End bp801655 
Gene Length2904 bp 
Protein Length967 aa 
Translation table 
GC content56% 
IMG OID640416690 
Productpredicted protein 
Protein accessionXP_001417607 
Protein GI145346253 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0013] Alanyl-tRNA synthetase 
TIGRFAM ID[TIGR00344] alanine--tRNA ligase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.0476971 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCGGTGT GCGAGCAAGC CGCGACGTAC GATCCTAGTG GACCGAAGGA GTTGCCGCGG 
CCGGTCAAGT ACGTAGAAGA ATGGCCGTCC TCGCGCGTCA GAGATACCTT CGTCAACTAC
TTCATCGAAA AGCAGGGCCA CGTGAACTGG CCGTCATCGC CAGCGGTGCC TGTCAACGAC
CCCACGCTTC TCTTCGCAAA CGCGGGCATG AATCAATACA AACCGATTTT CCTGGGCAAG
GCTGATCCGA AGACGGCGTT TGCCAAGCTC ACCAGGGCGA CGAATACGCA AAAGTGCATT
CGCGCAGGCG GGAAGCACAA CGATTTGGAC GATGTTGGAA AAGACACGTA CCATCACACC
TTCTTTGAGA TGTTGGGTAA CTGGTCTTTC GGCGATTACT TCAAGGAAGA AGCCATCGGT
ATGGCGTGGG ATCTTCTCAC GAATGTGTAC GGTCTCGCGC CCGATCGCTT GTATGCGACG
TACTTTGGCG GTGACGAGTC GCAAGGCTTG CAACCCGACT TAGAGGCCAA GGCAATTTGG
TTGAAGTATC TTCCGGAGTC TCGCGTCATG CCATTTGGGT GCGCGGACAA CTTCTGGGAG
ATGGGCGACG TCGGCCCGTG CGGTCCATGC ACTGAGATTC ACTATGATCG CATCGGTGGC
AGAGACGCGG CCTCCTTGGT GAACATGGAT GACCCGAACT GCCTCGAAAT CTGGAACGTC
GTGTTCATTC AGTACAATCG CGAAGAGGGC GGTGTTCTCA AATCGCTTCC CTCGAAGCAT
GTCGACACCG GTATGGGTTT CGAGCGTCTG ACGTCGATCT TACAAAACAA GATGAGCAAT
TACGATACGG ATGTCTTTAT GCCCATCTTC AAGGAGATTC AGCGCATCAG TGGTGCCGCA
CCCTACACTG GCCTTCTCGG CAAGGAGGAC GTCGGGGAGA AAGACATGGC TTACCGCGTT
GTCGCTGATC ATATTCGCAC ATTGTCCATC GCCATCGCCG ATGGCGCGGC TCCCGGGTCA
GATGGCCGCA ACTACGTGTT GCGTCGCGTC CTTCGCCGAG CGGTCCGCTT CGGACGTGAG
AAGCTCGGTG CAAAGCAAGG TTTCTTTCAC AAACTCGTCC CGTGTTTGAT TCAGCAGCTT
GGAGCTGTAT TCCCAGAACT CGTCGCCAAG CAAACGCACA TTACCGAAAT CATCGCCGAC
GAAGAGGAGT CGTTCGGCCG AACGCTCCAA AAGGGTATCG ACCAATTCGG CAAGGTTCTC
GCGGCGGCTA AGCAAGAAGG TAGAACAGTG ATTTCTGGTC CGGAAGCGTT CTTGTTGTGG
GAATCGTATG GTTTCCCGAA TGATCTTACG GAGCTTATGG CCGAGGAGAA CGGTTTTACG
CTCGACAATG AAGGTTTCGC GCAGGCGTTT GCCGAAGCGC AAGAAAAGTC TCGCGCCGGC
GGTAAGAAGT CTGGAGGTGT GCAGTTGCTG TTCGAAGCGG AAGCCACTGC GTGGTTGCAA
AACAATGGTG TGGCAATCAC GAAGGATGAA GAAAAGTACG CAAGCGGACG ACCGACGCTC
GAAAGCACCG TCACCGCCAT CATGTCGCCA AGCGGATTCA TAAAATCCAC CTCCGACGCC
GAAGGACCTT ACGGGTTCTT CATGGACGCA ACGACGTTTT ACGCCGAGTC GGGTGGTCAA
GTGTGTGACT CGGGTTTGAT TACCACCCCG AGTGGCTCGA TGTCCGTATC GGATGTCAAG
GTTGCTGCGG CGTACGTGAT GCACACCGGC GACGTATCCG GCACAGTGAG TGTTGGCGAT
GCCGCGAAGT GCGCAGTCGA TTATGACCGT CGAGATAACA TCATGCCCAA CCACACGATG
ACGCACGTGC TCAATTATGC GTTGCGCAAG GTGATGGGTG ATGGTGTCGA TCAAAAAGGT
TCGTTAGTAG ATGAAGAAAA GTTGCGATTC GATTTTTCTC ACAACAAGGG GGCGACGACG
AAACAAATTG CTGAAATCGA GGCCATCGTG AACGAGCAAA TCAAATCAAA GCTCGCAGTA
GACAAACGCG AGGTAGCGTT AGACAAGGCG ATGACTATCA ACGGTTTACG CGCGGTGTTC
GGTGAGGTGT ACCCCGACCC CGTTCGTGTG GTGTCTGTAG GTCCGAGCAT CGACGACTTG
CTCGCCAACC CGTCGGATGA CAAGTGGAAA AACTACTCCA TAGAGTTCTG TGGCGGCACC
CACTTGGCGT CGACCGATTT CGCGGAGCAA TTCGTCATTC TTGAAGAAAG TGGTATCGCA
AAGGGTATTC GTCGCATCAC CGGGGCGACG CGCGAAGGGG CTAAAGCGGC GCTCGCGCGC
GCTGCGGACG TTCTCGCTCT GGTGAAGAGT TGCGATTCGT TGTCCGGCGA AGCGCTCGAC
AAACAACTGG GTGTGTTGAA AAACGTCGTC GACACTGAAG TTCTTCCCGT CATTCAGCGC
GAGGAAATTC GTGCCGCTGT CACTAGTCAA GTCAAGCGAG TCCTGGACGC GCAAAAGGAA
GCCGCCGCGG CGGCCAAGGC GCAAGCCATC GTAGACGTTC AAGAAAAAAC CGCCGCGACA
AAGTCTGCGG GTGCAAAGTA CTTTGTCGCC ACCCTGGCGG ACGGTACTGA TGCTGGCGCT
ATGAAGGAAG CGGCGGCGGT CGCTTTCGCC GAAGGTATCG CGTGCACACT CTTGGCGAAC
TGCAAGGGTA AAGAGTTCGT TTACTGCAGC GTGCCTCCGG ATGTCGGCAT CGACGTCAAG
GGCTGGCTTG CGGCGTCGTG CGGCCCGCTC GGTGGTAAGG GTGGCGGCGG TAAGGGTGGT
TTGGCGCAAG GTCAAGGACC GAACGTCGAC GCTGTTCCAG ATGCCGTCGC CGCCGCCGAA
GCGTTCGCGA AACTCGCCAT TTAA
 
Protein sequence
MPVCEQAATY DPSGPKELPR PVKYVEEWPS SRVRDTFVNY FIEKQGHVNW PSSPAVPVND 
PTLLFANAGM NQYKPIFLGK ADPKTAFAKL TRATNTQKCI RAGGKHNDLD DVGKDTYHHT
FFEMLGNWSF GDYFKEEAIG MAWDLLTNVY GLAPDRLYAT YFGGDESQGL QPDLEAKAIW
LKYLPESRVM PFGCADNFWE MGDVGPCGPC TEIHYDRIGG RDAASLVNMD DPNCLEIWNV
VFIQYNREEG GVLKSLPSKH VDTGMGFERL TSILQNKMSN YDTDVFMPIF KEIQRISGAA
PYTGLLGKED VGEKDMAYRV VADHIRTLSI AIADGAAPGS DGRNYVLRRV LRRAVRFGRE
KLGAKQGFFH KLVPCLIQQL GAVFPELVAK QTHITEIIAD EEESFGRTLQ KGIDQFGKVL
AAAKQEGRTV ISGPEAFLLW ESYGFPNDLT ELMAEENGFT LDNEGFAQAF AEAQEKSRAG
GKKSGGVQLL FEAEATAWLQ NNGVAITKDE EKYASGRPTL ESTVTAIMSP SGFIKSTSDA
EGPYGFFMDA TTFYAESGGQ VCDSGLITTP SGSMSVSDVK VAAAYVMHTG DVSGTVSVGD
AAKCAVDYDR RDNIMPNHTM THVLNYALRK VMGDGVDQKG SLVDEEKLRF DFSHNKGATT
KQIAEIEAIV NEQIKSKLAV DKREVALDKA MTINGLRAVF GEVYPDPVRV VSVGPSIDDL
LANPSDDKWK NYSIEFCGGT HLASTDFAEQ FVILEESGIA KGIRRITGAT REGAKAALAR
AADVLALVKS CDSLSGEALD KQLGVLKNVV DTEVLPVIQR EEIRAAVTSQ VKRVLDAQKE
AAAAAKAQAI VDVQEKTAAT KSAGAKYFVA TLADGTDAGA MKEAAAVAFA EGIACTLLAN
CKGKEFVYCS VPPDVGIDVK GWLAASCGPL GGKGGGGKGG LAQGQGPNVD AVPDAVAAAE
AFAKLAI