Gene OSTLU_16331 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_16331 
Symbol 
ID5003388 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009362 
Strand
Start bp46685 
End bp47950 
Gene Length1266 bp 
Protein Length421 aa 
Translation table 
GC content56% 
IMG OID640418809 
Productpredicted protein 
Protein accessionXP_001419054 
Protein GI145349258 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones42 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0448283 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGATGG AAATCGCACT GCCGCTCACG AACGAATGGG TGCGGAAACT CGCGACGGCG 
ATGGAAGCGA ACACGCACTG CAGCGTGCCG CGAAGGATAT CTAAAATTCT ATCCGCCACC
GAGGTTGAAA AGCTGTTCAC CGCGGTCGCG CGAGTGCTGA AACGGGAGAA AACTTTGGTG
CGCGTGAGGG GCGACGGCGA CGACGATTTT GACGAAGTCG TCGTGGTGGG CGATACCCAC
GGACAGTATC ACGACGTCTT GAAACTTTTC GAGCTCGCGG GCGAGCCCGA AAAGAAGAAA
ATGTTCGTGT TTAATGGTGA TTTCGTCGAT CGCGGGGCCT GGGGAGTGGA AGTGTTGTTG
TTGTTGTTGG CGCGCAAGGC GCTCGCGCCG GAGCGCGTAA TTTTACTCAG AGGGAACCAC
GAGACGGAAT TTTGCACGGA GTGCTACGGA TTTGAGCGTG AGTTGAGCGT CAAATACGGG
AAAACCGCAG GACGGCGGTT GTATCCAATG TTTTTAGAGT TGTGCGCGGC GTTGCCGTTA
GCTTGCAAGG TGGGGGACGC CACGTTGATT TTACACGGCG GATTGTTTCG AAGCGCGTCG
TTGTCGAAGA GGGAGGCGGC GACGAACCCG AAATTAGGGA CATTGGCAGA ATTAGAGAAG
GCATCAAAAG GCGGCGCGGA TCCAATCGGT GAAGGCCGCT CAATGATTGC GGGAGACGTA
TTGTGGAGCG ACCCCATTCC TGAAGATGGT TTGCACCACA ACGAAAACCG GGGCATCGGC
ATTCAATTCG GGCCAGCGCA AACACTCGAA TTCTTGCGCA ACGAAAATTT AAAGCTAATT
ATTCGTTCAC ACGAAGGTCC GGATGCGAGA CATGATCGAC CGGAGATGCC GAGCATCATG
AGCGGTTTTT GCGTCGATCA CGACTTTGGC GAAGACGGGA AATTGTGCAC GCTGTTCAGC
GCGCCCAACT ACCCGCAGTT TATCGAAGTC GATGACTTGC GCCACAACAA CCTCGCATCG
TTCGTCACGC TGAGCCGCGC GACGAACTTT TGCGATCCAG TGCCGACGTC GTTCGATGCC
GTGCCTCGAC CGCCGAGTCA GTGTTATTAC GAGTTAGATT TAAATGGAAG CGATGCGGAG
GGGCCCGAGA GCGACATTCG TCCCCACGAG CCTCTGCACA TCGACGTCGG CGAGGGCGAC
GAGGGCGCAA GATGGAAAGA GGATGAACTC TCGCACAAGC TCACCTTAGA TCGCGACCAA
ACGTGA
 
Protein sequence
MAMEIALPLT NEWVRKLATA MEANTHCSVP RRISKILSAT EVEKLFTAVA RVLKREKTLV 
RVRGDGDDDF DEVVVVGDTH GQYHDVLKLF ELAGEPEKKK MFVFNGDFVD RGAWGVEVLL
LLLARKALAP ERVILLRGNH ETEFCTECYG FERELSVKYG KTAGRRLYPM FLELCAALPL
ACKVGDATLI LHGGLFRSAS LSKREAATNP KLGTLAELEK ASKGGADPIG EGRSMIAGDV
LWSDPIPEDG LHHNENRGIG IQFGPAQTLE FLRNENLKLI IRSHEGPDAR HDRPEMPSIM
SGFCVDHDFG EDGKLCTLFS APNYPQFIEV DDLRHNNLAS FVTLSRATNF CDPVPTSFDA
VPRPPSQCYY ELDLNGSDAE GPESDIRPHE PLHIDVGEGD EGARWKEDEL SHKLTLDRDQ
T