Gene OSTLU_17761 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_17761 
Symbol 
ID5005079 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009367 
Strand
Start bp94533 
End bp95723 
Gene Length1191 bp 
Protein Length396 aa 
Translation table 
GC content61% 
IMG OID640420500 
Productpredicted protein 
Protein accessionXP_001420902 
Protein GI145353183 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value0.0264272 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0121451 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCGCGC GAGACGCGGC GTCGCCCGTG CGCTCGCTCG ACGTCGCGCG CGCGCGCGCG 
ACGTGGACGG CGCTGGACGA GGCGATTCGA GGCGACGCCG GCGCGTGGAC GTTCGACGAC
GACGACGACG ACGCGCGCGC GACGACGAGC GGACGCGAAG AGCGCGACGC CGGTGATGCG
ATTCGCGCGC TCGGGTACGC GATCGTGCCG CGATGCGCGA ACCGAGCGCT GGCGAGGCGA
GTGGCGACGC TCGCGCGCGG GTTGGCGGCG CGAGGCCTGG ATCCGACGTA CGTTTTCGTG
TACGAGGACG CGTGGACGCT GTTGCGAACG ATAGAAACGG CGTGCAAAAG TCGAGGATGC
TTTGGTGGGT TAATCATGAA TTACGACGTG CTGGCGTGGT GCGTGGATCC GGTGAGCGAT
GGGGAGGCGA CGACGGCGTT TTGTCCGCAC CGGGATCGAC AGCCGGACGA CTCGCCGGGA
TCGTTTTCGG AGAATGGGGA CGCGAAGTAC GCCACAGCGT GGGTCGCGTT GGCGAATGAT
GCGACGCCGG AGAATTCGTG TTTGTACTGC GTGCCGAGAC CGCACGATCC GGGTTATTAC
GACGGCGATG ATGACGACGC GAACGCGAAG GATCCCTTGA GCGTGGCCTT GGACTCAAAA
AAAGCGTTTC AATACGTTCG CGCGTTGCCG TGTAAAGCGG GCGATGGCGT GGTTTTCACC
CATCGGTTGA TACATTGGGG ATCGATAGGC GAGGGGCGCG AGGACAGGCC GAGGATTAAC
ATCAGTTTTG GATTCGCCTG CGAAAGCTTT GAACCGGCGT ACCTGACGAG TCGCTCGCGC
GTGCCCAGTT TCGACGAACG ACTTGCCCTC GTCGCAGGGC AGTTGATATG CTATCACGAA
AGGTTTCCAC CGAACGCGAA AGAACTTGGT GTGTTGAAGA AGCTATTCGA CGGCATGAAG
TCGAGCTTCG ACGACGCGTA CGTTCGAAAG GTGAATAAAG AATTCGCAAA CGCCGCGCTG
CGCGGCGAGA ACGACGACGA GGTCGAAGAA GACGCGCTCG ACGCCATGCT CGACGCCGCC
GATGATTTTG ACGACGATTT TGATGATTTT GAAGATGGCG TCGAAGGTGA TGCGTACGGA
CGAGTCGCCG ACGGAGATGA CGCTCTGGCG AAACGATCGA AAAGGAAGTG A
 
Protein sequence
MGARDAASPV RSLDVARARA TWTALDEAIR GDAGAWTFDD DDDDARATTS GREERDAGDA 
IRALGYAIVP RCANRALARR VATLARGLAA RGLDPTYVFV YEDAWTLLRT IETACKSRGC
FGGLIMNYDV LAWCVDPVSD GEATTAFCPH RDRQPDDSPG SFSENGDAKY ATAWVALAND
ATPENSCLYC VPRPHDPGYY DGDDDDANAK DPLSVALDSK KAFQYVRALP CKAGDGVVFT
HRLIHWGSIG EGREDRPRIN ISFGFACESF EPAYLTSRSR VPSFDERLAL VAGQLICYHE
RFPPNAKELG VLKKLFDGMK SSFDDAYVRK VNKEFANAAL RGENDDEVEE DALDAMLDAA
DDFDDDFDDF EDGVEGDAYG RVADGDDALA KRSKRK