Gene OSTLU_19724 details

Gene Information

Locus tag: OSTLU_19724
Symbol:
ID: 5004318
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009365
Strand:
Start bp: 103122
End bp: 104238
Gene Length: 1117 bp
Protein Length: 371 aa
Translation table:
GC content: 68%
IMG OID: 640419739
Product: predicted protein
Protein accession: XP_001420414
Protein GI: 145352138
COG category:
COG ID:
TIGRFAM ID:
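
The start, end, and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (an assumption, although it agrees with the reported gene length) and that the protein is followed by a single stop codon. The function names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 103122
END_BP = 104238
GENE_LENGTH_BP = 1117
PROTEIN_LENGTH_AA = 371


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_length(protein_aa: int) -> int:
    """Bases needed to encode a protein plus one stop codon."""
    return 3 * (protein_aa + 1)


print(span_length(START_BP, END_BP))     # 1117
print(coding_length(PROTEIN_LENGTH_AA))  # 1116
```

The span matches the reported gene length of 1117 bp. The coding length is one base shorter, which is consistent with the gene sequence listed in this record beginning with a single G before the first ATG; the record does not explain that extra base.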


Plasmid Coverage information

Num covering plasmid clones: 35
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GATGACGCGC GAACGCGCGC CGACGACGGC GACGCTGGAG GCGCTGCGAC ACCTCGTGCG 
CGCGGCGAAG GAAGACAGCG CGATCCTGGA CGCGCCGGCG CTCGAATTCT TCCGACGATG
GCTCGAGGAG GATTTGGGGG CGACGATTCC GGCGCCGCGG ACGACGACGA CGACGGGGAC
GGGGACGGAC GCGATCGAGA TCGAGGACGA CGAGGCGATG GCGGCCGAGA GCGATGATCT
GAGCGCGATC GCGATGGGGG CGGAGACGGC GCCGGAGACG CTCGGGGAGG CGGAGGAGGC
GAAGGCGAGC GAGGCGAAGC GACTGGCGAG CGAGGCGTTC GCGCGCGAGG CGTGGGAGGA
GGCGATCGAG AGGTACACGG AGGCGCTGAT GATCGCGCCG TCGGCGCTGA CGTACGCGAA
ACGGGCGGAA TGTTTCATCA AGTTGCGAAA GCCGCTGTCG GCGATTCGAG ACGGGACGGC
GGCGTTGAAG TTGAATCCGG ATTCGGCCAA GGCGTTGAAG GTTCGAGGCG CGGCGCACAG
GTACTTGGGA CACTGGAACG AGGCCAACGC GGATCTGAGC GCGGGATTGT CTCAGGACTT
CGACGAGACG TACGGGGAGA TGCATAAAAA AGTCTTGAGC GTCGTGCACG AGCTTCACGT
GCGCGAGGGC AAGGCGCGCG CCGCGAAGGA GGCCAAGGAA AGAGAAGAGC TCGAAAAACG
CCGAGCCGCC GCGGAGGCGG CGCGCAAAGA AGCCGCGGCG AAAGACGCCG GCGGGCCTGG
GTTCGGCCAA CCGGGCGCCG GATTCCCGGG CGGCGCCGGC GACTTGCCGC CCGGCGTTTC
GCCCGAGATG GCGCAAAAGC TGATGAGCGA CCCCGATCTC ATCGCCGCGA TGCAGAACCC
CAAGGTCATG CAAGCGCTTC AAACGATGAT GAAGAACCCG ATGGCGGCGA TGCAGTACAT
GAGCGACCCC GAAGTCGGAC CGGTGTTGCA AAAATTGATG GCTTCGATGG GCGGCGCGAT
GCCGGGCGGC GCGCCCGGCG GCTTCCCGGG CGGCTTCCCG GGCGGCTTCC CGGGCGCCGG
CGCCGCGCCC GGCGGCGCGG CGAACGACGT GGATTAG
 
Protein sequence
MTRERAPTTA TLEALRHLVR AAKEDSAILD APALEFFRRW LEEDLGATIP APRTTTTTGT 
GTDAIEIEDD EAMAAESDDL SAIAMGAETA PETLGEAEEA KASEAKRLAS EAFAREAWEE
AIERYTEALM IAPSALTYAK RAECFIKLRK PLSAIRDGTA ALKLNPDSAK ALKVRGAAHR
YLGHWNEANA DLSAGLSQDF DETYGEMHKK VLSVVHELHV REGKARAAKE AKEREELEKR
RAAAEAARKE AAAKDAGGPG FGQPGAGFPG GAGDLPPGVS PEMAQKLMSD PDLIAAMQNP
KVMQALQTMM KNPMAAMQYM SDPEVGPVLQ KLMASMGGAM PGGAPGGFPG GFPGGFPGAG
AAPGGAANDV D
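
The relationship between the two sequences above can be sketched in a few lines of Python. The record leaves the Translation table field blank, so the standard genetic code is assumed here. The helper names (`clean`, `gc_content`, `translate`) and the `offset` parameter are illustrative. Only the first row of the gene sequence is used, to keep the example short.

```python
from itertools import product

# Standard genetic code (an assumption; the record's "Translation table"
# field is empty). Codons are enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display grouping."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str, offset: int = 0) -> str:
    """Translate from `offset`, stopping at the first stop codon."""
    seq = clean(seq)[offset:]
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence as listed in this record.
FIRST_ROW = "GATGACGCGC GAACGCGCGC CGACGACGGC GACGCTGGAG GCGCTGCGAC ACCTCGTGCG"

# offset=1 skips the single base that precedes the first ATG.
print(translate(FIRST_ROW, offset=1))  # MTRERAPTTATLEALRHLV
print(f"{gc_content(FIRST_ROW):.2f}")
```

The translated fragment matches the first 19 residues of the protein sequence. Passing the full gene sequence to `gc_content` gives a value that can be compared against the 68% reported in the Gene Information fields.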