Gene OSTLU_26970 details

Gene Information

Locus tag: OSTLU_26970
Symbol:
ID: 5004861
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009366
Strand:
Start bp: 519466
End bp: 520563
Gene Length: 1098 bp
Protein Length: 365 aa
Translation table:
GC content: 55%
IMG OID: 640420282
Product: predicted protein
Protein accession: XP_001420872
Protein GI: 145353111
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG2843] Putative enzyme of poly-gamma-glutamate biosynthesis (capsule formation)
TIGRFAM ID:
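
The coordinates and lengths listed above are mutually consistent, which can be confirmed with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state the convention) and that the unspliced CDS ends in a single stop codon. The helper names `span_length` and `coding_codons` are hypothetical and used only for illustration.

```python
# Values copied from the Gene Information record.
start_bp = 519466
end_bp = 520563
gene_length_bp = 1098
protein_length_aa = 365


def span_length(start, end):
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp):
    """Amino-acid count of an unspliced CDS, assuming one trailing stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = coding_codons(computed_length)
print(computed_length, computed_protein)  # prints "1098 365"
```

Both computed values match the Gene Length and Protein Length fields of the record.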


Plasmid Coverage information

Num covering plasmid clones: 41
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 45
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCAAGC AATTCTTTTT CGCCGGGGAT CTCATGCTAG CGCGCGGCGT GGATCAAGTT 
CTTCCGAATC CGCTGCGAAA CGCCTCGCTA CGCGAATCTT GTTGTACGCA CGCCAACGAC
TACGTACGCC TCGCGATGAA GAAGTGCGCG ATACAGCGTA AACCAACCTG GACGGCGAAG
GAACTGCTCG GCGACCTGCT TCCTGCTCTT CGTTCGCGAG AGCCCATCGA CTTAAGCGTC
GTAAACTTGG AAACGTCAGT GACAACGAAC GATGAGTTTT GGCCGCGTAA AGGAGTGTGT
TATCGCGCGT CGAAGGAAAA CTGTCTCGAT GTCTTGGCAA CTTTGCGCGT CGATGTGCTG
ACGCTGGCAA ACAATCACAC ACTGGACTTT GGCATCAAAG GACTGATCGA TACACTCGAC
GCGATTGATT CAGCGTCTCG AGTGGGCGCA GGACGAAACG ACCTTCAGGC GTTCGAACCG
AAGATTGTTG CCGACACCGC GGTATTTGGA TTGTGTCTTG AAAACTCAGG AGTACCGGTT
TCGTGGCAGG CTCGAAGGAA CGCCGCGGGT TTGGCTCTCA TTCTAGACGA ACACGACTTG
AATCGAGTCG CTCGTGAAAT CGAACGCACA GACGTGCCGA TCAAAATCGT CTCGCTGCAC
GCCGGCGGCA ATTGGGGGTA TTCGATTGAG CCGGAGACGC GAATGGTCTG TCGCCGACTG
ATTGACGCTG GTGCGCACTT CGTACATGGA CACTCGAGCC ATCACGCGCG TAGCGCCGAG
CTCTACAAAC GTCGACTCAT ACTCTTTGGG TGCGGAGAAC TTCTCAACGA TTACGAAGGC
ATAGGCGATC ACGCAGGCTT TCCATCGAAA ACATACAACG CCGATTTGCG ATACGCGTAC
TTTCCCACGT TGAACGATAC CGGCGAATTC ACGGAAATGA AAATCGACGT CTTCACGCAA
GCAAACTGTT TTCGCCTCGA GCGCGCAACG ATCGACGCCA CACAGCGAGC TTTCAGATCG
CTCGTCGCCG ATTACCGCAG AGGGGGGCTC GCGATGTCGA TGAAAGCACC GAACACGCTC
CTTGTCAAGC CGCTGTGA
 
Protein sequence
MTKQFFFAGD LMLARGVDQV LPNPLRNASL RESCCTHAND YVRLAMKKCA IQRKPTWTAK 
ELLGDLLPAL RSREPIDLSV VNLETSVTTN DEFWPRKGVC YRASKENCLD VLATLRVDVL
TLANNHTLDF GIKGLIDTLD AIDSASRVGA GRNDLQAFEP KIVADTAVFG LCLENSGVPV
SWQARRNAAG LALILDEHDL NRVAREIERT DVPIKIVSLH AGGNWGYSIE PETRMVCRRL
IDAGAHFVHG HSSHHARSAE LYKRRLILFG CGELLNDYEG IGDHAGFPSK TYNADLRYAY
FPTLNDTGEF TEMKIDVFTQ ANCFRLERAT IDATQRAFRS LVADYRRGGL AMSMKAPNTL
LVKPL
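
The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. The sketch below assumes the standard genetic code (NCBI translation table 1); the record leaves the Translation table field blank, so this is an assumption rather than a stated fact. The function name `translate` is hypothetical. The two inputs are the first 30 bases and the last 18 bases of the gene sequence, copied verbatim with their spacing.

```python
from itertools import product

# Standard genetic code (assumed; the record does not name a translation table).
# Codons are enumerated in TCAG order, with the third base varying fastest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna):
    """Translate DNA in frame 1, ignoring whitespace and stopping at a stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGACCAAGC AATTCTTTTT CGCCGGGGAT"))  # prints "MTKQFFFAGD"
# Last 18 bases of the gene sequence, ending in the stop codon TGA.
print(translate("CTTGTCAAGC CGCTGTGA"))  # prints "LVKPL"
```

The outputs agree with the first ten and last five residues of the protein sequence, and the final codon TGA accounts for the difference between 366 codons in the gene and 365 residues in the protein.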