Gene OSTLU_87709 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_87709 
Symbol 
ID5002947 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009361 
Strand
Start bp181065 
End bp182645 
Gene Length1581 bp 
Protein Length526 aa 
Translation table 
GC content59% 
IMG OID640418368 
Productpredicted protein 
Protein accessionXP_001418632 
Protein GI145348391 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID[TIGR00564] anthranilate synthase component I, non-proteobacterial lineages 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCGCGAG AGACGCAACG CGCGGCGTTT GATCGCGCGC TGTCGCTCGG GAACAACGTC 
GTGCCGCTGT ATCGAAGGAT TTTTGACGAT CAGCTGACGC CCATCCTGGC GTATCGACTG
CTCGTCAAGG AGGACGAGCG CGAGGCGCCG AGCTTTTTGC TCGAGTCCGT CGTCGGTGGG
ACGCAGATTG GACGCTATTC GTTCCTCGGA CGACGACCGG TGATGGAGGT GACGGCGAAG
GATTACGAGG TGACGGTGAC GCGACACGAG GGCGGGGGGG CGGCGAGCGA GACGACGACG
GAGCGGGATC CGATGGAGGT GATGAAGCGA ATCGGGGAGA GCTGGCGAGC GTGTAAGACG
CCCGGGTTAC CGGATTGCTT CGCGGGTGGA TGGGTCGGGT TCACGGGGTA CGACACGGTG
AGATATCAGT ATCAAAGCAA GCTCGGTTTC GAAGGCGCGC CGAAGGACGA TAGGTCGCTG
CCGGATATTC ATCTCGGATT GTATAAGGAT GTTGTGGTGT TCGATAACGC CACAAAACAG
CTCTATGCGG TACACTGGGT GATGGTTGAC GAATACTCGA GCGCCGACGA GGCGTACACA
ACGGGTATGG ACGCTTTGGA CGCGATGATA GACGACTTGC AGCCGTCCAA GTCCCCGCCG
ATGAAGCAAG GATACGTCAA CTTGGAGCTA AATCAGCGAC CGAGCGAACC AAAAGATAGC
ACGATGACCA AGGATGAGTT TTTGGGGGCG GTAGCGGCGA CGAAGGAGCA TATCAAAGCG
GGTGACATCT TCCAACTCGT TCTCAGCCAT AGATTTCAAC GGAAGACGTC CGTGGACCCG
TTCGAAGTGT ACCGAGCTTT GCGAGTCGTC AACCCGTCGC CGTACATGAT TTATTACCAG
GGTCGAGACT GCATCCTAGT CGCATCGAGC CCTGAAATCT TGTGCCGCGT CGACAAGGCG
CGCACGGTGG TGAACCGTCC GCTAGCGGGA ACGCGCATGC GAGGGAAGAC GCCTGAAGAA
GACGAGGCGC TCGAGGTGGA TCTCCTAGCG GATGAAAAGG AGCGCGCCGA GCACGTTATG
CTCGTCGATC TTGGGCGAAA TGACGTCGGT CGCGTATCCA AGGCTGGTAC GGTCAAAGTC
GAGAAGCTCA TGGAAATTGA ACGCTATTCT CACGTGATGC ACATTTCCTC TACTGTGACG
GGTAACTTAG TAGACGACCT CGGCCCGTGG GACGTGCTCC GTGCAGCGTT GCCCGCGGGC
ACAGTCAGCG GCGCGCCGAA GGTCCGCGCG ATGCAGATTA TCGACAACTT AGAGGTGACG
CGTCGCGGTC CGTACGGTGG TGGCATCGGC TACGTAGGCT TCACCGGCGA GATGGACATG
GCGCTCGCGT TGCGCACCAT GGTCGTTCCG ACTCGCCAGG CCAAAGAAAC TAACGGTTCG
CGCGAGTGGA TGATTCACCT TCAAGCTGGC GCTGGGATCG TCGCGGATTC CAATCCAGAG
AGTGAGTATC AAGAGACTGT CAACAAGGCG GCGGCGCTCG GTAGAGCCAT CGACCTCGCG
GAGAGCGCGT TTACGGATTG A
 
Protein sequence
VARETQRAAF DRALSLGNNV VPLYRRIFDD QLTPILAYRL LVKEDEREAP SFLLESVVGG 
TQIGRYSFLG RRPVMEVTAK DYEVTVTRHE GGGAASETTT ERDPMEVMKR IGESWRACKT
PGLPDCFAGG WVGFTGYDTV RYQYQSKLGF EGAPKDDRSL PDIHLGLYKD VVVFDNATKQ
LYAVHWVMVD EYSSADEAYT TGMDALDAMI DDLQPSKSPP MKQGYVNLEL NQRPSEPKDS
TMTKDEFLGA VAATKEHIKA GDIFQLVLSH RFQRKTSVDP FEVYRALRVV NPSPYMIYYQ
GRDCILVASS PEILCRVDKA RTVVNRPLAG TRMRGKTPEE DEALEVDLLA DEKERAEHVM
LVDLGRNDVG RVSKAGTVKV EKLMEIERYS HVMHISSTVT GNLVDDLGPW DVLRAALPAG
TVSGAPKVRA MQIIDNLEVT RRGPYGGGIG YVGFTGEMDM ALALRTMVVP TRQAKETNGS
REWMIHLQAG AGIVADSNPE SEYQETVNKA AALGRAIDLA ESAFTD