Gene OSTLU_32105 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_32105 
Symbol 
ID5002563 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009360 
Strand
Start bp281764 
End bp283043 
Gene Length1280 bp 
Protein Length370 aa 
Translation table 
GC content68% 
IMG OID640417984 
Productpredicted protein 
Protein accessionXP_001418201 
Protein GI145347499 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0547] Anthranilate phosphoribosyltransferase 
TIGRFAM ID[TIGR01245] anthranilate phosphoribosyltransferase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.00110753 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0624968 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CCGGGACGCG ACGCGGCCGA CGCGCGAAGA CTGCGCGCGA TCCGAGCGCG ATGCGCGCGT 
TCAACGCGCC CGCGGCGACG GCGACGCGCG GTGCGGTGAC GACGGCGACG CGACGACGAC
GCGCGACGAC GATGGAGGGA CCGAGCGCGA CGATCGCGGC GACGACGATG CGCGCGAGGC
GATGGGAACG CGCGCGAGGG TGCGGGAACG GGCGCGCGGG ACGCGCGGTC GTCGCGCGGG
CGACGGCGAG GCGCGCGGCG GTGGATCTGC GGAAGGTGAT CGAGGATTTG TGCGAGGGGG
CGGATTTATC CGAGGAGGAC GCGCACGCGG CGATGGAGGC GCTGCTGGAC GCGGATCCGA
CGCAGATCGC GGCGTTTTTG GTGCTGCTCC GGGCGAAGGG GGAGACGGCG AGCGAGATGG
CGGGGTTGGC GCGAGCGATG CAGTCGAGAG CGGTGACGGT GGACGCGGGG GACGACGTGC
TGGACATCGT CGGCACGGGG GGGGACGACG CGGGCACGGT GAATATTTCC ACTGGATCGT
GCGTGTTGGC CGCCGCGGCG GGAGCGAAGG TGGCGAAACA CGGTTCGCGA TCGGTGTCTT
CGCTGTGCGG GTCGGGCGAC GTGCTGGAGG CGCTCGGCGT GGACATCGAG CTGGGACCGG
AGAGCATGAA GCGCTGCGTC GAGGAAGTCG GCGTCGGATT TATGTTCGCG CCGAGATATC
ATCCGGCGAT GGCGAAGGTC TCGCCCGTGC GCAAGGCGCT CAAGGTGCGA ACGGCGTTTA
ACATGTTGGG CCCGATGTTG AACCCGGCGC ACAGCAAGTA CGCGCTCGTC GGCGTGTACA
GCACGGGGGT GCAGCAACTC ATGGCGGACT CGTTGATGAA GCTTGGGATG AAGAAGGCGT
TGATCGTGCA CTCCATGGGA TTGGACGAAC TCACGCCGGC GGGACCCGCG GACGTCGTCG
AGGTGACGCC GAGCGGCACG CGCGCGTACA CGTTCGAGCC GAAGGATGTC GACATTAAGC
CGTGCACGCT CGAGGATTTG CGCGGCGGCG ACCCGACGAC AAACGCGAGA ATTTTGCGAG
CCGCGTTGGA GGGTGAGAAG GGCCCGGTCG CCGAGACCCT GATTTTGAAT GCCGGCGTCG
CTATGGCGGC CGCGCAGCAA GCGAAGGACG TCGTGGAAGG CATCGCCATG GCGAGAGAGG
CGCACGAGAG CGGCAAGGCG GGCAAAACGC TCGACTCCTG GATCAAGCTC ACCCAAGAAT
TGAGAAAGAC CGAGGCGTAG
 
Protein sequence
MRARRWERAR GCGNGRAGRA VVARATARRA AVDLRKVIED LCEGADLSEE DAHAAMEALL 
DADPTQIAAF LVLLRAKGET ASEMAGLARA MQSRAVTVDA GDDVLDIVGT GGDDAGTVNI
STGSCVLAAA AGAKVAKHGS RSVSSLCGSG DVLEALGVDI ELGPESMKRC VEEVGVGFMF
APRYHPAMAK VSPVRKALKV RTAFNMLGPM LNPAHSKYAL VGVYSTGVQQ LMADSLMKLG
MKKALIVHSM GLDELTPAGP ADVVEVTPSG TRAYTFEPKD VDIKPCTLED LRGGDPTTNA
RILRAALEGE KGPVAETLIL NAGVAMAAAQ QAKDVVEGIA MAREAHESGK AGKTLDSWIK
LTQELRKTEA