Gene OSTLU_37879 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_37879 
SymbolATS1 
ID5004239 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009364 
Strand
Start bp61496 
End bp62680 
Gene Length1185 bp 
Protein Length394 aa 
Translation table 
GC content61% 
IMG OID640419660 
ProductATP sulfurylase (sulfate adenylyltransferase) 
Protein accessionXP_001420064 
Protein GI145351391 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG2046] ATP sulfurylase (sulfate adenylyltransferase) 
TIGRFAM ID[TIGR00339] ATP sulphurylase 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value0.624538 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGCGC GCGAGCGGAC GTTCGCGTCG CAGAGCGAAG GGTTGATCGC GCCGCACGGC 
GGCGCGCTGG TGAATCTGAT GCTCGAGGAC GACGGGGCGA AGGCGAGGGC GATCGCGTCG
TGCACGCGGG CGCTCGAGCT GTCGGATCGA AACGCGTGCG ACGTCGAGCT GCTGAGCGCG
GGGGGGTTCT CGCCGCTGCG AGGGTTCATG AACGAGGACG AGTACGAACA CTGCGTGGAG
ACGATGCGGT TGAAGGGGAG CGAGCTGTTG TTCGGGCTGC CGATCGTGTT GGACACGAAT
TGCGAGGACA CCAAGGCGGG CGACAGAGTG TTGCTCAAGT ATCAGGGCAA GGACGTCGGC
GTGCTGACGG TGGAGTCGAA GTGGAAGCCG AATAAGCCGA AAGAGGCGAA GATGTGCTAC
GGGACGAGCT CCATCGAGCA TCCCGGCGTG GCGATGATCT CCATGGAGCG TCGCAAGTAT
TACATCGGTG GTAAGATTGA GGGTTTGAAC ATTCCGCAGC GACCGTTTCC GTGCCCGACG
CCCGCCGAGG TGCGCGCGGG GTTGCCCGCG GGTAAGGATG TCGTGGCGTT CCAGTGCCGC
AACCCGGTGC ACCGCGCGCA CTACGAGCTC TTCACTCGCG CTTTGCACGC GGAAAACGTC
GGTAAGGACG CCGTGTGCCT CGTTCACCCG ACCATGGGTC CGACCCAAGA CGACGACATC
TCGGGCTTGG TGCGATACAA GACGTACGTC GTCCTCGCGG AAGAGGTGAA GAACCCGCAA
ATTCGCTGGG CCTACCTCCC GTACTCCATG CACATGGCGG GTCCGCGCGA AGCTATTCAG
CACATGATCA TTCGTAAGAA CTACGGCTGC ACGCACTTCA TCATCGGTCG CGATATGGCT
GGTTCCAAGT CTTCCCTCGA CGGAGAAGAC TTTTACGGCG CGTACGACGC CCAAGACTTG
GCCAAGGCGA ACGCGGCTGA GCTCGGCATG AAAACCGTCC CGAGCTTGAA CGTCGTGTAC
ACCGAAGAAG AAGGCTACGT CACCGCCGAT GTCGCCAAGG AGAAGGGTCT CAACATCAAG
AAGCTCAGCG GCACCAAGTT CCGCCAAATG TTGAGAGGCG GCGAGGACAT TCCAGAGTGG
TTCGCGTTCA AGTCCGTCGT CAAGGTCCTT CGCGAGAACA TTTAG
 
Protein sequence
MTARERTFAS QSEGLIAPHG GALVNLMLED DGAKARAIAS CTRALELSDR NACDVELLSA 
GGFSPLRGFM NEDEYEHCVE TMRLKGSELL FGLPIVLDTN CEDTKAGDRV LLKYQGKDVG
VLTVESKWKP NKPKEAKMCY GTSSIEHPGV AMISMERRKY YIGGKIEGLN IPQRPFPCPT
PAEVRAGLPA GKDVVAFQCR NPVHRAHYEL FTRALHAENV GKDAVCLVHP TMGPTQDDDI
SGLVRYKTYV VLAEEVKNPQ IRWAYLPYSM HMAGPREAIQ HMIIRKNYGC THFIIGRDMA
GSKSSLDGED FYGAYDAQDL AKANAAELGM KTVPSLNVVY TEEEGYVTAD VAKEKGLNIK
KLSGTKFRQM LRGGEDIPEW FAFKSVVKVL RENI