Gene P9303_04221 details

Gene Information

Locus tag: P9303_04221
Symbol: aslB
ID: 4778100
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9303
Kingdom: Bacteria
Replicon accession: NC_008820
Strand:
Start bp: 422593
End bp: 423768
Gene Length: 1176 bp
Protein Length: 391 aa
Translation table: 11
GC content: 45%
IMG OID: 640085926
Product: putative arylsulfatase regulatory protein
Protein accession: YP_001016439
Protein GI: 124022132
COG category: [R] General function prediction only
COG ID: [COG0641] Arylsulfatase regulator (Fe-S oxidoreductase)
TIGRFAM ID:
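
The coordinates and lengths in the Gene Information table can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length; the page states neither convention, and the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 422593
end_bp = 423768
gene_length_bp = 1176
protein_length_aa = 391

# Assumption: coordinates are 1-based and inclusive on both ends.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
codon_count = span_bp // 3
expected_protein_aa = codon_count - 1

print(span_bp, span_bp % 3, expected_protein_aa)  # 1176 0 391
```

Under these assumptions the span equals the listed gene length, is a whole number of codons, and yields the listed protein length.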


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAACAG CAAGTCCTGA CATCAGTCGC TTCGGCCCAA TCGGCCTGGT AGTTGTTCAG 
TCCACCTCTC TCTGCAATCT GGACTGCTCA TACTGCTATC TGCCAGATCG TAAGAGCCGC
AAGATCTTTG ATCTTGAGCT TTTGCCTTTA CTGATGCAAA GAATATTCGA AAGCCCCTTC
TTTGGTGAAG AGCTAAGCCT GGTGTGGCAT GCAGGGGAGC CACTGACCCT ACCCTGCAGC
TACTACGACA GAGCCACACA ATTAATCAAC GAAGCTGTAG AAAAATGGAC TAATGGAACA
GTTCATGTTG AACAGCATGT ACAAACTAAT GCCACACTGA TCAACGATGC TTGGTGTGAA
TGCTTTCGCA GGAATCAGAT CATTGTTGGC ATCAGCGTGG ATGGCCCCAA AGACATACAC
GATGCCAACC GTTGCTTCCG TAATGGGGAA GGCTCTCATG TCCACTCAAT GCGAGGCATC
GAAGCACTCA AGCGAAACAA AATCCCATTT CATGCGATTG CTGTTGTAAC TGCAACAGCA
ATGGATCATC CAAGTGAGAT GTATCAATTT TTTCGTGACA ATGAAATTCA TTCTATTGGC
TTCAATGTTG AAGAGCAAGA AGGTGAGCAT ACAAGTTCAT CCATGCAAGG TTACGAGCGT
GAAGAACAAT ATCGTCAGTT CCTACAAACC TTCTGGCAGT TAAGTGAGCA GGATGGTTTC
CCTGTTGTAC TGAGGGAGTT TGATCAAGTG ATCAGTCTGA TTCGTGAGAA TCGGCGGCTT
AATCAAAATG AACTCAATCG ACCTTATTCA ATTCTGAGTG TCGATTGGCA GGGTAATTTT
TCAACCTTTG ATCCTGAACT TCTTTCAGTC TCATCAAAGC TTTATGGCAC ATTTGATCTT
GGTAGCATCC GCAAGCTTTC GCTAATGGAA GCAGCTAAGA CCGAGCGATT TCAAACATTG
TGGAAGGATA TGCTATCTGG CGTGCAACGC TGTGAGAAAG AATGCAACTA CTTCGGCTTC
TGCGGAGGCG GCATGGGAAG CAATAAGTTC TGGGAACACG GAAGTCTCAA TTGTAGCGAA
ACAAATGCTT GTCGCTTCAA CAATAAGATA CCTGTAGATG TGCTATTAGA TCGCTTTAAA
TCTAGCCCTC CAATAGACAA CGAAACTCCT TTTTAA
 
Protein sequence
MTTASPDISR FGPIGLVVVQ STSLCNLDCS YCYLPDRKSR KIFDLELLPL LMQRIFESPF 
FGEELSLVWH AGEPLTLPCS YYDRATQLIN EAVEKWTNGT VHVEQHVQTN ATLINDAWCE
CFRRNQIIVG ISVDGPKDIH DANRCFRNGE GSHVHSMRGI EALKRNKIPF HAIAVVTATA
MDHPSEMYQF FRDNEIHSIG FNVEEQEGEH TSSSMQGYER EEQYRQFLQT FWQLSEQDGF
PVVLREFDQV ISLIRENRRL NQNELNRPYS ILSVDWQGNF STFDPELLSV SSKLYGTFDL
GSIRKLSLME AAKTERFQTL WKDMLSGVQR CEKECNYFGF CGGGMGSNKF WEHGSLNCSE
TNACRFNNKI PVDVLLDRFK SSPPIDNETP F
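
The sequence blocks above are printed in space-separated groups of ten characters. A minimal sketch of how they might be cleaned, translated, and checked for GC content follows. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical and not part of any database tool. The codon table is built on the understanding that NCBI translation table 11 assigns the same amino acids to codons as the standard code and differs only in the start codons it permits.

```python
from itertools import product

# Codons enumerated in T, C, A, G order, paired with the amino acids that
# the standard code (and table 11) assigns to them; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean(seq):
    """Remove the spaces and line breaks used for display grouping."""
    return "".join(seq.split()).upper()

def gc_fraction(seq):
    """Return the fraction of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)

def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)

# First three groups and the final row of the gene sequence shown above.
first_groups = "ATGACAACAG CAAGTCCTGA CATCAGTCGC"
last_row = "TCTAGCCCTC CAATAGACAA CGAAACTCCT TTTTAA"

print(translate(first_groups))  # MTTASPDISR
print(translate(last_row))      # SSPPIDNETPF
```

The two translated fragments match the start and end of the listed protein sequence. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content reported in the table.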