Gene Information |
Locus tag | P9303_04221 |
Symbol | aslB |
ID | 4778100 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 422593 |
End bp | 423768 |
Gene Length | 1176 bp |
Protein Length | 391 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 640085926 |
Product | putative arylsulfatase regulatory protein |
Protein accession | YP_001016439 |
Protein GI | 124022132 |
COG category | [R] General function prediction only |
COG ID | [COG0641] Arylsulfatase regulator (Fe-S oxidoreductase) |
TIGRFAM ID | |
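The coordinate and length fields above are mutually consistent, and the arithmetic can be checked directly. The sketch below assumes 1-based, inclusive coordinates (which the reported Gene Length implies) and a CDS whose final codon is a stop codon. The function names are illustrative only and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # the final codon encodes no residue


length = gene_length_bp(422593, 423768)
print(length)                     # 1176
print(protein_length_aa(length))  # 391
```

Both values match the Gene Length and Protein Length fields of this record.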
| |
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAACAG CAAGTCCTGA CATCAGTCGC TTCGGCCCAA TCGGCCTGGT AGTTGTTCAG TCCACCTCTC TCTGCAATCT GGACTGCTCA TACTGCTATC TGCCAGATCG TAAGAGCCGC AAGATCTTTG ATCTTGAGCT TTTGCCTTTA CTGATGCAAA GAATATTCGA AAGCCCCTTC TTTGGTGAAG AGCTAAGCCT GGTGTGGCAT GCAGGGGAGC CACTGACCCT ACCCTGCAGC TACTACGACA GAGCCACACA ATTAATCAAC GAAGCTGTAG AAAAATGGAC TAATGGAACA GTTCATGTTG AACAGCATGT ACAAACTAAT GCCACACTGA TCAACGATGC TTGGTGTGAA TGCTTTCGCA GGAATCAGAT CATTGTTGGC ATCAGCGTGG ATGGCCCCAA AGACATACAC GATGCCAACC GTTGCTTCCG TAATGGGGAA GGCTCTCATG TCCACTCAAT GCGAGGCATC GAAGCACTCA AGCGAAACAA AATCCCATTT CATGCGATTG CTGTTGTAAC TGCAACAGCA ATGGATCATC CAAGTGAGAT GTATCAATTT TTTCGTGACA ATGAAATTCA TTCTATTGGC TTCAATGTTG AAGAGCAAGA AGGTGAGCAT ACAAGTTCAT CCATGCAAGG TTACGAGCGT GAAGAACAAT ATCGTCAGTT CCTACAAACC TTCTGGCAGT TAAGTGAGCA GGATGGTTTC CCTGTTGTAC TGAGGGAGTT TGATCAAGTG ATCAGTCTGA TTCGTGAGAA TCGGCGGCTT AATCAAAATG AACTCAATCG ACCTTATTCA ATTCTGAGTG TCGATTGGCA GGGTAATTTT TCAACCTTTG ATCCTGAACT TCTTTCAGTC TCATCAAAGC TTTATGGCAC ATTTGATCTT GGTAGCATCC GCAAGCTTTC GCTAATGGAA GCAGCTAAGA CCGAGCGATT TCAAACATTG TGGAAGGATA TGCTATCTGG CGTGCAACGC TGTGAGAAAG AATGCAACTA CTTCGGCTTC TGCGGAGGCG GCATGGGAAG CAATAAGTTC TGGGAACACG GAAGTCTCAA TTGTAGCGAA ACAAATGCTT GTCGCTTCAA CAATAAGATA CCTGTAGATG TGCTATTAGA TCGCTTTAAA TCTAGCCCTC CAATAGACAA CGAAACTCCT TTTTAA
|
Protein sequence | MTTASPDISR FGPIGLVVVQ STSLCNLDCS YCYLPDRKSR KIFDLELLPL LMQRIFESPF FGEELSLVWH AGEPLTLPCS YYDRATQLIN EAVEKWTNGT VHVEQHVQTN ATLINDAWCE CFRRNQIIVG ISVDGPKDIH DANRCFRNGE GSHVHSMRGI EALKRNKIPF HAIAVVTATA MDHPSEMYQF FRDNEIHSIG FNVEEQEGEH TSSSMQGYER EEQYRQFLQT FWQLSEQDGF PVVLREFDQV ISLIRENRRL NQNELNRPYS ILSVDWQGNF STFDPELLSV SSKLYGTFDL GSIRKLSLME AAKTERFQTL WKDMLSGVQR CEKECNYFGF CGGGMGSNKF WEHGSLNCSE TNACRFNNKI PVDVLLDRFK SSPPIDNETP F
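The sequences above are displayed in space-separated blocks of 10 characters. A minimal sketch for turning such a display string back into a contiguous sequence and running a basic sanity check on a CDS follows. `ungroup` and `looks_like_cds` are hypothetical helper names, the stop codons are those of translation table 11 (TAA, TAG, TGA), and the example input is a toy sequence, not the gene from this record.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def ungroup(display: str) -> str:
    """Remove the whitespace used to break a sequence into display blocks."""
    return "".join(display.split()).upper()


def looks_like_cds(seq: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop."""
    return len(seq) >= 6 and len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# Toy input: six codons (ATG ACA ACA GCA TTT TAA) shown in display blocks.
toy = ungroup("ATGACAACAG CATTTTAA")
print(toy, looks_like_cds(toy))
```

The same helpers can be applied to the Gene sequence field to confirm that its ungrouped length equals the reported Gene Length.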