Gene SbBS512_E4120 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E4120 
SymbolaslB 
ID6270852 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp3843482 
End bp3844717 
Gene Length1236 bp 
Protein Length411 aa 
Translation table11 
GC content51% 
IMG OID641727948 
Productarylsulfatase-activating protein AslB 
Protein accessionYP_001882379 
Protein GI187733692 
COG category[R] General function prediction only 
COG ID[COG0641] Arylsulfatase regulator (Fe-S oxidoreductase) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000000574144 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTGCAAC AGGTTCCAAC GCGTGCTTTT CATGTGATGG CGAAACCGAG TGGTTCCGAT 
TGTAATCTGA ACTGTGACTA CTGTTTTTAT CTCGAAAAAC AATCCCTTTA CCGCGAAAAG
CCAGTCACGC ATATGGACGA TGACACGCTG GAAGCGTATG TCCGTCACTA TATCGCTGCC
AGCGAACCGC AAAACGAAGT GGCTTTTACC TGGCAGGGCG GCGAACCAAC GTTACTCGGG
CTGGATTTTT TCCGCTGTGC CGTAAAGTTA CAGGCGAAAT ACGGTGCTGG CAGGAAGATA
AGTAACAGCT TCCAGACTAA CGGCGTGCTG CTCGATGATA AATGGTGTGC ATTTCTGGCA
GAAAATCATT TTCTTGTTGG GTTATCGCTG GACGGTCCGG CTGAGATCCA CAATCAATAT
CGCGTGACCA AAGGTGGCAG ACCAACGCAT AAGCTGGTGA TGCGTGCCCT GACGCTGCTG
CAAAAACATC ATGTCGACTA TAACGTGCTG GTCTGCGTCA ACCGCACCAG CGCGCAGCAA
CCGTTGCAGG TTTATGATTT TTTGTGCGAT GCGGGAGTCG AATTCATCCA GTTTATTCCG
GTGGTCGAGC GCCTGGCTGA TGAAACAGCT GCCAGCGATG GACTGAAACT ACATGCGCCT
GGTGATATTC AGGGGGAACT GACGGAATGG TCTGTGCACC CCGATGAATT TGGTGAATTT
CTGGTGGCGA TTTTTGACCA CTGGATCAAA CGCGACGTCG GCAAGATTTT CGTGATGAAT
ATCGAATGGG CGTTTGCCAA TTTTGTCGGT GCGCCGGGTG CGGTTTGCCA TCATCAGCCA
ACCTGTGGGC GCTCGGTGAT TGTTGAGCAC AACGGTGACG TTTACGCCTG CGATCACTAT
GTTTATCCGC AATATCGGCT GGGGAATATG CATCAGCAAA CAATTGCAGA AATGATCGAT
TCCCCGCAAC AGCAGGTGTT TGGTGAAGAT AAATTTAAGC AATTACCGGC GCAGTGTCGC
AGTTGTAACG TGTTAAAAGC GTGCTGGGGA GGCTGCCCGA AACACCGCTT CATGCTCGAT
GCCAGCGGCA AACCGGGGCT GAATTATTTG TGTGCCGGGT ATCAGCGTTA TTTCCGCCAT
CTACCGCCAT ATCTTAAAGC AATGGCTGAT TTGCTGGCGC ACGGTCGCCC GGCCAGTGAC
ATTATGCAGG CACATTTGCT GGTGGTGAAT AAGTAA
 
Protein sequence
MLQQVPTRAF HVMAKPSGSD CNLNCDYCFY LEKQSLYREK PVTHMDDDTL EAYVRHYIAA 
SEPQNEVAFT WQGGEPTLLG LDFFRCAVKL QAKYGAGRKI SNSFQTNGVL LDDKWCAFLA
ENHFLVGLSL DGPAEIHNQY RVTKGGRPTH KLVMRALTLL QKHHVDYNVL VCVNRTSAQQ
PLQVYDFLCD AGVEFIQFIP VVERLADETA ASDGLKLHAP GDIQGELTEW SVHPDEFGEF
LVAIFDHWIK RDVGKIFVMN IEWAFANFVG APGAVCHHQP TCGRSVIVEH NGDVYACDHY
VYPQYRLGNM HQQTIAEMID SPQQQVFGED KFKQLPAQCR SCNVLKACWG GCPKHRFMLD
ASGKPGLNYL CAGYQRYFRH LPPYLKAMAD LLAHGRPASD IMQAHLLVVN K