Gene SbBS512_E4014 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E4014 
Symbol 
ID6272534 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp3746707 
End bp3747843 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content53% 
IMG OID641727857 
Productauxiliary transport protein, membrane fusion protein family 
Protein accessionYP_001882289 
Protein GI187731943 
COG category[V] Defense mechanisms 
COG ID[COG1566] Multidrug resistance efflux pump 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATCTAT TGATTGTTTT AACTTACGTG GCGCTGGCGT GGGCGGTCTT TAAAATCTTC 
CGCATTCCGG TAAATCAGTG GACGCTGGCG ACGGCGGCGC TGGGAGGCGT GTTTCTGGTG
AGTGGTTTGA TTTTGTTGAT GAACTACAAC CACCCTTACA CTTTTACCGC GCAAAAGGCA
GTGATAGCGA TCCCTATCAC GCCACAGGTG ACGGGAATTG TTACTGAAGT CACTGACAAG
AATAATCAGC TTATTCAAAA GGGCGAGGTG CTTTTTAAGC TCGACCCGGT TCGTTACCAG
GCGCGAGTTG ACAGACTTCA GGCTGACCTG ATGACGGCGA CGCATAATAT AAAGACGCTG
CGTGCGCAGC TCACTGAAGC GCAGGCCAAC ACCACCCAGG TTTCAGCGGA GCGCGACCGT
CTGTTTAAAA ATTATCAACG TTACTTGAAA GGCAGCCAGG CGGCGGTGAA TCCGTTCTCG
GAACGTGACA TCGACGATGC GCGGCAAAAT TTCCTCGCGC AGGATGCGCT GGTGAAAGGC
TCGGTGGCGG AGCAGGCGCA GATCCAGAGC CAGCTCGACA GTATGGTTAA CGGCGAGCAA
TCGCAGATTG TGAGCTTAAG AGCGCAACTT ACTGAAGCAA AATATAACCT TGAGCAGACT
GTCATTCGCG CGCCGAGCAA TGGCTACGTT ACTCAGGTAC TGATCCGCCC AGGTACATAC
GCAGCTGCCT TGCCGCTGCG TCCGGTGATG GTCTTCATCC CCGAGCAAAA ACGGCAAATT
GTCGCCCAAT TTCGGCAAAA CTCGCTGTTA CGTCTGAAAC CCGGCGATGA TGCGGAAGTG
GTGTTTAACG CGCTACCTAG GCAGGTGTTT CACGGCAAAC TGACTAGTAT TTTACCTGTC
GTGCCAGGCG GTTCTTATCA GGCGCAGGGG GGATTGCAAT CATTAACGGT CGTGCCCGGC
ACGGACGGTG TGCTGGGAAC CATTGAACTG GACCCTAACG ATGATATCGA TGCCTTACTC
GACGGCATCT ACGCCCAGGT GGCGGTTTAC TCCGACCATT TCAGCCATGT TTCGGTGATG
CGGAAAGTGC TGCTAAGAAT GACCAGCTGG ATGCATTATC TTTATTTGGA TCATTGA
 
Protein sequence
MDLLIVLTYV ALAWAVFKIF RIPVNQWTLA TAALGGVFLV SGLILLMNYN HPYTFTAQKA 
VIAIPITPQV TGIVTEVTDK NNQLIQKGEV LFKLDPVRYQ ARVDRLQADL MTATHNIKTL
RAQLTEAQAN TTQVSAERDR LFKNYQRYLK GSQAAVNPFS ERDIDDARQN FLAQDALVKG
SVAEQAQIQS QLDSMVNGEQ SQIVSLRAQL TEAKYNLEQT VIRAPSNGYV TQVLIRPGTY
AAALPLRPVM VFIPEQKRQI VAQFRQNSLL RLKPGDDAEV VFNALPRQVF HGKLTSILPV
VPGGSYQAQG GLQSLTVVPG TDGVLGTIEL DPNDDIDALL DGIYAQVAVY SDHFSHVSVM
RKVLLRMTSW MHYLYLDH