Gene SbBS512_E4939 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E4939 
Symbol 
ID6272770 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp4603253 
End bp4604920 
Gene Length1668 bp 
Protein Length555 aa 
Translation table11 
GC content55% 
IMG OID641728664 
Productputative ABC transporter ATP-binding protein 
Protein accessionYP_001883055 
Protein GI187730227 
COG category[R] General function prediction only 
COG ID[COG0488] ATPase components of ABC transporters with duplicated ATPase domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGCTCAAT TCGTTTATAC CATGCATCGT GTCGGCAAAG TTGTTCCGCC GAAACGTCAT 
ATTTTGAAAA ACATCTCTCT GAGTTTCTTC CCTGGGGCAA AAATTGGTGT CCTGGGTCTG
AACGGCGCGG GTAAGTCTAC CCTGCTGCGC ATTATGGCGG GCATTGATAA AGACATCGAA
GGTGAAGCGC GTCCGCAGCC AGACATCAAG ATTGGTTACC TGCCGCAGGA AGCGCAGCTG
AACCCGGAAC ACACCGTGCG TGAGTCCATT GAAGAAGCGG TTTCTGAAGT GGTTAACGCC
CTGAAACGCC TGGATGAAGT GTATGCGCTG TACGCCGATC CGGATGCCGA TTTTGACAAG
CTGGCCGCTG AACAAGGCCG TCTGGAAGAG ATCATTCAGG CTCACGACGG TCATAACCTG
AACGTACAGC TGGAGCGTGC GGCGGATGCG CTACGTCTGC CGGACTGGGA CGCGAAAATC
GCTAACCTCT CCGGTGGTGA GCGTCGTCGC GTAGCGTTGT GCCGCCTGCT GCTGGAAAAA
CCAGACATGC TGCTGCTCGA CGAACCGACC AACCACCTGG ATGCCGAATC CGTGGCCTGG
CTGGAACGCT TCCTGCACGA CTTCGAGGGC ACCGTGGTGG CGATTACCCA CGACCGTTAC
TTCCTCGATA ACGTTGCAGG CTGGATCCTC GAACTTGACC GCGGTGAAGG TATTCCGTGG
GAAGGCAACT ACTCCTCCTG GCTGGAGCAG AAAGATCAGC GCCTGGCGCA GGAAGCTTCA
CAAGAAGCGG CGCGTCGTAA GTCGATCGAG AAAGAGCTGG AGTGGGTACG TCAGGGAACT
AAAGGCCGCC AGTCGAAAGG TAAAGCACGT CTGGCACGCT TTGAAGAGCT GAACAGCACC
GAATATCAGA AACGTAACGA AACCAACGAA CTGTTTATTC CACCTGGACC GCGTCTGGGC
GATAAAGTGC TGGAAGTCAG CAACCTGCGT AAATCCTACG GTGATCGCCT GCTGATTGAT
GACCTGAGCT TCTCGATCCC GAAAGGGGCA ATCGTCGGGA TCATCGGTCC GAACGGCGCG
GGTAAATCGA CCCTGTTCCG TATGATCTCT GGTCAGGAAC AGCCGGACAG CGGCACCATC
ACTTTAGGTG AAACGGTGAA ACTGGCATCG GTTGATCAGT TCCGTGACTC AATGGATAAC
AGCAAAACCG TTTGGGAAGA AGTTTCCGGC GGGCTGGATA TTATGAAGAT CGGCAACACC
GAGATGCCAA GCCGCGCCTA CGTTGGCCGC TTTAACTTTA AAGGGGTTGA TCAGGGTAAA
CGCGTTGGTG AACTTTCCGG TGGTGAGCGC GGTCGTCTGC ATCTGGCGAA GCTGCTGCAG
GTTGGCGGCA ACATGCTGCT GCTCGACGAA CCAACCAACG ACCTGGATAT CGAAACCCTG
CGCGCGCTGG AAAACGCCCT GCTGGAGTTC CCGGGCTGTG CGATGGTTAT CTCGCACGAC
CGTTGGTTCC TCGACCGTAT CGCCACGCAC ATCCTGGACT ACCAGGATGA AGGTAAAGTT
GAGTTCTTCG AAGGTAACTT TACTGAGTAC GAAGAGTACA AGAAACGCAC GCTGGGCGCA
GACGCACTGG AGCCGAAGCG TATCAAGTAC AAGCGTATTG CGAAGTAA
 
Protein sequence
MAQFVYTMHR VGKVVPPKRH ILKNISLSFF PGAKIGVLGL NGAGKSTLLR IMAGIDKDIE 
GEARPQPDIK IGYLPQEAQL NPEHTVRESI EEAVSEVVNA LKRLDEVYAL YADPDADFDK
LAAEQGRLEE IIQAHDGHNL NVQLERAADA LRLPDWDAKI ANLSGGERRR VALCRLLLEK
PDMLLLDEPT NHLDAESVAW LERFLHDFEG TVVAITHDRY FLDNVAGWIL ELDRGEGIPW
EGNYSSWLEQ KDQRLAQEAS QEAARRKSIE KELEWVRQGT KGRQSKGKAR LARFEELNST
EYQKRNETNE LFIPPGPRLG DKVLEVSNLR KSYGDRLLID DLSFSIPKGA IVGIIGPNGA
GKSTLFRMIS GQEQPDSGTI TLGETVKLAS VDQFRDSMDN SKTVWEEVSG GLDIMKIGNT
EMPSRAYVGR FNFKGVDQGK RVGELSGGER GRLHLAKLLQ VGGNMLLLDE PTNDLDIETL
RALENALLEF PGCAMVISHD RWFLDRIATH ILDYQDEGKV EFFEGNFTEY EEYKKRTLGA
DALEPKRIKY KRIAK