Gene SbBS512_E4010 details


Gene Information

Locus tag: SbBS512_E4010
Symbol:
ID: 6269185
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shigella boydii CDC 3083-94
Kingdom: Bacteria
Replicon accession: NC_010658
Strand:
Start bp: 3738846
End bp: 3743012
Gene Length: 4167 bp
Protein Length: 1388 aa
Translation table: 11
GC content: 59%
IMG OID: 641727855
Product: protein rhsA precursor
Protein accession: YP_001882287
Protein GI: 187730147
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3209] Rhs family protein
TIGRFAM ID: [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type
            [TIGR01643] YD repeat (two copies)
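
The coordinate and length fields in this record are arithmetically linked. The sketch below shows one way to cross-check them, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon. The function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
START_BP = 3738846
END_BP = 3743012


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 4167
print(protein_length(length_bp))  # 1388
```

Both values agree with the Gene Length and Protein Length fields of the record.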


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCGGAA AACCGGCAGC GCGTCAGGGC GACATGACGC AGTATGGCGG TAGCATTGTT 
CAGGGTTCAG CCGGAGTGCG CATTGGTGCC CCCACCGGCG TGGCCTGTTC GGTGTGCCCC
GGCGAAGTGA CGTCCGGCCA TCCGGTCAAT CCCCTGCTCG GTGCAAAGGT CCTTCCCGGT
GAAACCGACA TCGCCCTGCC CGGCCCGCTG CCGTTCATCC TCTCCCGCAC CTACAGCAGT
TACCGGACAA AAACGCCCGC GCCGGTGGGG AGCCTCGGCC CCGGCTGGAA AATGCCTGCG
GATATCCGCT TACAGCTGCG CGATAACACA CTGATACTCA GTGATAACGG CGGCAGAAGC
CTGTATTTTG AGCACCTGTT TCCCGGTGAG GACGGTTACA GCCGCAGCGA GTCACTGTGG
CTGGTGCGCG GCGGCGTGGC GAAACTGGAT GAAGGTCACC GGCTGGCCGC ACTCTGGCAG
GCGCTGCCGG AAGAACTCCG CTTAAGTCCG CATCGTTATC TGGCGACAAA CAGTCCGCAG
GGGCCGTGGT GGCTGCTCGG CTGGTGTGAG CGGGTGCCGG AAGCGGATGA GGTGCTGCCT
GCGCCGCTGC CGCCGTACCG GGTACTGACC GGGCTGGTGG ACCGCTTCGG GCGCACACAG
ACGTTCCACC GCGAAGCCGC CGGTGAATTC AGCGGCGAAA TCACCGGCGT GACGGATGGT
GCCGGGCGTC ACTTCCAACT GGTACTGACC ACGCAGGCGC AGCGGGCAGA AGAAGCCCGG
CAGCAGGCCA TTTCCGGCGG GACGGAACCG TCCGCTTTTC CTGATACCCT GCCGGGTTAC
ACCGAATATG GCCGGGACAA CGGCATCCGT CTGTCTGCCG TGTGGCTGAC GCACGACCCG
GAATACCCGG AGAATTTACC TGCCGCGCCG CTGGTGCGCT ATGGCTGGAC GCCGCGCGGC
GAACTGGCGG CGGTGTATGA CCGTAGTGGC AAACAGGTGC GCAGCTTTAC TTACGATGAT
AAATACCGGG GCCGGATGGT GGCGCACCGT CACACGGGCC GGCCGGAAAT CCGTTACCGT
TACGACAGCG ACGGGCGGGT GACAGAACAG CTTAACCCGG CAGGCTTAAG CTACACGTAT
CAGTATGAGA AGGACCGCAT CACCATCACC GACAGCCTGA ACCGCCGTGA AGTGCTGCAC
ACGCAGGGTG AAGCCGGGCT GAAGCGGGTG GTGAAAAAGG AACACGCGGA CGGCAGCGTC
ACGCAGAGTC AGTTTGACGC CGTGGGCAGG CTCAGGACAC AGACGGATGC CGCAGGCCGG
ACAACAGAAT ACAGCCCGGA TGTGGTGACG GGCCTCATCA CGCGCATCAC CACGCCGGAT
GGCAGGGCAT CGGCGTTTTA CTATAACCAC CACAGCCAGT TAACGTCAGC CACCGGGCCT
GACGGGCTGG AAATACGCCG GGAATATGAT GAATGGGGCC GTCTGATTCA GGAAACTGCC
CCTGACGGCG ATATCATCCG CTACCGTTAT GATAATCCAC ACAGTGACTT ACCCTGCGCA
ACGGAAGATG CCACCGGTAG CCGGAAAACC ATGACGTGGA GCCGTTACGG TCAGTTGCTG
AGCTTCACCG ACTGTTCCGG TTATGTAACC CGTTATGACC ATGACCGCTT CGGGCAGATG
ACGGCGGTGC ACCGCGAGGA AGGGCTGAGT CAGTACCGCG CATACGACAG CCGTGGACAG
TTAATTGCCG TGAAAGACAC GCAGGGCCAT GAAACGCGGT ATGAATACAA CGCCGCCGGT
GACCTGACCG CCGTCATTGC CCCGGACGGC AGCAGAAACG GGACACAGTA CGATGCGTGG
GGAAAGGCCA TCCGTACCAC GCAGGGCGAG CTGACGCGCA GTATGGAATA CGATGCTGCA
GGACGGGTCA TCCGCCTGAC CAGTGAAAAC GGCAGCCACA CCACCTTCCG TTACGATGTA
CTCGACCGGC TGATACAGGA AACCGGCTTT GACGGCCGCA CACAGCGTTA TCACCACGAC
CTGACCGGCA AACTTATCCG CAGCGAGGAT GAGGGGCTGG TCACCCACTG GCACTATGAC
GAAGCAGACC GCCTCACGCA CCGCACCGTG AATGGCGAAA CCGCAGAGCA GTGGCAGTAT
GACAAACGTG GCTGGCTGAC AGACATCAGC CATCTCAGCG AAGGGCACCG GGTGACGGTG
CATTACAGGT ATGATGAGAA AGGCCGGCTG ACCGGTGAGC GCCAGACGGT GCATCACCCG
CAGCCGGAAG CACTGCTCTG GCAGCATGAG ACCAGACACG CTTACAACGC GCAGGGACTG
GCGAACCGCT GTATACCGGA CAGCCTGCCC GCCGTGGAAT GGCTGGCCTA CGGCAGCGGT
TACCTGGCAG GCATGAAGCT CGGCGACACA CCGCTGGTGG ATTTCACCCG CGACCGCCTG
CACCGGGAAA CGCTGCGCCG CTTCGGCCGC TATGAACTCT CCACCGCTTA CACCCCTGCC
GGGCAGTTAC AGAGCCAGCA CCTGAACAGC CTGCAGTATG ACCGCGATTA CACCTGGAAC
GACAACGGCG AACTCATCCG CATCAGCAGC CCGCGCCAGA CCCGGAGTTA CAGCTACAGC
ACCACCGGCA GGCTGACCGG CGTTCACACC ACCGCAGCGA ACCTTGATAT CCGCATCCCG
TATGCCACAG ACCCGGCAGG TAACCGCCTG CCCGACCCGG AGCTGCACCC GGACAGCACC
CTCAGCATGT GGCCGGATAA CCGTATCGCC CGTGACGCGC ACTATCTTTA CCGGTATGAC
CGTCACGGCA GGCTGACAGA GAAAACCGAC CTCATCCCGG AAGGGGGTAT CCGCACGGAT
GATGAGCGCA CCCACCGGTA CCATTACGAC AGTCAGCACC GGCTGGTGCA CTACACGCGG
ACACAATATG CAGAGCCGCT GGTCGAAAGT CGCTATCTTT ACGACCCGCT GGGCCGCAGG
GTGGCAAAAC GGGTGTGGCG GCGTGAACGT GACCTGACGG GCTGGATGTC GCTGTCACGG
AAACCGGAAG TGACCTGGTA CGGCTGGGAC GGCGACCGGC TGACCACAAT ACAGAACGAC
AGGAGCCGCA TCCAGACGAT TTATCAGCCG GGGAGCTTCA CGCCACTCAT CAGAGTTGAA
ACCGCCACCG GTGAGCTGGC GAAAACGCAG CGCCGCAGCC TGGCGGATAC CCTTCAGCAG
TCCGGCGGCG AAGACGGTGG CAGTGTGGTG TTCCCGCCGG TGCTGGTGCA GATGCTCGAC
CGGCTGGAAA GTGAAATCCT GGCTGACCGG GTGAGTGAGG AAAGCCGCCG CTGGCTGGCA
TCGTGCGGCC TGACGGTGGC GCAGATGCAA AGCCAGATGG ACCCGGTGTA CACGCCGGCG
CGAAAAATCC ACCTGTACCA CTGCGACCAT CGCGGCCTGC CTCTGGCGCT CATCAGCACG
GAAGGGGCAA CAGCGTGGTG CGCAGAATAT GATGAATGGG GCAACCTGCT GAGTGATGAG
AACCCGCATC ATCTGCAGCA GCTCATTCGC CTGCCGGGTC AGCAGTATGA TGAGGAGTCC
GGCCTGTATT ACAACCGCCA CCGCTATTAT GACCCGCTGC AGGGGCGGTA TATCACTCAG
GATCCGATTG GGCTGAAGGG GGGATGGAAT TTTTATCAGT ATCCGTTGAA TCCGGTCATA
AATGTAGATC CGCAAGGTTT GGTTGATATA AATTTATACC CCGAAAGTGA TCTTATCCAT
TCTGTAGCTG ATGAGATTAA TATCCCAGGC GTTTTCACAA TCGGGGGGCA TGGTACCCCC
ACATCTATTG AATCCGCAAC GCGCAGTATC ATGACAGCTA AAGATCTAGC ATATCTAATT
AAATTTGATG GGAATTATAA AGATGGGATG ACAGTTTGGT TATTTTCTTG TAATACAAGT
AAAGGACAAA ATTCATTTGC TAGCCAATTA GCTAAAGAGT TACATACAAA TGTAATAGGA
CCTGACACGC TATGGACGTG GTGGGGGCGA GGAACTAATG GTAAGTTAAA AATGGATACA
GTGCTAACAG CACCAACGAA CCTTAATTCA AATAAGGATC TAATGGCTAT AACAACAAAA
GACCTGGTAA TGACTCCAAC TTATTGA
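
The GC content field can be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted as whitespace-separated 10-base blocks as shown in this record. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first sequence line is used here for brevity.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence from this record.
first_line = "ATGAGCGGAA AACCGGCAGC GCGTCAGGGC GACATGACGC AGTATGGCGG TAGCATTGTT"
seq = clean_sequence(first_line)
print(len(seq))                   # 60
print(f"{gc_fraction(seq):.1%}")  # GC share of these 60 bases only
```

Passing the full 4167 bp sequence to `gc_fraction` would give the gene-wide value to compare with the reported figure.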
 
Protein sequence
MSGKPAARQG DMTQYGGSIV QGSAGVRIGA PTGVACSVCP GEVTSGHPVN PLLGAKVLPG 
ETDIALPGPL PFILSRTYSS YRTKTPAPVG SLGPGWKMPA DIRLQLRDNT LILSDNGGRS
LYFEHLFPGE DGYSRSESLW LVRGGVAKLD EGHRLAALWQ ALPEELRLSP HRYLATNSPQ
GPWWLLGWCE RVPEADEVLP APLPPYRVLT GLVDRFGRTQ TFHREAAGEF SGEITGVTDG
AGRHFQLVLT TQAQRAEEAR QQAISGGTEP SAFPDTLPGY TEYGRDNGIR LSAVWLTHDP
EYPENLPAAP LVRYGWTPRG ELAAVYDRSG KQVRSFTYDD KYRGRMVAHR HTGRPEIRYR
YDSDGRVTEQ LNPAGLSYTY QYEKDRITIT DSLNRREVLH TQGEAGLKRV VKKEHADGSV
TQSQFDAVGR LRTQTDAAGR TTEYSPDVVT GLITRITTPD GRASAFYYNH HSQLTSATGP
DGLEIRREYD EWGRLIQETA PDGDIIRYRY DNPHSDLPCA TEDATGSRKT MTWSRYGQLL
SFTDCSGYVT RYDHDRFGQM TAVHREEGLS QYRAYDSRGQ LIAVKDTQGH ETRYEYNAAG
DLTAVIAPDG SRNGTQYDAW GKAIRTTQGE LTRSMEYDAA GRVIRLTSEN GSHTTFRYDV
LDRLIQETGF DGRTQRYHHD LTGKLIRSED EGLVTHWHYD EADRLTHRTV NGETAEQWQY
DKRGWLTDIS HLSEGHRVTV HYRYDEKGRL TGERQTVHHP QPEALLWQHE TRHAYNAQGL
ANRCIPDSLP AVEWLAYGSG YLAGMKLGDT PLVDFTRDRL HRETLRRFGR YELSTAYTPA
GQLQSQHLNS LQYDRDYTWN DNGELIRISS PRQTRSYSYS TTGRLTGVHT TAANLDIRIP
YATDPAGNRL PDPELHPDST LSMWPDNRIA RDAHYLYRYD RHGRLTEKTD LIPEGGIRTD
DERTHRYHYD SQHRLVHYTR TQYAEPLVES RYLYDPLGRR VAKRVWRRER DLTGWMSLSR
KPEVTWYGWD GDRLTTIQND RSRIQTIYQP GSFTPLIRVE TATGELAKTQ RRSLADTLQQ
SGGEDGGSVV FPPVLVQMLD RLESEILADR VSEESRRWLA SCGLTVAQMQ SQMDPVYTPA
RKIHLYHCDH RGLPLALIST EGATAWCAEY DEWGNLLSDE NPHHLQQLIR LPGQQYDEES
GLYYNRHRYY DPLQGRYITQ DPIGLKGGWN FYQYPLNPVI NVDPQGLVDI NLYPESDLIH
SVADEINIPG VFTIGGHGTP TSIESATRSI MTAKDLAYLI KFDGNYKDGM TVWLFSCNTS
KGQNSFASQL AKELHTNVIG PDTLWTWWGR GTNGKLKMDT VLTAPTNLNS NKDLMAITTK
DLVMTPTY
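
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is a hedged illustration: it builds the codon table from the standard TCAG ordering, which gives the same amino acid assignments as translation table 11 (table 11 differs from the standard code only in its permitted start codons). The function name `translate` is illustrative, and only the first and last codons of the gene are used as input.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 27 bases of the gene sequence in this record.
print(translate("ATGAGCGGAA AACCGGCAGC GCGTCAGGGC"))   # MSGKPAARQG
print(translate("GACCTGGTAA TGACTCCAAC TTATTGA"))      # DLVMTPTY
```

Both outputs match the start and end of the protein sequence listed above.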