Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E4010 |
Symbol | |
ID | 6269185 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | + |
Start bp | 3738846 |
End bp | 3743012 |
Gene Length | 4167 bp |
Protein Length | 1388 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 641727855 |
Product | protein rhsA precursor |
Protein accession | YP_001882287 |
Protein GI | 187730147 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3209] Rhs family protein |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type [TIGR01643] YD repeat (two copies) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGGAA AACCGGCAGC GCGTCAGGGC GACATGACGC AGTATGGCGG TAGCATTGTT CAGGGTTCAG CCGGAGTGCG CATTGGTGCC CCCACCGGCG TGGCCTGTTC GGTGTGCCCC GGCGAAGTGA CGTCCGGCCA TCCGGTCAAT CCCCTGCTCG GTGCAAAGGT CCTTCCCGGT GAAACCGACA TCGCCCTGCC CGGCCCGCTG CCGTTCATCC TCTCCCGCAC CTACAGCAGT TACCGGACAA AAACGCCCGC GCCGGTGGGG AGCCTCGGCC CCGGCTGGAA AATGCCTGCG GATATCCGCT TACAGCTGCG CGATAACACA CTGATACTCA GTGATAACGG CGGCAGAAGC CTGTATTTTG AGCACCTGTT TCCCGGTGAG GACGGTTACA GCCGCAGCGA GTCACTGTGG CTGGTGCGCG GCGGCGTGGC GAAACTGGAT GAAGGTCACC GGCTGGCCGC ACTCTGGCAG GCGCTGCCGG AAGAACTCCG CTTAAGTCCG CATCGTTATC TGGCGACAAA CAGTCCGCAG GGGCCGTGGT GGCTGCTCGG CTGGTGTGAG CGGGTGCCGG AAGCGGATGA GGTGCTGCCT GCGCCGCTGC CGCCGTACCG GGTACTGACC GGGCTGGTGG ACCGCTTCGG GCGCACACAG ACGTTCCACC GCGAAGCCGC CGGTGAATTC AGCGGCGAAA TCACCGGCGT GACGGATGGT GCCGGGCGTC ACTTCCAACT GGTACTGACC ACGCAGGCGC AGCGGGCAGA AGAAGCCCGG CAGCAGGCCA TTTCCGGCGG GACGGAACCG TCCGCTTTTC CTGATACCCT GCCGGGTTAC ACCGAATATG GCCGGGACAA CGGCATCCGT CTGTCTGCCG TGTGGCTGAC GCACGACCCG GAATACCCGG AGAATTTACC TGCCGCGCCG CTGGTGCGCT ATGGCTGGAC GCCGCGCGGC GAACTGGCGG CGGTGTATGA CCGTAGTGGC AAACAGGTGC GCAGCTTTAC TTACGATGAT AAATACCGGG GCCGGATGGT GGCGCACCGT CACACGGGCC GGCCGGAAAT CCGTTACCGT TACGACAGCG ACGGGCGGGT GACAGAACAG CTTAACCCGG CAGGCTTAAG CTACACGTAT CAGTATGAGA AGGACCGCAT CACCATCACC GACAGCCTGA ACCGCCGTGA AGTGCTGCAC ACGCAGGGTG AAGCCGGGCT GAAGCGGGTG GTGAAAAAGG AACACGCGGA CGGCAGCGTC ACGCAGAGTC AGTTTGACGC CGTGGGCAGG CTCAGGACAC AGACGGATGC CGCAGGCCGG ACAACAGAAT ACAGCCCGGA TGTGGTGACG GGCCTCATCA CGCGCATCAC CACGCCGGAT GGCAGGGCAT CGGCGTTTTA CTATAACCAC CACAGCCAGT TAACGTCAGC CACCGGGCCT GACGGGCTGG AAATACGCCG GGAATATGAT GAATGGGGCC GTCTGATTCA GGAAACTGCC CCTGACGGCG ATATCATCCG CTACCGTTAT GATAATCCAC ACAGTGACTT ACCCTGCGCA ACGGAAGATG CCACCGGTAG CCGGAAAACC ATGACGTGGA GCCGTTACGG TCAGTTGCTG AGCTTCACCG ACTGTTCCGG TTATGTAACC CGTTATGACC ATGACCGCTT CGGGCAGATG ACGGCGGTGC ACCGCGAGGA AGGGCTGAGT CAGTACCGCG CATACGACAG CCGTGGACAG TTAATTGCCG TGAAAGACAC GCAGGGCCAT GAAACGCGGT ATGAATACAA CGCCGCCGGT GACCTGACCG CCGTCATTGC CCCGGACGGC AGCAGAAACG GGACACAGTA CGATGCGTGG GGAAAGGCCA TCCGTACCAC GCAGGGCGAG CTGACGCGCA GTATGGAATA CGATGCTGCA GGACGGGTCA TCCGCCTGAC CAGTGAAAAC GGCAGCCACA CCACCTTCCG TTACGATGTA CTCGACCGGC TGATACAGGA AACCGGCTTT GACGGCCGCA CACAGCGTTA TCACCACGAC CTGACCGGCA AACTTATCCG CAGCGAGGAT GAGGGGCTGG TCACCCACTG GCACTATGAC GAAGCAGACC GCCTCACGCA CCGCACCGTG AATGGCGAAA CCGCAGAGCA GTGGCAGTAT GACAAACGTG GCTGGCTGAC AGACATCAGC CATCTCAGCG AAGGGCACCG GGTGACGGTG CATTACAGGT ATGATGAGAA AGGCCGGCTG ACCGGTGAGC GCCAGACGGT GCATCACCCG CAGCCGGAAG CACTGCTCTG GCAGCATGAG ACCAGACACG CTTACAACGC GCAGGGACTG GCGAACCGCT GTATACCGGA CAGCCTGCCC GCCGTGGAAT GGCTGGCCTA CGGCAGCGGT TACCTGGCAG GCATGAAGCT CGGCGACACA CCGCTGGTGG ATTTCACCCG CGACCGCCTG CACCGGGAAA CGCTGCGCCG CTTCGGCCGC TATGAACTCT CCACCGCTTA CACCCCTGCC GGGCAGTTAC AGAGCCAGCA CCTGAACAGC CTGCAGTATG ACCGCGATTA CACCTGGAAC GACAACGGCG AACTCATCCG CATCAGCAGC CCGCGCCAGA CCCGGAGTTA CAGCTACAGC ACCACCGGCA GGCTGACCGG CGTTCACACC ACCGCAGCGA ACCTTGATAT CCGCATCCCG TATGCCACAG ACCCGGCAGG TAACCGCCTG CCCGACCCGG AGCTGCACCC GGACAGCACC CTCAGCATGT GGCCGGATAA CCGTATCGCC CGTGACGCGC ACTATCTTTA CCGGTATGAC CGTCACGGCA GGCTGACAGA GAAAACCGAC CTCATCCCGG AAGGGGGTAT CCGCACGGAT GATGAGCGCA CCCACCGGTA CCATTACGAC AGTCAGCACC GGCTGGTGCA CTACACGCGG ACACAATATG CAGAGCCGCT GGTCGAAAGT CGCTATCTTT ACGACCCGCT GGGCCGCAGG GTGGCAAAAC GGGTGTGGCG GCGTGAACGT GACCTGACGG GCTGGATGTC GCTGTCACGG AAACCGGAAG TGACCTGGTA CGGCTGGGAC GGCGACCGGC TGACCACAAT ACAGAACGAC AGGAGCCGCA TCCAGACGAT TTATCAGCCG GGGAGCTTCA CGCCACTCAT CAGAGTTGAA ACCGCCACCG GTGAGCTGGC GAAAACGCAG CGCCGCAGCC TGGCGGATAC CCTTCAGCAG TCCGGCGGCG AAGACGGTGG CAGTGTGGTG TTCCCGCCGG TGCTGGTGCA GATGCTCGAC CGGCTGGAAA GTGAAATCCT GGCTGACCGG GTGAGTGAGG AAAGCCGCCG CTGGCTGGCA TCGTGCGGCC TGACGGTGGC GCAGATGCAA AGCCAGATGG ACCCGGTGTA CACGCCGGCG CGAAAAATCC ACCTGTACCA CTGCGACCAT CGCGGCCTGC CTCTGGCGCT CATCAGCACG GAAGGGGCAA CAGCGTGGTG CGCAGAATAT GATGAATGGG GCAACCTGCT GAGTGATGAG AACCCGCATC ATCTGCAGCA GCTCATTCGC CTGCCGGGTC AGCAGTATGA TGAGGAGTCC GGCCTGTATT ACAACCGCCA CCGCTATTAT GACCCGCTGC AGGGGCGGTA TATCACTCAG GATCCGATTG GGCTGAAGGG GGGATGGAAT TTTTATCAGT ATCCGTTGAA TCCGGTCATA AATGTAGATC CGCAAGGTTT GGTTGATATA AATTTATACC CCGAAAGTGA TCTTATCCAT TCTGTAGCTG ATGAGATTAA TATCCCAGGC GTTTTCACAA TCGGGGGGCA TGGTACCCCC ACATCTATTG AATCCGCAAC GCGCAGTATC ATGACAGCTA AAGATCTAGC ATATCTAATT AAATTTGATG GGAATTATAA AGATGGGATG ACAGTTTGGT TATTTTCTTG TAATACAAGT AAAGGACAAA ATTCATTTGC TAGCCAATTA GCTAAAGAGT TACATACAAA TGTAATAGGA CCTGACACGC TATGGACGTG GTGGGGGCGA GGAACTAATG GTAAGTTAAA AATGGATACA GTGCTAACAG CACCAACGAA CCTTAATTCA AATAAGGATC TAATGGCTAT AACAACAAAA GACCTGGTAA TGACTCCAAC TTATTGA
|
Protein sequence | MSGKPAARQG DMTQYGGSIV QGSAGVRIGA PTGVACSVCP GEVTSGHPVN PLLGAKVLPG ETDIALPGPL PFILSRTYSS YRTKTPAPVG SLGPGWKMPA DIRLQLRDNT LILSDNGGRS LYFEHLFPGE DGYSRSESLW LVRGGVAKLD EGHRLAALWQ ALPEELRLSP HRYLATNSPQ GPWWLLGWCE RVPEADEVLP APLPPYRVLT GLVDRFGRTQ TFHREAAGEF SGEITGVTDG AGRHFQLVLT TQAQRAEEAR QQAISGGTEP SAFPDTLPGY TEYGRDNGIR LSAVWLTHDP EYPENLPAAP LVRYGWTPRG ELAAVYDRSG KQVRSFTYDD KYRGRMVAHR HTGRPEIRYR YDSDGRVTEQ LNPAGLSYTY QYEKDRITIT DSLNRREVLH TQGEAGLKRV VKKEHADGSV TQSQFDAVGR LRTQTDAAGR TTEYSPDVVT GLITRITTPD GRASAFYYNH HSQLTSATGP DGLEIRREYD EWGRLIQETA PDGDIIRYRY DNPHSDLPCA TEDATGSRKT MTWSRYGQLL SFTDCSGYVT RYDHDRFGQM TAVHREEGLS QYRAYDSRGQ LIAVKDTQGH ETRYEYNAAG DLTAVIAPDG SRNGTQYDAW GKAIRTTQGE LTRSMEYDAA GRVIRLTSEN GSHTTFRYDV LDRLIQETGF DGRTQRYHHD LTGKLIRSED EGLVTHWHYD EADRLTHRTV NGETAEQWQY DKRGWLTDIS HLSEGHRVTV HYRYDEKGRL TGERQTVHHP QPEALLWQHE TRHAYNAQGL ANRCIPDSLP AVEWLAYGSG YLAGMKLGDT PLVDFTRDRL HRETLRRFGR YELSTAYTPA GQLQSQHLNS LQYDRDYTWN DNGELIRISS PRQTRSYSYS TTGRLTGVHT TAANLDIRIP YATDPAGNRL PDPELHPDST LSMWPDNRIA RDAHYLYRYD RHGRLTEKTD LIPEGGIRTD DERTHRYHYD SQHRLVHYTR TQYAEPLVES RYLYDPLGRR VAKRVWRRER DLTGWMSLSR KPEVTWYGWD GDRLTTIQND RSRIQTIYQP GSFTPLIRVE TATGELAKTQ RRSLADTLQQ SGGEDGGSVV FPPVLVQMLD RLESEILADR VSEESRRWLA SCGLTVAQMQ SQMDPVYTPA RKIHLYHCDH RGLPLALIST EGATAWCAEY DEWGNLLSDE NPHHLQQLIR LPGQQYDEES GLYYNRHRYY DPLQGRYITQ DPIGLKGGWN FYQYPLNPVI NVDPQGLVDI NLYPESDLIH SVADEINIPG VFTIGGHGTP TSIESATRSI MTAKDLAYLI KFDGNYKDGM TVWLFSCNTS KGQNSFASQL AKELHTNVIG PDTLWTWWGR GTNGKLKMDT VLTAPTNLNS NKDLMAITTK DLVMTPTY
|
| |