Gene SbBS512_E2331 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E2331 
SymbolymcA 
ID6271233 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp2118385 
End bp2120481 
Gene Length2097 bp 
Protein Length698 aa 
Translation table11 
GC content52% 
IMG OID641726335 
Productgroup 4 capsule (G4C) polysaccharide, lipoprotein YmcA 
Protein accessionYP_001880818 
Protein GI187732989 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.0175694 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGAAGA ATTCTTATCT TTTAAGCTGC CTGGCCATTG CCGTCTCCAG TGCCTGTCAT 
GCTGAAGTAT TAACCTACCC GGATCCGCTG GGTTCGTCGC AATCAGACTT TGGCGGCACA
GGATTGTTGC AGATGCCAAA TGCGCGCATC GCACCGGAAG GTGAATTCAG CGTCAACTAC
CGGGATAACG ATCAATACCG GTTCTACTCC ACCTCCGTGG CGCTGTTCCC ATGGCTGGAA
GGCACCATTC GTTATACGGA TGTGCGCACA CGCAAATATA GCCAGTGGGA AGATTTCAGC
GGCGATCAGT CATACAAAGA CAAATCATTC GATTTTAAAC TTCGCCTGTG GGAAGAAGGT
TACTGGCTAC CGCAAGTGGC GTTTGGTAAA CGTGATATTG CTGGTACGGG TCTGTTTGAC
GGTGAGTATC TGGTGGCCAG CAAGCAAGCG GGGCCATTTG ATTTCACCCT CGGGATGGCA
TGGGGCTACG CCGGTAATGC GGGCAATATT ACCAACCCGT TTTGCCGGGT GAGCGATAAA
TATTGTCATC GCGCAGAGTC TCACGATGCG GGCGATATCA GCTTTAGCGA TATCTTTCGT
GGCCCGGCTT CCATCTTTGG CGGCATTGAG TATCAAACGC CGTGGAATCC CCTGCGTCTG
AAACTCGAAT ACGATGGCAA CAATTACCAG AATGATTTCG CTGGCAAACT GCCTCAGGCA
AGCCATTTCA ACGTCGGCGC AGTTTATCGC GCTGCCAGCT GGGCAGATCT CAACCTGAGT
TATGAACGCG GTAACACGTT GATGTTTGGC TTCACGTTAC GGACCAATTT CAACGATCTG
CGCCCTGCCC TGCGCGATAC GCCAAAACCG GCATATCAAC CTGCGCCTGA ATCTGAAGGA
TTGCAGTACA CCACAGTAGC GAACCAACTT ACCGCGCTGA AGTACAACGC AGGTTTTGAA
GCACCGGAAA TTCAGCTGCG CGATAAGACG CTGTATATGT CTGGTCAACA ATACAAATAC
CGTGATTCTC GCGAAGCGGT CGATCGTGCC AACCGGATTC TGGTGAATAA CCTGCCGCAA
GGCGTTGAGA AGATTAGCGT GACGCAAAAG CGCGAGCATA TGGCGATGGT GACTACCGAA
ACCGACGTAG CCAGCCTGCG CAAACAGCTG GCTGGTACAG CGCCTGGTCA ATCAGAGCAA
CTTCAACAAC AACGTGTTGA AGCAGAAGAT CTTTCTGCCT TTGGTCGGGG CTACCGTATT
CGTGAAGATC GCTTTAGCTA CTCTTTCAAC CCAACACTTT CACAGTCGCT GGGCGGCCCG
GAAGATTTCT ATATGTTCCA GCTGGGGCTG ATGCCCAGTG CCCGCTACTG GTTTACCGAC
CACCTGCTGC TTGATGGCGG TATTTTCACC AATATTTACA ACAACTACGA CAAGTTTAAG
TCTTCGCTGT TGCCCGCGGA CTCTACCCTG CCCCGCGTGC GCACGCATAT CCGTGATTAC
GTTCGCAATG ACGTTTATCT CAACAACTTG CAGGCGAACT ACTTTGCCGA CTTAGGCAAT
GGTTTCTATG GCCAGGTGTA TGGCGGTTAT CTGGAAACGA TGTACGCCGG TGTCGGTTCC
GAGCTGCTTT ATCGCCCGCT AGATGCCAGC TGGGCGCTGG GTGTGGACGT TAACTACGTG
AAGCAACGTG ACTGGGACAA CATGATGCGC TTCACCGATT ATTCCACGCC AACTGGTTTC
GTGACGGCTT ACTGGAACCC GCCGACGCTC AATGGCGTAC TGATGAAACT TAGCGTTGGG
CAATATCTGG CAAAAGATAA AGGCGCAACG ATCGACGTCG CCAAACGCTT TGACAGCGGC
GTGGCGGTAG GGGTATGGGC GGCAATCAGT AACGTATCTA AAGATGACTA CGGCGAAGGC
GGCTTTAGTA AAGGTTTTTA TATCTCGATT CCATTCGACT TGATGACCAT TGGACCTAAC
CGCAACCGCG CGGTGGTTTC GTGGACACCA TTGACGCGTG ATGGTGGACA AATGCTGTCA
CGCAAATACC AGCTCTATCC AATGACGGCA GAGCGAGAAG TACCGGTTGG ACAATAA
 
Protein sequence
MKKNSYLLSC LAIAVSSACH AEVLTYPDPL GSSQSDFGGT GLLQMPNARI APEGEFSVNY 
RDNDQYRFYS TSVALFPWLE GTIRYTDVRT RKYSQWEDFS GDQSYKDKSF DFKLRLWEEG
YWLPQVAFGK RDIAGTGLFD GEYLVASKQA GPFDFTLGMA WGYAGNAGNI TNPFCRVSDK
YCHRAESHDA GDISFSDIFR GPASIFGGIE YQTPWNPLRL KLEYDGNNYQ NDFAGKLPQA
SHFNVGAVYR AASWADLNLS YERGNTLMFG FTLRTNFNDL RPALRDTPKP AYQPAPESEG
LQYTTVANQL TALKYNAGFE APEIQLRDKT LYMSGQQYKY RDSREAVDRA NRILVNNLPQ
GVEKISVTQK REHMAMVTTE TDVASLRKQL AGTAPGQSEQ LQQQRVEAED LSAFGRGYRI
REDRFSYSFN PTLSQSLGGP EDFYMFQLGL MPSARYWFTD HLLLDGGIFT NIYNNYDKFK
SSLLPADSTL PRVRTHIRDY VRNDVYLNNL QANYFADLGN GFYGQVYGGY LETMYAGVGS
ELLYRPLDAS WALGVDVNYV KQRDWDNMMR FTDYSTPTGF VTAYWNPPTL NGVLMKLSVG
QYLAKDKGAT IDVAKRFDSG VAVGVWAAIS NVSKDDYGEG GFSKGFYISI PFDLMTIGPN
RNRAVVSWTP LTRDGGQMLS RKYQLYPMTA EREVPVGQ