Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E2895 |
Symbol | |
ID | 6272760 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | - |
Start bp | 2696772 |
End bp | 2701733 |
Gene Length | 4962 bp |
Protein Length | 1653 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641726839 |
Product | alpha-2-macroglobulin domain protein |
Protein accession | YP_001881311 |
Protein GI | 187733068 |
COG category | [R] General function prediction only |
COG ID | [COG2373] Large extracellular alpha-helical protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAGT TACGCGTAGC CGCCTGCATG CTAATGCTGG CGCTGGCAGG GTGCGACAAC AACGATAACG CGCCAACAGC GGTGAAAAAA GATGCGCCTT CTGAAGTTAC TAAAGCGGCC TCTTCAGAAA ACGCGAGTTC AGCAAAACTC TCCGTGCCGG AGAGACAAAA ACTGGCCCAA CAGAGTGCCG GTAAGGTGCT GACATTGCTG GATCTCTCTG AAGTCCAACT TGATGGTGCA GCCACGCTGG TGCTGACGTT CTCCATCCCT CTCGACCCGG ATCAGGATTT CTCACGCGTT ATTCATGTCG TCGATAAAAA AAGCGGCAAA GTGGATGGTG CCTGGGAGCT GTCAGATAAT CTTAAAGAGC TACGTTTACG CCACCTCGAA CCGAAACGTG ATTTGATCGT TACTATTGGC AAGGAGGTCA AAGCACTCAA CAACGCAACC TTCAGTAAAG ATTACGAAAA AACTATAACT ACCCGCGACA TCCAACCCAG CGTCGGTTTT GCCAGCCGTG GTTCGCTGCT GCCTGGCAAA GTCGTTGAAG GGCTGCCGGT AATGGCGCTC AACGTTAATA ATGTCGATGT TAACTTCTTC CGCGTTAAGC CAGAATCTCT GCCAGCATTC ATTAGCCAAT GGGAATACCG CAATTCGCTG GCGAACTGGC AGTCAGACAA ACTGCTGCAG ATGGCGGATC TGGTCTACAC CGGACGGTTT GATCTCAATC CTGCGCGTAA CACCCGTGAA AAATTATTGC TGCCGCTGGG CGATATCAAA CCGCTTCAGC AGGCGGGCGT GTATCTGGCT GTGATGAATC AGGCTGGACG TTACGATTAC AGTAATCCCG CGACGCTGTT TACGTTAAGT GATATCGGCG TTTCAGCTCA CCGTTATCAC AATCGTCTGG ATATCTTTAC CCAAAGTCTG GAAAACGGCG CGGCCCAGCA AGGAATTGAA GTCTCTTTAT TAAATGAGAA AGGGCAGACT CTGACTCAGG CAACCAGTGA CGCTCAGGGG CATGTGCAGC TGGAAAATGA TAAAAATGCG GCATTACTGC TGGCACGTAA AGACGGTCAG ACAACGCTAC TCGATTTAAA ACTTCCGGCG CTGGACTTAG CAGAATTTAA CATTGCTGGC GCGCCAGGCT ATAGCAAACA GTTTTTCATG TTTGGCCCAC GCGATCTTTA TCGCCCAGGT GAAACGGTAA TCCTCAATGG TTTACTGCGC GATGCAGACG GTAAAGCATT GCCCGATCAC CCCATCAAGT TAGACGTGAT TAAACCTGAC GGGCAGGTAC TCAGGAGCGT CGTTAGTCAG CCGGAGAATG GCCTCTACCA CTTTACCTGG CCACTCGATA GCAATGCGGC AACCGGTATG TGGCATATTC GCGCCAACAC GGGCGATAAC CAGTACCGGA TGTGGGATTT CCACGTCGAA GATTTTATGC CAGAGCGTAT GGCGCTGAAT CTGACCGGTG AGAAAACCCC GCTTACGCCG AATGATGAAG TGAAATTCTC CGTGGTGGGA TACTACCTGT ACGGTGCGCC TGCTAATGGT AATACTTTGC AAGGGCAACT TTTCCTGCGC CCACTGCGTG AGGCTGTGTC AGCCTTACCT GGTTTTGAAT TCGGCGATAT AGCTGCCGAA AACCTTTCCC GCACGCTGGA TGAAGTTCAG TTGACGCTGG ATGATAAAGG GCGCGGCGAA GTTTCTACAG AAAGCCAGTG GAAGGAAACG CATTCCCCAT TGCAGGTTAT TTTCCAGGGC AGTTTGCTGG AATCGGGCGG TCGCCCGGTG ACGCGCCGCG CTGAGCAGGC TATCTGGCCT GCCGATGCAT TGCCGGGGCT CCGTCCGCAG TTCGCCTTGA AATCGGTTTA CGATTATCGC ACTGACAGCA CGGTGAAACA GCCCATTGTT GATGAAGGCA GTAACGCCGC TTTTGACATC GTTTATAGCG ATGCACAAGG CGTGAAAAAA GCCGTGTCGG GCTTGCAGGT GCGCCTGATT CGCGAACGCC GCGATTACTA CTGGAACTGG TCAGAAGATG AAGGCTGGCA GTCACAGTTT GATCAAAAAG ATCTGATCGA AAATGAACAA ACTCTGGATC TGAAAGCGGA CGAAACCGGC AAGGTCAGTT TTCCGGTAGA GTGGGGCGCT TATCGTCTGG AAGTCAAAGC GCCGAATGAA GCGGTCAGTA GTGTTCGTTT CTGGGCTGGC TATAGCTGGC AGGACAACAG CGACGGTAGC GGCGCAGTGC GACCCGACCG TGTCACGCTG AAACTGGATA AAGCCAGTTA TCGCCCTGGC GACACCATTA AGTTGCATAT CGCCGCGCCA ACGGCGGGTA AAGGTTATGC GATGGTCGAG TCCAGTGAAG GGCCGCTGTG GTGGCAAGAG ATTGATGTTC CGGCTCAAGG GCTGGATCTG ACGATTCCGG TCGATAAAAC CTGGAATCGT CATGATCTGT ATTTAAGTAC GCTGGTGGTA CGTCCTGGCG ATAAATCTCG CTCCGCGACG CCAAAACGCG CGGTTGGTGT GTTGCATCTG CCGCTTGGCG ATGAAAACCG TCGCCTCGAT CTGGCGCTGG AAACACCAGC AAAAATGCGT CCCAATCAAC CATTAACCGT GAAAATTAAA GCCAGCACTA AAAATGGCGA GAAGCCTAAA CAGGTGAATG TGCTGGTGTC TGCCGTTGAT AGTGGTGTGC TGAATATTAC TGACTACGTC ACGCCAGATC CGTGGCAGGC GTTCTTTGGT CAGAAACGCT ATGGCGCAGA CATTTACGAT ATTTACGGTC AGGTTATTGA AGGTCAGGGG CGTCTGGCAG CTCTGCGTTT CGGTGGCGAT GGTGATGAGC TGAAACGTGG TGGTAAACCG CCGGTCAATC ACGTCAATAT TGTCGCGCAG CAGGCGCTGC CGGTAACGCT CAACGAACAG GGCGAAGGCT CGGTTACACT GCCGATTGGC GATTTTAACG GTGAATTGCG CGTCATGGCG CAAGCCTGGA CGGCAGATGA TTTCGGTAGC AACGAAAGCA AAGTGATAGT TGCCGCACCG GTGATTGCTG AACTGAACAT GCCGCGCTTT ATGGCGAGTG GCGATACCTC GCGTCTGACG CTGGATATCA CTAATCTTAC CGATAAACCG CAAAAACTGA ACGTTGCCCT GACCGCCAGT GGTTTGCTTG AACTGGTCAG CGATTCACCC GCAGCCGTTG AATTAGCGCC AGGTGTGCGT ACTACGCTGT TTATCCCAGT GCGAGCATTG CCGGGTTATG GCGATGGAGA AATTCAGGCC ACCATTAGCG GGTTAGCGTT ACCGGGTGAA ACCGTTGCCG ATCAGCATAA GCAGTGGAAA ATCGGCGTCC GTCCGGCGTT CCCAGCACAA ACGGTTAATT ACGGTACGGC GTTACAGCCT GGTGAGACAT GGGCGATTCC GGCGGATGGA TTGCAAAACT TCTCGCCTGT TACGCTGGAA GGGCAATTGT TGTTGAGCGG CAAACCACCG CTGAACATTG CACGTTATAT CAAAGAGTTA AAAGCGTATC CGTACGGCTG TCTTGAGCAA ACCGCCAGCG GCCTGTTCCC GTCACTTTAT ACCAACGCAG CCCAACTGCA GGCGTTGGGC ATCAAAGGCG ACAGTGATGA GAAACGCCGT GCATCGGTCG ATATCGGCAT TTCCCGTTTG CTGCAAATGC AACGTGATAA CGGCGGTTTT GCGCTGTGGG ATAAAAACGG TGACGAAGAG TACTGGCTGA CGGCTTACGT GATGGATTTC CTGGTCCGCG CAGGCGAACA GGGTTACAGC GTGCCGACAG ACGCCATTAA CCGGGGTAAT GAGCGTCTGC TGCGCTATTT ACAAGATCCG GGCATGATGT CGATCCCGTA CGCGGATAAT CTCAAAGCCA GTAAATTCGC CGTTCAGTCT TACGCTGCGC TGGTGCTGGC CCGTCAGCAA AAAGCACCAC TGGGTGCGCT GCGTGAAATC TGGGAGCATC GTGCAGATGC CGCTTCTGGT TTACCGCTGC TGCAACTTGG CATTGCGCTG AAAACCATGG GTGATGCAAC GCGTGGTGAA GAAGCGATTG TGCTGGCGCT GAAAACGCCG AGAAATAGTG ATGAGCGGAT ATGGCTGGGT GATTACGGTA GTCCACTGCG CGACAACGCG TTAATGCTCT CCTTGCTGGA AGAAAATAAA CTGCTACCCG ATGAGCAGTA CACTTTGCTG AACACACTTT CGCAGCAGGC GTTTGGTGAA CGCTGTCTAT CGACGCAGGA AAGTAACGCG TTGTTCCTGG CTGCCCGTAC GATTCAGGAT TTACCGGGTA AATGGCAGGC GCAAACCTCT TTCTCAGCTG AGCCGCTGAC AGGCGAGAAA ACGCTAAACA GCAATCTGAA TAGCGATCAA CTTGCCACCT TGCAGGTGAG AAACAGTGGC GATCAGCCGT TATGGTTGCG TATGGATGCC AGCGGTTATC CGCAATCCGC ACCTTTACCG GCGAACAATG TGCTGCAAAT CGAGCGTCAT ATTCTTGGTA CTGATGGTAA GAGCAAATCG CTGGACTCGT TACGTAGCGG CGATCTGGTG CTGGTCTGGT TGCAGGTAAA AGCCAGTAAC AGCGTGCCGG ATGCGTTAGT CGTGGATCTG CTGCCTGCGG GTCTGGAACT GGAAAATCAG AATCTGGCGA ACGGTAGCGC CAGCCTGGAG CAAAGTGGTG GCGAAGTGCA AAACTTACTG AACCAGATGC AGCAGGCGAG CATTAAGCAC ATTGAGTTCC GTGACGATCG CTTTGTGGCG GCGGTTGCCG TTGATGAATA CCAACCGGTA ACGCTGGTGT ATCTGGCGCG GGCGGTGACG CCGGGAACGT ATCAGGTACC GCAACCGATG GTGGAATCAA TGTATGTTCC CCAATGGCGG GCGACCGGCG CGGCTGAAGA TCTGCTGATT GTCCGACCGT AA
|
Protein sequence | MKKLRVAACM LMLALAGCDN NDNAPTAVKK DAPSEVTKAA SSENASSAKL SVPERQKLAQ QSAGKVLTLL DLSEVQLDGA ATLVLTFSIP LDPDQDFSRV IHVVDKKSGK VDGAWELSDN LKELRLRHLE PKRDLIVTIG KEVKALNNAT FSKDYEKTIT TRDIQPSVGF ASRGSLLPGK VVEGLPVMAL NVNNVDVNFF RVKPESLPAF ISQWEYRNSL ANWQSDKLLQ MADLVYTGRF DLNPARNTRE KLLLPLGDIK PLQQAGVYLA VMNQAGRYDY SNPATLFTLS DIGVSAHRYH NRLDIFTQSL ENGAAQQGIE VSLLNEKGQT LTQATSDAQG HVQLENDKNA ALLLARKDGQ TTLLDLKLPA LDLAEFNIAG APGYSKQFFM FGPRDLYRPG ETVILNGLLR DADGKALPDH PIKLDVIKPD GQVLRSVVSQ PENGLYHFTW PLDSNAATGM WHIRANTGDN QYRMWDFHVE DFMPERMALN LTGEKTPLTP NDEVKFSVVG YYLYGAPANG NTLQGQLFLR PLREAVSALP GFEFGDIAAE NLSRTLDEVQ LTLDDKGRGE VSTESQWKET HSPLQVIFQG SLLESGGRPV TRRAEQAIWP ADALPGLRPQ FALKSVYDYR TDSTVKQPIV DEGSNAAFDI VYSDAQGVKK AVSGLQVRLI RERRDYYWNW SEDEGWQSQF DQKDLIENEQ TLDLKADETG KVSFPVEWGA YRLEVKAPNE AVSSVRFWAG YSWQDNSDGS GAVRPDRVTL KLDKASYRPG DTIKLHIAAP TAGKGYAMVE SSEGPLWWQE IDVPAQGLDL TIPVDKTWNR HDLYLSTLVV RPGDKSRSAT PKRAVGVLHL PLGDENRRLD LALETPAKMR PNQPLTVKIK ASTKNGEKPK QVNVLVSAVD SGVLNITDYV TPDPWQAFFG QKRYGADIYD IYGQVIEGQG RLAALRFGGD GDELKRGGKP PVNHVNIVAQ QALPVTLNEQ GEGSVTLPIG DFNGELRVMA QAWTADDFGS NESKVIVAAP VIAELNMPRF MASGDTSRLT LDITNLTDKP QKLNVALTAS GLLELVSDSP AAVELAPGVR TTLFIPVRAL PGYGDGEIQA TISGLALPGE TVADQHKQWK IGVRPAFPAQ TVNYGTALQP GETWAIPADG LQNFSPVTLE GQLLLSGKPP LNIARYIKEL KAYPYGCLEQ TASGLFPSLY TNAAQLQALG IKGDSDEKRR ASVDIGISRL LQMQRDNGGF ALWDKNGDEE YWLTAYVMDF LVRAGEQGYS VPTDAINRGN ERLLRYLQDP GMMSIPYADN LKASKFAVQS YAALVLARQQ KAPLGALREI WEHRADAASG LPLLQLGIAL KTMGDATRGE EAIVLALKTP RNSDERIWLG DYGSPLRDNA LMLSLLEENK LLPDEQYTLL NTLSQQAFGE RCLSTQESNA LFLAARTIQD LPGKWQAQTS FSAEPLTGEK TLNSNLNSDQ LATLQVRNSG DQPLWLRMDA SGYPQSAPLP ANNVLQIERH ILGTDGKSKS LDSLRSGDLV LVWLQVKASN SVPDALVVDL LPAGLELENQ NLANGSASLE QSGGEVQNLL NQMQQASIKH IEFRDDRFVA AVAVDEYQPV TLVYLARAVT PGTYQVPQPM VESMYVPQWR ATGAAEDLLI VRP
|
| |