Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E4106 |
Symbol | |
ID | 6272022 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | + |
Start bp | 3830677 |
End bp | 3832386 |
Gene Length | 1710 bp |
Protein Length | 569 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641727936 |
Product | AsmA family protein |
Protein accession | YP_001882368 |
Protein GI | 187731979 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 53 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAATTTA TTGGGAAGCT GCTTCTCTAC ATTCTCATCG CTCTGTTAGT GGTGATCGCT GGCCTCTATT TTCTTCTGCA AACCCGCTGG GGAGCAGAAC ATATCAGCGC ATGGGTTTCC GAGAATAGCG ACTATCATCT GGCCTTCGGG GCGATGGATC ACCGTTTTTC CGCGCCATCT CATATCGTGC TGGAGAATGT CACGTTTGGT CGTGATGGTC AGCCCGCGAC CCTGGTGGCC AAAAGTGTCG ACATTGCGCT AAGCAGTCGG CAACTGACCG AACCACGCCA TGTCGATACC ATCCTGCTGG AAAACGGGAC GCTGAATCTC ACCGACCAGA CCGCGCCGCT ACCGTTCAAA GCCGATCGTC TGCAACTGCG TGATATGGCG TTTAATAGCC CGAATAGCGA ATGGAAACTG AGCGCGCAGC GGGTAAATGG CGGCGTAGTT CCGTGGTCAC CAAAAGCCGG TAAAGTGCTG GGTACGAAGG CGCAGATTCA GTTTAGTGCC GGATCGCTTT CGCTCAATGA TGTTCCTGCC ACCAATGTAC TGATTGAAGG CAGTATTGAT AACGATCGCG TTACGCTGAC TAACCTGGGT GCCGACATCG CCCGCGGGAC ATTAACCGGA AACGCGCAGC GTAACGCCGA CGGCAGCTGG CAAGTGGAAA ACCTGCGCAT GGCGGATATA CGTCTACAAA GCGAAAAATC GCTAACCGAC TTCTTTGCGC CATTACGCTC TGTCCCGTCG TTGCAGATTG GTCGCCTGGA AGTGATCGAT GCTCGTTTGC AAGGTCCGGA CTGGGCGGTG ACCGACCTCG ATCTCAGCTT GCACAACATG ACCTTCAGTA AAGATGACTG GCAGACACAA GAAGGCAAAC TGTCGATGAA CGCTAGCGAG TTCATTTATG GTTCGCTGCA TTTATTTGAC CCGATTATAA ACGCGGAATT TTCCCCGCAG GGCGTAGCGC TGCGCCAGTT CACCAGCCGC TGGGAAGGGG GTATGGTCAG AACGTCAGGG AACTGGCTGC GTGACGGGAA AACGTTGATC CTTGATGATG CGGCAATTGC CGGGCTGGAA TATACCTTGC CGAAAAACTG GCAACAGTTG TGGATGGAAA CGACACCCGG TTGGTTAAAC AGCCTGCAAC TGAAGAGATT TAGCGCCAGC CGCAATCTGA TCATTGATAT CGACCCTGAC TTCCCGTGGC AGCTCACCAC GCTCGATGGT TACGGTGCCA ACCTGACGCT GGTTACCGAT CATAAATGGG GCGTCTGGAG TGGCTCGGCG AATCTGAATG CCGCCGCCGC GACATTCAAT CGTGTTGATG TTCGTCGCCC GTCGCTGGCG CTGACCGCCA ACAGCAGCAC GGTGAATATC AGCGAACTGA GTGCATTTAC TGAAAAAGGC ATTCTGGAAG CCACTGCCAG TGTTTCACAA ACGCCACAAC GTCAGACCCA TATCAGCCTG AATGGACGCG GTGTGCCGGT GAATATTTTG CAACAATGGG GATGGCCTGA ATTACCGTTG ACTGGCGACG GCAATATTCA GCTTACCGCC AGTGGAGATA TTCAGGCCAA TGCCCCGCTG AAACCTACGG TTAGCGGGCA ATTGCATGCC GTGAACGCCG CAAAGCAGCA AGTGACTCAA ACCATGAATG CTGGCATCGT TTCCAGCGGT GAAGTTACGT CGGCAGAGCT GGTGCAGTAA
|
Protein sequence | MKFIGKLLLY ILIALLVVIA GLYFLLQTRW GAEHISAWVS ENSDYHLAFG AMDHRFSAPS HIVLENVTFG RDGQPATLVA KSVDIALSSR QLTEPRHVDT ILLENGTLNL TDQTAPLPFK ADRLQLRDMA FNSPNSEWKL SAQRVNGGVV PWSPKAGKVL GTKAQIQFSA GSLSLNDVPA TNVLIEGSID NDRVTLTNLG ADIARGTLTG NAQRNADGSW QVENLRMADI RLQSEKSLTD FFAPLRSVPS LQIGRLEVID ARLQGPDWAV TDLDLSLHNM TFSKDDWQTQ EGKLSMNASE FIYGSLHLFD PIINAEFSPQ GVALRQFTSR WEGGMVRTSG NWLRDGKTLI LDDAAIAGLE YTLPKNWQQL WMETTPGWLN SLQLKRFSAS RNLIIDIDPD FPWQLTTLDG YGANLTLVTD HKWGVWSGSA NLNAAAATFN RVDVRRPSLA LTANSSTVNI SELSAFTEKG ILEATASVSQ TPQRQTHISL NGRGVPVNIL QQWGWPELPL TGDGNIQLTA SGDIQANAPL KPTVSGQLHA VNAAKQQVTQ TMNAGIVSSG EVTSAELVQ
|
| |