Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_3990 |
Symbol | |
ID | 6146307 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 4068346 |
End bp | 4070055 |
Gene Length | 1710 bp |
Protein Length | 569 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641618816 |
Product | AsmA family protein |
Protein accession | YP_001745955 |
Protein GI | 170683972 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 57 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAATTTA TTGGGAAGCT GCTTCTCTAC ATTCTCATCG CTCTGTTAGT GGTGATCGCT GGCCTCTATT TTCTGCTGCA AACTCGCTGG GGAGCAGAAC ATATCAGCGC ATGGGTTTCC GAGAATAGCG ACTATCATCT GGCCTTCGGG GCGATGGATC ACCGTTTTTC CGCGCCATCT CATATCGTGC TGGAGAACGT CACGTTTGGC CGTGATGGCC AGCCCGCGAC CCTGGTGGCA AAAAGTGTCG ACATTGCGCT AAGCAGTCGG CAACTGACCG AACCACGCCA TGTCGATACC ATCCTGCTGG AAAACGGGAC GCTGAATCTC ACCGACCAGA CCGCGCCGCT ACCGTTCAAA GCCGATCGTC TGCAACTACG TGATATGGCG TTTAATAGCC CGAATAGCGA ATGGAAACTG AGCGCGCAGC GGGTAAATGG CGGCGTGGTT CCTTGGTCAC CAGAAGCCGG TAAAGTGCTG GGTACGAAGG CGCAGATTCA GTTTAGTGCC GGATCGCTTT CGCTCAATGA TGTTCCTGCC ACCAATGTAC TGATTGAAGG CAGTATTGAT AACGATCGCG TTACGCTGAC TAACCTGGGT GCCGACATCG CCCGCGGGAC ATTAACCGGA AACGCGCAGC GTAACGCTGA CGGCAGCTGG CAAGTGGAAA ACCTGCGCAT GGCGGATATC CGTCTACAAA GCGAAAAATC GCTAACCGAC TTCTTTGCGC CATTACGCTC TGTCCCGTCG TTGCAGATTG GTCGCCTGGA AGTCATCGAC GCTCGTTTGC AAGGTCCGGA CTGGGCGGTG ACCGACCTCG ATCTCAGCTT GCGCAACATG ACCTTCAGTA AAGATGACTG GCAGACACAG GAAGGTAAAC TGTCGATGAA CGCCAGCGAG TTTATTTATG GCTCGCTGCA TTTATTTGAT CCGATTATAA ACGCGGAATT TTCCCCGCAG GGCGTAGCAC TGCGCCAGTT CACCAGCCGC TGGGAAGGGG GCATGGTCAG AACGTCAGGG AACTGGTTGC GTGACGGGAA AATGTTGATC CTCGATGATG CGGCAATTGC CGGACTGGAA TATACCTTGC CAAAAAACTG GCAACAGTTG TGGATGGAAA CGACACCCGG TTGGTTAAAC AGCCTGCAAC TGAAGAGATT TAGCGCCAGC CGCAATCTGA TCATTGATAT CGACCCTGAC TTCCCGTGGC AGCTCACCGC GCTCGATGGT TACGGTGCCA ACCTGACGCT GGTTACCGAT CATAAATGGG GCGTCTGGAG TGGTTCGGCG AATCTGAATG CCGCCGCCGC GACATTCAAT CGTGTTGATG TTCGTCGCCC ATCGCTGGCG CTGACCGCCA ACAGCAGCAC GGTGAATATC AGCGAACTGA GTGCATTTAC TGAAAAAGGC ATTCTGGAAG CCACCGCCAG TGTTTCACAA ACGCCACAAC GTCAGACCCA TATCAGCCTG AATGGACGCG GTGTGCCGGT GAATATTTTG CAACAGTGGG GATGGCCTGA ATTACCGTTG ACTGGCGACG GCAATATTCA GCTTACCGCC AGCGGCAATA TTCAGGCCAA TATCCCGCTG AAACCTACGG TTAGCGGGCA ATTGCATGCC GTGAATGCCG CAAAGCAGCA AGTGACTCAA ACCATGAATG CGGGCGTAGT TTCCAGTGGT GAAGTTACGT CGACGGAGTC GGTGCAGTAA
|
Protein sequence | MKFIGKLLLY ILIALLVVIA GLYFLLQTRW GAEHISAWVS ENSDYHLAFG AMDHRFSAPS HIVLENVTFG RDGQPATLVA KSVDIALSSR QLTEPRHVDT ILLENGTLNL TDQTAPLPFK ADRLQLRDMA FNSPNSEWKL SAQRVNGGVV PWSPEAGKVL GTKAQIQFSA GSLSLNDVPA TNVLIEGSID NDRVTLTNLG ADIARGTLTG NAQRNADGSW QVENLRMADI RLQSEKSLTD FFAPLRSVPS LQIGRLEVID ARLQGPDWAV TDLDLSLRNM TFSKDDWQTQ EGKLSMNASE FIYGSLHLFD PIINAEFSPQ GVALRQFTSR WEGGMVRTSG NWLRDGKMLI LDDAAIAGLE YTLPKNWQQL WMETTPGWLN SLQLKRFSAS RNLIIDIDPD FPWQLTALDG YGANLTLVTD HKWGVWSGSA NLNAAAATFN RVDVRRPSLA LTANSSTVNI SELSAFTEKG ILEATASVSQ TPQRQTHISL NGRGVPVNIL QQWGWPELPL TGDGNIQLTA SGNIQANIPL KPTVSGQLHA VNAAKQQVTQ TMNAGVVSSG EVTSTESVQ
|
| |