Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcE24377A_4159 |
Symbol | |
ID | 5587366 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | + |
Start bp | 4147081 |
End bp | 4148790 |
Gene Length | 1710 bp |
Protein Length | 569 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640927777 |
Product | AsmA family protein |
Protein accession | YP_001465137 |
Protein GI | 157159023 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 42 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAATTTA TTGGGAAGCT GCTTCTCTAC ATTCTCATCG CTCTGTTAGT GGTGATCGCT GGCCTCTATT TTCTTCTGCA AACCCGCTGG GGAGCAGAAC ATATCAGCGC ATGGGTTTCC GAGAATAGCG ACTATCATCT GGCCTTCGGG GCGATGGATC ACCGTTTTTC CGCGCCATCT CATATCGTGC TGGAGAACGT CACGTTTGGT CGTGATGGTC AGCCCGCGAC CCTGGTGGCA AAAAGTGTCG ACATTGCGCT AAGCAGTCGG CAACTGACCG AACCACGCCA TGTCGATACC ATCCTGCTGG AAAACGGGAC GCTGAATCTC ACCGACCAGA CCGCGCCGCT ACCGTTCAAA GCCGATCGTC TGCAACTGCG TGATATGGCG TTTAATAGCC CGAATAGCGA ATGGAAACTG AGCGCGCAGC GGGTAAATGG CGGCGTGGTT CCGTGGTCAC CAGAAGCCGG TAAAGTGCTG GGTACGAAGG CGCAGATTCA GTTTAGTGCC GGATCGCTTT CGCTCAATGA TGTTCCTGCC ACCAATGTAC TGATTGAAGG CAGTATTGAT AATGATCGCG TTACGCTGAC TAACCTGGGT GCCGACATCG CCCGCGGGAC ATTAACCGGA AACGCGCAGC GTAACGCCGA CGGCAGCTGG CAAGTGGAAA ACCTGCGCAT GGCGGATATA CGTCTACAAA GCGAAAAATC GCTAACCGAC TTCTTTGCGC CATTACGCTC TGTCCCGTCG TTGCAGATTG GTCGCCTGGA AGTGATCGAT GCTCGTTTGC AAGGTCCGGA CTGGGCGGTG ACCGACCTCG ATCTCAGCTT GCGCAACATG ACCTTCAGTA AAGATGACTG GCAGACACAA GAAGGCAAAC TGTCGATGAA CGCTAGAGAG TTCATTTATG GTTCGCTGCA TTTATTTGAC CCGATTATAA ACGCGGAATT TTCCCCGCAG GGCGTAGCGC TGCGCCAGTT CACCAGCCGC TGGGAAGGGG GTATGGTCAG AACGTCAGGG AACTGGCTGC GTGACGGGAA AACGTTGATC CTTGATGATG CGGCAATTGC CGGGCTGGAA TATACCTTGC CGAAAAACTG GCAACAGTTG TGGATGGAAA CGACACCCGG TTGGTTAAAC AGCCTGCAAC TGAAGAGATT TAGCGCCAGC CGCAATCTGA TCATTGATAT CGACCCTGAC TTCCCGTGGC AGCTCACCAC GCTCGATGGT TACGGTGCCA ACCTGACGCT GGTTACCGAT CATAAATGGG GCGTCTGGAG TGGCTCGGCG AATCTGAATG CCGCCGCCGC GACATTCAAT CGTGTTGATG TTCGTCGCCC GTCGCTGGCG CTGACCGCCA ACAGCAGCAC GGTGAATATC AGCGAACTGA GTGCATTTAC TGAAAAAGGC ATTCTGGAAG CCACCGCCAG TGTTTCACAA ACGCCACAAC GTCAGACACA TATCAGCCTG AATGGACGCG GTGTGCCGGT GAATATTTTG CAACAGTGGG GATGGCCTAA ATTACCGTTG ACTGGCGACG GCAATATTCA GCTTACCGCC AGTGGCGATA TTCAGGCCAA TGTCCCGTTG AAACCTACGG TTAGCGGGCA ATTGCATGCC GTGAACGCCG CAAAGCAGCA AGTGACTCAA ACCATGAATG CTGGCATCGT TTCCAGCGGT GAAGTTACAT CGACGGAGCC GGTGCGGTAA
|
Protein sequence | MKFIGKLLLY ILIALLVVIA GLYFLLQTRW GAEHISAWVS ENSDYHLAFG AMDHRFSAPS HIVLENVTFG RDGQPATLVA KSVDIALSSR QLTEPRHVDT ILLENGTLNL TDQTAPLPFK ADRLQLRDMA FNSPNSEWKL SAQRVNGGVV PWSPEAGKVL GTKAQIQFSA GSLSLNDVPA TNVLIEGSID NDRVTLTNLG ADIARGTLTG NAQRNADGSW QVENLRMADI RLQSEKSLTD FFAPLRSVPS LQIGRLEVID ARLQGPDWAV TDLDLSLRNM TFSKDDWQTQ EGKLSMNARE FIYGSLHLFD PIINAEFSPQ GVALRQFTSR WEGGMVRTSG NWLRDGKTLI LDDAAIAGLE YTLPKNWQQL WMETTPGWLN SLQLKRFSAS RNLIIDIDPD FPWQLTTLDG YGANLTLVTD HKWGVWSGSA NLNAAAATFN RVDVRRPSLA LTANSSTVNI SELSAFTEKG ILEATASVSQ TPQRQTHISL NGRGVPVNIL QQWGWPKLPL TGDGNIQLTA SGDIQANVPL KPTVSGQLHA VNAAKQQVTQ TMNAGIVSSG EVTSTEPVR
|
| |