Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A3866 |
Symbol | |
ID | 5593012 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 3860040 |
End bp | 3861749 |
Gene Length | 1710 bp |
Protein Length | 569 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640922976 |
Product | AsmA family protein |
Protein accession | YP_001460454 |
Protein GI | 157163136 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 60 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAATTTA TTGGGAAGCT GCTTCTCTAC ATTCTCATCG CTCTGTTAGT GGCGATCGCT GGCCTCTATT TTCTTCTGCA AACCCGCTGG GGAGCAGAAC ATATCAGCGC ATGGGTTTCC GAGAATAGCG ACTATCATCT GGCCTTCGGG GCGATGGATC ACCGTTTTTC CGCGCCATCT CATATCGTGC TGGAGAACGT CACGTTTGGT CGTGATGGCC AGCCCGCGAC CCTGGTGGCC AAAAGTGTCG ACATTGCGCT AAGCAGTCGG CAACTGACCG AACCACGCCA TGTCGATACC ATCCTGCTGG AAAACGGGAC GCTGAATCTC ACCGACCAGA CCGCGCCGCT ACCGTTCAAA GCCGATCGTC TGCAACTGCG TGATATGGCG TTTAATAGCC CGAATAGCGA ATGGAAACTG AGCGCGCAGC GGGTAAATGG CGGCGTAGTT CCGTGGTCAC CAAAAGCCGG TAAAGTGCTG GGTACGAAGG CGCAGATTCA GTTTAGTGCC GGATCGCTTT CGCTCAATGA TGTTCCTGCC ACCAATGTAC TGATTGAAGG CAGTATTGAT AACGATCGCG TTACGCTGAC TAACCTGGGT GCCGACATCG CGCGCGGGAC ATTAACCGGA AACGCGCAGC GTAACGCCGA CGGCAGCTGG CAAGTGGAAA ACCTGCGCAT GGCGGATATC CGTCTACAAA GCGAAAAATC GCTAACCGAC TTCTTTGCGC CATTACGCTC TGTCCCGTCG TTGCAGATTG GTCGCCTGGA AGTGATCGAT GCTCGTTTGC AAGGTCCGGA CTGGGCGGTG ACCGACCTCG ATCTCAGCTT GCGCAACATG ACCTTCAGTA AAGATGACTG GCAGACACAA GAAGGCAAAC TGTCGATGAA CGCTAGCGAG TTCATTTATG GTTCGCTGCA TTTATTTGAC CCGATTATAA ACGCGGAATT TTCCCCGCAG GGCGTAGCGC TGCGCCAGTT CACCAGCCGC TGGGAAGGGG GTATGGTCAG AACGTCAGGG AACTGGCTGC GTGACGGGAA AACGTTGATC CTTGATGATG CGGCAATTGC CGGGCTGGAA TATACCTTGC CGAAAAACTG GCAACAGTTG TGGATGGAAA CGACACCCGG TTGGTTAAAC AGCCTGCAAC TGAAGAGATT TAGCGCCAGC CGCAATCTGA TCATTGATAT CGACCCTGAC TTCCCGTGGC AGCTCACCAC GCTCGATGGT TACGGTGCCA ACCTGACGCT GGTTACCGAT CATAAATGGG GCGTCTGGAG TGGCTCGGCG AATCTGAATG CCGCCGCCGC GACATTCAAT CGTGTTGATG TTCGTCGCCC GTCGCTGGCG CTGACCGCCA ACAGCAGCAC GGTGAATATC AGCGAACTGA GTGCATTTAC TGAAAAAGGC ATTCTGGAAG CCACTGCCAG TGTTTCACAA ACGCCACAAC GTCAGACCCA TATCAGCCTG AATGGACGCG GTGTGCCGGT GAATATTTTG CAACAATGGG GATGGCCTGA ATTACCGTTG ACTGGCGACG GCAATATTCA GCTTACCGCC AGTGGAGATA TTCAGGCCAA TGCCCCGCTG AAACCTACGG TTAGCGGGCA ATTGCATGCC GTGAACGCCG CAAAGCAGCA AGTGACTCAA ACCATGAATG CTGGCATCGT TTCCAGCGGT GAAGTTACAT CGACGGAGCC GGTGCGGTAA
|
Protein sequence | MKFIGKLLLY ILIALLVAIA GLYFLLQTRW GAEHISAWVS ENSDYHLAFG AMDHRFSAPS HIVLENVTFG RDGQPATLVA KSVDIALSSR QLTEPRHVDT ILLENGTLNL TDQTAPLPFK ADRLQLRDMA FNSPNSEWKL SAQRVNGGVV PWSPKAGKVL GTKAQIQFSA GSLSLNDVPA TNVLIEGSID NDRVTLTNLG ADIARGTLTG NAQRNADGSW QVENLRMADI RLQSEKSLTD FFAPLRSVPS LQIGRLEVID ARLQGPDWAV TDLDLSLRNM TFSKDDWQTQ EGKLSMNASE FIYGSLHLFD PIINAEFSPQ GVALRQFTSR WEGGMVRTSG NWLRDGKTLI LDDAAIAGLE YTLPKNWQQL WMETTPGWLN SLQLKRFSAS RNLIIDIDPD FPWQLTTLDG YGANLTLVTD HKWGVWSGSA NLNAAAATFN RVDVRRPSLA LTANSSTVNI SELSAFTEKG ILEATASVSQ TPQRQTHISL NGRGVPVNIL QQWGWPELPL TGDGNIQLTA SGDIQANAPL KPTVSGQLHA VNAAKQQVTQ TMNAGIVSSG EVTSTEPVR
|
| |