Gene Information |
Locus tag | EcHS_A1267 |
Symbol | |
ID | 5595496 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 1268050 |
End bp | 1269570 |
Gene Length | 1521 bp |
Protein Length | 506 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 640920427 |
Product | hypothetical protein |
Protein accession | YP_001457989 |
Protein GI | 157160671 |
COG category | [M] Cell wall/membrane/envelope biogenesis [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3468] Type V secretory pathway, adhesin AidA |
TIGRFAM ID | |
|
|
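The length fields above follow from the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. Both assumptions are consistent with the values in this record, but the record does not state them. The function names `gene_length` and `protein_length` are illustrative only.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp, has_stop_codon=True):
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default,
    that the final codon is a stop codon that is not translated.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


length = gene_length(1268050, 1269570)
print(length)                  # 1521
print(protein_length(length))  # 506
```

Under these assumptions, 1269570 - 1268050 + 1 gives 1521 bp, and 1521 / 3 - 1 gives 506 aa, matching the Gene Length and Protein Length fields.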
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 0.0117469 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACTGA AAAAACTCTC CGGGTTTAGT TTGGGACTTA TTGCTCTGGC GGTGGGTAAT GCATATGCAA CACAATTGTT GGATGATTAT AGTATAATTT CCTATATGAC TGATGAAGAA TCGCCGATTG AAATCAAAGA TAATAATCCG ATAAGTAATG GAGAGTATCT AACCACTGAA GACGAAAGCC ATGCTGTGAA AGTGGATGAC GGTGTAACTG GATATATAAA TAATGCCAGT GTGATGACTA GTGGTGATGG ATCTTATGGT ATTTCTGTTG ATAGTCAAAA CAAAGTATTA TATATAAGCG ATAGCGATAT TAAGACCTCT GGAAGCGTAT CTGACAAAGA AAATGGAGGG ATAACAGCCA GCGCAGTAGT CAGTGAATTT GGTGGCACCA TCTTTATGAA TGGTGATAAT TCAGTCGAGT CGGGTGGGGC ATATTCAGCG GGACTTTTAA GCCAGGTTAA TGATTCTGAA AAGATGGTAA ATAACACCCG TCTTGAAACC ACAGATAAAA CGAACATTGT TACCTCTGGG GAAAATGCAG TAGGTGTTCT TGCATGTTCA AGTCCTGGAG AGTCTCGAAC ATGTGTCGAT GCTGTAGATG ATGAAGTTAG TGATTCTAAC AGTTACGAAG TTATTAGCCG TGCTGATTTA AAAATGAATG GTGGTTCCAT AACAACTAAT GGCATTAATA GCTATGGTGC TTATGCTAAT GGGAAAAAAG CATATATTAA TTTAGATTAT GTGGCACTTG AAACTGTGGC TGATGGAAGT TATGCAGTTG CTATTCGACA AGGTAACATT GATATAAAAA ATAGTTCTAT TACAACAACA GGCACTAAAG CCCCCATTGC AAAAATATAC AATGGTGGAG AGTTATTTTT TTCCAATGTC ACCGCGGTAT CAAAACAAGA TAAAGGAATA TCAATTGATG CATCAAATAT CGATTCTCAA GCCAAAATAG CACTATTAAG TGTTGAACTT TCAAGTGCTT TGGATAGTAT TGATGTTAAC AAAACTACAA CGGATGTAAG TATCCTTAAT CGAAGTATTA TCACACCTGG TAATAATGTT CTGGTTAATA ATACTGGAGG TGACTTAAAC ATAATTTCGT CCGACTCTAT TCTAAATGGA GCGACTAAAC TCGTCAGCGG CACAACCACG CTGAAGCTTT CAGAAAATAC AATCTGGAAT ATGAAAGATG ACTCCGTTGT TACCCATCTG ACTAATTCAG ACAGTATTAT CAATCTTTCG TATGATGATG GTCAAACATT TACCCAAGGA AAAACATTAA CCGTAAAAGG TAATTATGTC GGTAATAATG GTCAGCTTAA TATCCGCACC GTATTAGGTG ATGATAAATC GGCTACGGAC AGACTTATTG TTGAGGGTAA TACTTCGGGT TCAACTACCG TCTATGTGAA AAATGCTGGA GGAAGCGGCG CGGCCACGCT AAACGGGATC GAACTCATAA CTGTGAATGG CGATGAATCT CCAGCAGATG CCTTCAGATA A
|
Protein sequence | MKLKKLSGFS LGLIALAVGN AYATQLLDDY SIISYMTDEE SPIEIKDNNP ISNGEYLTTE DESHAVKVDD GVTGYINNAS VMTSGDGSYG ISVDSQNKVL YISDSDIKTS GSVSDKENGG ITASAVVSEF GGTIFMNGDN SVESGGAYSA GLLSQVNDSE KMVNNTRLET TDKTNIVTSG ENAVGVLACS SPGESRTCVD AVDDEVSDSN SYEVISRADL KMNGGSITTN GINSYGAYAN GKKAYINLDY VALETVADGS YAVAIRQGNI DIKNSSITTT GTKAPIAKIY NGGELFFSNV TAVSKQDKGI SIDASNIDSQ AKIALLSVEL SSALDSIDVN KTTTDVSILN RSIITPGNNV LVNNTGGDLN IISSDSILNG ATKLVSGTTT LKLSENTIWN MKDDSVVTHL TNSDSIINLS YDDGQTFTQG KTLTVKGNYV GNNGQLNIRT VLGDDKSATD RLIVEGNTSG STTVYVKNAG GSGAATLNGI ELITVNGDES PADAFR
|
| |
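The relationship between the Gene sequence and Protein sequence fields can be checked with a short script. This is a minimal sketch, not the database's own pipeline: it builds the codon assignments of translation table 11, which are the same as those of the standard code (table 11 differs only in which codons may act as start codons), and it strips the spaces that separate the 10-character blocks. The helper names `clean`, `gc_content` and `translate` are hypothetical.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove whitespace between sequence blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(seq.count(base) for base in "GC") / len(seq)


def translate(seq):
    """Translate a forward-strand CDS, stopping at the first stop codon.

    Codons with unrecognised characters are reported as 'X'.
    A trailing partial codon is ignored.
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGAAACTGA AAAAACTCTC CGGGTTTAGT"))  # MKLKKLSGFS
```

Pasting the full Gene sequence value into `translate` should reproduce the Protein sequence field, and `gc_content` on the same input can be compared with the GC content field. Because this gene is on the + strand and begins with ATG, no reverse complementing or alternative start codon handling is needed here.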