Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A2671 |
Symbol | |
ID | 5594984 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 2684965 |
End bp | 2689926 |
Gene Length | 4962 bp |
Protein Length | 1653 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640921787 |
Product | alpha-2-macroglobulin domain-containing protein |
Protein accession | YP_001459313 |
Protein GI | 157161995 |
COG category | [R] General function prediction only |
COG ID | [COG2373] Large extracellular alpha-helical protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAGT TACGCGTAGC CGCCTGCATG CTAATGCTGG CGCTGGCAGG GTGCGACAAC AACGATAACG CGCCAACAGC GGTGAAAAAA GATGCGCCTT CTGAAGTTAC TAAAGCGGCC TCTTCAGAAA ACGCGAGTTC AGCAAAACTC TCCGCGCCAG AGCGACAAAA ACTGGCACAA CAGAGTGCCG GTAAGGCGCT GACATTGCTG GATCTCTCTG AAGTCCAACT TGATGGTGCA GCCACGCTGG TGCTGACGTT CTCCATCCCT CTCGACCCGG ATCAGGATTT CTCACGCGTT ATTCATGTCG TCGATAAAAA AAGCGGCAAA GTGGATGGTG CCTGGGAGCT GTCAGATAAT CTTAAAGAGC TGCGTTTACG CCACCTCGAA CCGAAACGTG ATTTGATCGT TACTATTGGC AAGGAGGTCA AAGCACTCAA CAACGCAACC TTCAGTAAAG ATTACGAAAA AACTATAACT ACCCGCGACA TCCAACCCAG CGTCGGTTTT GCCAGCCGTG GTTCGCTGCT GCCTGGTAAA GTCGTTGAAG GGCTGCCGGT AATGGCGCTC AACGTTAATA ATGTCGATGT TAACTTCTTT CGCGTTAAGC CAGAATCTCT GCCAGCATTC ATTAGCCAAT GGGAATACCG CAATTCGCTG GCGAACTGGC AGTCAGACAA ACTGCTGCAG ATGGCGGATC TGGTCTACAC CGGACGGTTT GATCTCAATC CTGCGCGTAA CACCCGTGAA AAATTATTGC TGCCGCTGGG CGATATCAAA CCGCTTCAGC AGGCGGGCGT GTATCTGGCT GTGATGAATC AGGCTGGACG TTACGATTAC AGTAATCCCG CGACGCTGTT TACGTTAAGT GATATCGGCG TTTCAGCTCA CCGTTATCAC AATCGTCTTG ATATCTTTAC CCAAAGTCTG GAAAACGGCG CGGCCCAGCA AGGAATTGAA GTCTCTTTAT TAAATGAGAA AGGGCAGACT CTGACTCAGG CAACCAGTGA CGCTCAGGGG CATGTGCAGC TGGAAAATGA TAAAAACGCG GCATTACTGT TGGCGCGTAA AGACGGTCAG ACAACGCTAC TCGATTTAAA ACTTCCGGCG CTGGACTTAG CAGAATTTAA CATTGCTGGC GCGCCAGGCT ATAGCAAACA GTTTTTCATG TTTGGCCCGC GCGATCTTTA TCGCCCGGGT GAAACGGTAA TCCTCAATGG TTTGCTGCGT GATGCAGACG GTAAAGCGTT GCCCGATCAA CCCATCAAGT TAGACGTGAT TAAACCCGAC GGGCAGGTTC TCAGGAGCGT CGTTAGTCAG CCGGAGAATG GCCTCTACCA CTTAACCTGG CCACTCGATA GCAATGCGGC AACCGGTATG TGGCATATTC GCGCTAACAC GGGCGATAAC CAGTATCGGA TGTGGGATTT CCACGTCGAA GATTTTATGC CAGAGCGTAT GGCGCTGAAT CTGACCGGTG AGAAAACCCC GCTTACGCCG AATGATGAAG TGAAATTCTC CGTGGTGGGA TACTACCTGT ACGGTGCGCC TGCTAATGGT AATACTTTGC AAGGGCAACT TTTCCTGCGC CCACTGCGTG AGGCTGTGTC AGCCTTACCT GGTTTTGAAT TCGGCGATAT AGCTGCCGAA AACCTTTCCC GCACGCTGGA TGAAGTTCAG TTGACGCTGG ATGATAAAGG GCGCGGCGAA GTTTCTACAG AAAGCCAGTG GAAGGAAACG CATTCCCCAT TACAGGTTAT TTTCCAGGGT AGTTTGCTGG AATCGGGCGG TCGCCCGGTG ACGCGCCGCG CTGAGCAGGC TATCTGGCCT GCCGATGCAT TGCCGGGGAT CCGTCCGCAG TTCGCCTCGA AATCGGTTTA CGATTATCGT ACTGACAGCA CGGTGAAACA GCCCATTGTT GATGAAGGCA GTAACGCCGC TTTTGACATC GTTTATAGCG ATGCGCAAGG CGTGAAAAAA GCCGTGTCGG GCTTGCAGGT GCGCCTGATT CGCGAACGCC GCGATTACTA CTGGAACTGG TCAGAAGATG AAGGCTGGCA GTCACAGTTT GATCAAAAAG ATCTGATCGA AAATGAACAA ACTCTGGATC TGAAAGCGGA CGAAACCGGC AAGGTTAGTT TTCCGGTAGA GTGGGGTGCT TATCGTCTGG AAGTCAAAGC GCCGAATGAA GCGGTCAGTA GTGTTCGTTT CTGGGCTGGC TATAGCTGGC AGGACAACAG CGACGGGAGC GGTGCCGTGC GACCCGACCG TGTCACGCTG AAACTGGATA AAGCCAGTTA TCGCCCTGGC GATACCATTA AGTTGCATAT TGCCGCGCCA ACGGCGGGTA AAGGTTATGC GATGGTCGAG TCCAGTGAAG GGCCGCTGTG GTGGCAAGAG ATTGATGTTC CGGCTCAAGG GCTGGATCTG ACGATTCCGG TCGATAAAAC CTGGAATCGT CATGATCTTT ATTTGAGTAC GCTGGTGGTG CGTCCTGGCG ATAAATCTCG CTCCGCGACG CCAAAACGCG CGGTTGGTGT GTTGCATCTG CCGCTTGGTG ATGAAAACCG TCGCCTCGAT CTGGCGCTGG AAACACCAGC AAAAATGCGT CCCAATCAAC CATTAACCGT GAAAATTAAA GCCAGCACTA AAAATGGCGA GAAGCCTAAA CAGGTGAATG TGCTGGTGTC TGCCGTTGAT AGTGGTGTGC TGAATATTAC TGACTACGTC ACGCCAGATC CGTGGCAGGC GTTCTTTGGT CAGAAACGCT ATGGCGCAGA CATTTACGAT ATTTACGGTC AGGTTATTGA AGGTCAGGGG CGTCTGGCAG CTCTGCGTTT CGGTGGCGAT GGTGATGAGC TGAAACGTGG TGGTAAACCG CCGGTCAATC ACGTCAATAT TGTCGCGCAG CAGGCGCTGC CGGTAACGCT CAACGAACAG GGCGAAGGCT CGGTTACACT GCCGATTGGC GATTTTAACG GTGAATTGCG CGTCATGGCG CAAGCCTGGA CGGCAGATGA CTTCGGTAGC AACGAAAGTA AAGTGATAGT TGCCGCACCG GTGATTGCTG AACTGAACAT GCCGCGCTTT ATGGCGAGTG GCGATACCTC GCGTCTGACG CTGGATATCA CTAATCTTAC CGATAAACCG CAAAAACTGA ACGTTGCCCT GACCGCCAGT GGTTTGCTTG AACTGGTCAG CGATTCACCC GCAGCCGTTG AATTAGCGCC AGGTGTGCGT ACTACGCTGT TTATCCCGGT GCGAGCATTG CCGGGTTATG GCGATGGAGA AATTCAGGCC ACCATTAGCG GGTTAGCGTT ACCGGGTGAA ACCGTTGCCG ATCAGCATAA GCAGTGGAAA ATCGGCGTCC GTCCGGCGTT CCCGGCACAA ACGGTTAATT ACGGTACGGC GTTACAGCCT GGTGAGACAT GGGCGATTCC GGCGGATGGA TTGCAAAACT TCTCGCCTGT TACGCTGGAA GGGCAATTGT TGTTGAGCGG CAAACCACCG CTGAACATCG CACGTTATAT CAAAGAGTTA AAAGCGTATC CGTACGGCTG TCTTGAGCAA ACCGCCAGCG GCCTGTTTCC GTCACTTTAT ACCAACGCAG CCCAACTGCA GGCGTTGGGC ATCAAAGGCG ACAGTGATGA GAAACGCCGT GCATCGGTCG ATATCGGCAT TTCCCGTTTG CTGCAAATGC AACGTGATAA CGGCGGCTTT GCGCTGTGGG ATAAAAACGG TGACGAAGAG TACTGGCTGA CGGCTTACGT GATGGATTTC CTGGTCCGCG CAGGCGAACA GGGTTACAGC GTGCCGACAG ACGCCATTAA CCGGGGTAAT GAGCGTCTGC TGCGCTATTT ACAAGATCCG GGCATGATGT CGATCCCGTA CGCGGATAAT CTCAAAGCCA GTAAATTCGC CGTACAGTCT TACGCTGCGC TGGTGTTGGC CCGTCAGCAA AAGGCTCCGC TGGGTGCGCT GCGTGAAATC TGGGAGCATC GTGCAGATGC CGCTTCTGGT TTACCGCTGC TGCAACTTGG CGTTGCGCTG AAAACCATGG GTGATGCGAC GCGTGGTGAA GAAGCGATTG CGCTGGCGCT GAAAACGCCG CGTAATAGTG ATGAGCGGAT ATGGCTGGGT GATTACGGTA GTTCACTGCG CGACAACGCG TTAATGCTCT CCTTGCTGGA AGAAAATAAA CTGCTACCCG ATGAGCAGTA CACTTTGCTG AACACACTTT CGCAGCAGGC GTTTGGTGAA CGCTGGCTAT CGACGCAGGA AAGTAACGCG TTGTTCCTGG CTGCCCGTAC GATTCAGGAT TTACCCGGTA AATGGCAGGC GCAAACCTCT TTCTCAGCTG AGCAGCTGAC AGGCGAGAAA GCGCAAAACA GCAATCTGAA TAGCGATCAA CTTGTCACCT TGCAGGTGAG CAACAGTGGC GATCAGCCGT TATGGTTGCG TATGGATGCC AGCGGTTATC CGCAATCCGC ACCTTTACCG GCGAACAATG TGCTGCAAAT CGAGCGTCAT ATTCTTGGTA CTGATGGTAA GAGCAAATCG CTGGACTCGT TACGTAGCGG CGATCTGGTG CTGGTGTGGT TGCAGGTAAA AGCCAGTAAC AGCGTGCCGG ATGCGTTAGT CGTGGATCTG CTGCCTGCGG GTCTGGAACT GGAAAACCAG AATCTGGCGA ACGGTAGCGC CAGCCTGGAG CAAAGTGGTG GCGAAGTGCA GAACTTACTG AACCAGATGC AGCAGGCGAG CATTAAGCAC ATTGAGTTCC GTGACGATCG CTTTGTGGCG GCGGTTGCCG TTGATGAATA CCAACCGGTA ACGCTGGTGT ATCTGGCGCG GGCGGTGACG CCGGGAACGT ATCAGGTACC GCAACCGATG GTGGAATCAA TGTATGTTCC CCAATGGCGG GCGACCGGCG CGGCTGAAGA TCTGCTGATT GTCAGACCGT AA
|
Protein sequence | MKKLRVAACM LMLALAGCDN NDNAPTAVKK DAPSEVTKAA SSENASSAKL SAPERQKLAQ QSAGKALTLL DLSEVQLDGA ATLVLTFSIP LDPDQDFSRV IHVVDKKSGK VDGAWELSDN LKELRLRHLE PKRDLIVTIG KEVKALNNAT FSKDYEKTIT TRDIQPSVGF ASRGSLLPGK VVEGLPVMAL NVNNVDVNFF RVKPESLPAF ISQWEYRNSL ANWQSDKLLQ MADLVYTGRF DLNPARNTRE KLLLPLGDIK PLQQAGVYLA VMNQAGRYDY SNPATLFTLS DIGVSAHRYH NRLDIFTQSL ENGAAQQGIE VSLLNEKGQT LTQATSDAQG HVQLENDKNA ALLLARKDGQ TTLLDLKLPA LDLAEFNIAG APGYSKQFFM FGPRDLYRPG ETVILNGLLR DADGKALPDQ PIKLDVIKPD GQVLRSVVSQ PENGLYHLTW PLDSNAATGM WHIRANTGDN QYRMWDFHVE DFMPERMALN LTGEKTPLTP NDEVKFSVVG YYLYGAPANG NTLQGQLFLR PLREAVSALP GFEFGDIAAE NLSRTLDEVQ LTLDDKGRGE VSTESQWKET HSPLQVIFQG SLLESGGRPV TRRAEQAIWP ADALPGIRPQ FASKSVYDYR TDSTVKQPIV DEGSNAAFDI VYSDAQGVKK AVSGLQVRLI RERRDYYWNW SEDEGWQSQF DQKDLIENEQ TLDLKADETG KVSFPVEWGA YRLEVKAPNE AVSSVRFWAG YSWQDNSDGS GAVRPDRVTL KLDKASYRPG DTIKLHIAAP TAGKGYAMVE SSEGPLWWQE IDVPAQGLDL TIPVDKTWNR HDLYLSTLVV RPGDKSRSAT PKRAVGVLHL PLGDENRRLD LALETPAKMR PNQPLTVKIK ASTKNGEKPK QVNVLVSAVD SGVLNITDYV TPDPWQAFFG QKRYGADIYD IYGQVIEGQG RLAALRFGGD GDELKRGGKP PVNHVNIVAQ QALPVTLNEQ GEGSVTLPIG DFNGELRVMA QAWTADDFGS NESKVIVAAP VIAELNMPRF MASGDTSRLT LDITNLTDKP QKLNVALTAS GLLELVSDSP AAVELAPGVR TTLFIPVRAL PGYGDGEIQA TISGLALPGE TVADQHKQWK IGVRPAFPAQ TVNYGTALQP GETWAIPADG LQNFSPVTLE GQLLLSGKPP LNIARYIKEL KAYPYGCLEQ TASGLFPSLY TNAAQLQALG IKGDSDEKRR ASVDIGISRL LQMQRDNGGF ALWDKNGDEE YWLTAYVMDF LVRAGEQGYS VPTDAINRGN ERLLRYLQDP GMMSIPYADN LKASKFAVQS YAALVLARQQ KAPLGALREI WEHRADAASG LPLLQLGVAL KTMGDATRGE EAIALALKTP RNSDERIWLG DYGSSLRDNA LMLSLLEENK LLPDEQYTLL NTLSQQAFGE RWLSTQESNA LFLAARTIQD LPGKWQAQTS FSAEQLTGEK AQNSNLNSDQ LVTLQVSNSG DQPLWLRMDA SGYPQSAPLP ANNVLQIERH ILGTDGKSKS LDSLRSGDLV LVWLQVKASN SVPDALVVDL LPAGLELENQ NLANGSASLE QSGGEVQNLL NQMQQASIKH IEFRDDRFVA AVAVDEYQPV TLVYLARAVT PGTYQVPQPM VESMYVPQWR ATGAAEDLLI VRP
|
| |