Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcE24377A_2804 |
Symbol | |
ID | 5589744 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | - |
Start bp | 2797724 |
End bp | 2802685 |
Gene Length | 4962 bp |
Protein Length | 1653 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640926455 |
Product | alpha-2-macroglobulin domain-containing protein |
Protein accession | YP_001463842 |
Protein GI | 157154935 |
COG category | [R] General function prediction only |
COG ID | [COG2373] Large extracellular alpha-helical protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAGT TACGCGTAGC CGCCTGCATG CTAATGCTGG CGCTGGCAGG GTGCGACAAC AACGATAACG CGCCAACAGC GGTGAAAAAA GATGCGCCTT CTGAAGTTAC TAAAGCGGCC TCTTCAGAAA ACGCGAGTTC AGCAAAACTC TCCGTGCCGG AGAGACAAAA ACTGGCCCAA CAGAGTGCCG GTAAGGTGCT GACATTGCTG GATCTCTCTG AAGTCCAACT TGATGGTGCA GCCACGCTGG TGCTGACGTT CTCCATCCCT CTCGACCCGG ATCAGGATTT CTCACGCGTT ATTCATGTCG TCGATAAAAA AAGCGGCAAA GTGGATGGTG CCTGGGAGCT GTCAGATAAT CTTAAAGAGC TGCGTTTACG CCACCTCGAA CCGAAACGTG ATTTGATCGT TACTATTGGC AAGGAGGTCA AAGCACTCAA CAACGCAACC TTCAGTAAAG ATTACGAAAA AACTATAACT ACCCGCGACA TCCAACCCAG CGTCGGTTTT GCCAGCCGTG GTTCGCTGCT GCCTGGTAAA GTCGTTGAAG GGCTGCCGGT AATGGCGCTC AACGTTAATA ATGTCGATGT TAACTTCTTC CGCGTTAAGC CAGAATCTCT GCCAGCATTC ATTAGCCAAT GGGAATACCG CAATTCGCTG GCGAACTGGC AGTCAGACAA ACTGCTGCAG ATGGCGGATC TGGTCTACAC CGGACGGTTT GATCTCAATC CTGCGCGTAA CACCCGTGAA AAATTATTGC TGCCGCTGGG CGATATCAAA CCGCTTCAGC AGGCGGGCGT GTATCTGGCT GTGATGAATC AGGCTGGACG TTACGATTAC AGTAATCCCG CGACGCTGTT TACGTTAAGT GATATCGGCG TTTCAGCTCA CCGTTATCAC AATCGTCTGG ATATCTTTAC CCAAAGTCTG GAAAACGGCG CGGCCCAGCA AGGAATTGAA ATCTCTTTAT TAAATGAGAA AGGGCAGACT CTGACTCAGG CAACCAGTGA CGCTCAGGGG CATGTGCAGC TGGAAAATGA TAAAAACGCG GCATTACTGT TGGCGCGTAA AGACGGTCAG ACAACGCTAC TCGATTTAAA ACTTCCGGCG CTGGACTTAG CAGAATTTAA CATTGCTGGC GCGCCAGGCT ATAGCAAACA GTTTTTCATG TTTGGCCCAC GCGATCTTTA TCGCCCGGGT GAAACGGTAA TCCTCAATGG TTTGCTGCGT GATGCAGACG GTAAAGCGTT GCCCAATCAA CCCATCAAGT TAGACGTGAT TAAACCCGAT GGGCAGGTAC TCAGGAGCGT CGTTAGTCAG CCGGAAAATG GCCTCTACCA CTTTACCTGG CCACTCGATA GCAATGCGGC AACCGGTATG TGGCATATTC GCGCTAACAC GGGCGATAAC CAGTATCGGA TGTGGGATTT CCACGTCGAA GATTTTATGC CAGAGCGCAT GGCGCTGAAT CTGACCGGTG AGAAAACCCC GCTAACGCCG AAAGATGAAG TGAAATTCTC CGTGGTGGGG TACTACCTGT ATGGTGCACC TGCTAATGGT AATACTTTGC AAGGGCAACT TTTCCTGCGC CCACTGCGTG AAGCTGTGTC AGCCTTACCT GGTTTTGAAT TCGGCGATAT AGCTGCCGAA AATCTTTCCC GCACGCTGGA TGAAGTTCAG TTGACGCTGG ATGATAAAGG GCGCGGCGAA GTTTCTACAG AAAGCCAGTG GAAGGAAACG CATTCCCCAT TACAGGTTAT TTTCCAGGGT AGTTTGCTGG AATCGGGCGG TCGCCCGGTG ACGCGCCGCG CTGAGCAGGC TATCTGGCCT GCCGATGCAT TGCCGGGGAT CCGTCCGCAG TTCGCCTCGA AATCGGTTTA CGATTATCGC ACTGACAGCA CGGTGAAACA GCCCATTGTT GATGAAGGCA GTAACGCCGG TTTTGACATC GTTTATAGCG ATGCGCAAGG CGTGAAAAAA GCCGTGTCGG GCTTGCAGGT GCGCCTGATT CGCGAACGCC GCGATTACTA CTGGAACTGG TCAGAAGATG AAGGCTGGCA GTCACAGTTT GATCAAAAAG ATCTGATCGA AAATGAACAA ACTCTGGATC TTCAGGCGGA TGAAACCGGT AAGGTCAGTT TCCCGGTAGA GTGGGGCGCT TATCGTCTGG AAGTCAAAGC GCCGAATGAA GCGGTCAGTA GCGTGCGTTT CTGGGCTGGC TATAGCTGGC AGGACAACAG CGACGGGAGC GGTGCCGTGC GACCCGACCG TGTCACGCTG AAACTGGATA AAGCCAGTTA TCGCCCTGGC GATACCATTA AGTTGCATAT TGCCGCGCCA ACGGCGGGTA AAGGTTATGC GATGGTCGAG TCCAGTGAAG GGCCGTTGTG GTGGCAAGAG ATTGATGTTC CGGCTCAAGG GCTGGATCTG ACGATTCCGG TCGATAAAAC CTGGAATCGT CATGATCTGT ATTTAAGTAC GCTGGTAGTA CGTCCTGGCG ATAAATCTCG CTCCGCGACG CCAAAACGCG CAGTTGGGGT GTTGCATCTG CCGCTTGGCG ATGAAAACCG TCGCCTCGAT CTGGCGCTGG AAACACCAGC AAAAATGCGG CCGAATCAAC CATTAACCGT GAAAATTAAA GCCAGCACTA AAAATGGCGA GAAGCCTAAA CAGGTGAATG TGCTGGTGTC TGCCGTTGAT AGTGGTGTGC TGAATATTAC TGACTACGTC ACGCCAGATC CGTGGCAGGC GTTCTTTGGT CAGAAACGCT ATGGCGCAGA CATTTACGAT ATTTACGGTC AGGTTATTGA AGGTCAGGGG CGTCTGGCAG CTCTGCGTTT CGGTGGCGAT GGTGATGAGC TGAAACGTGG TGGTAAACCG CCGGTCAATC ACGTCAATAT TGTCGCGCAG CAGGCGCTGC CGGTAACGCT CAACGAACAG GGCGAAGGCT CGGTTACACT GCCGATTGGC GATTTTAACG GTGAATTACG CGTCATGGCG CAAGCCTGGA CGGCAGATGA CTTCGGTAGC AACGAAAGCA AAGTGATAGT TGCCGCACCG GTGATTGCTG AACTGAACAT GCCGCGCTTT ATGGCGAGTG GCGATACCTC GCGTCTGACG CTGGATATCA CTAATCTTAC CGATAAACCG CAAAAACTGA ACGTTGCCCT GACCGCCAGT GGTTTGCTTG AACTGGTCAG CGATTCACCC GCAGCCGTTG AATTAGCGCC AGGTGTGCGT ACTACGCTGT TTATCCCGGT GCGAGCATTG CCGGGTTATG GCGATGGAGA AATTCAGGCC ACCATTAGCG GGTTAGCGTT ACCGGGTGAA ACCGTTGCCG ATCAGCATAA GCAGTGGAAA ATCGGCGTCC GTCCGGCGTT CCCGGCACAA ACGGTTAATT ACGGTACGGC GTTACAGCCT GGTGAGACAT GGGCGATTCC GGCGGATGGA TTGCAAAACT TCTCGCCTGT TACGCTGGAA GGGCAATTGT TGTTGAGCGG CAAACCACCG CTGAACATTG CACGTTATAT CAAAGAGTTA AAAGCGTATC CGTACGGCTG TCTTGAGCAA ACCGCCAGCG GCCTGTTCCC GTCACTTTAT ACCAACGCAG CCCAACTGCA GGCGTTGGGC ATCAAAGGCG ACAGTGATGA GAAACGCCGT GCATCGGTCG ATATCGGCAT TTCCCGTTTG CTGCAAATGC AACGTGATAA CGGCGGCTTT GCGCTGTGGG ATAAAAACGG TGACGAAGAG TACTGGCTGA CGGCTTACGT GATGGATTTC CTGGTCCGCG CAGGCGAACA GGGTTACAGC GTGCCGACAG ACGCCATTAA CCGGGGTAAT GAGCGTCTGC TGCGCTATTT ACAAGATCCG GGCATGATGT CGATCCCGTA CGCGGATAAT CTCAAAGCCA GTAAATTCGC CGTACAGTCT TACGCTGCGC TGGTGTTGGC CCGTCAACAA AAGGCTCCGC TGGGTGCGCT GCGTGAAATC TGGGAGCATC GTGCAGATGC CGCTTCTGGT TTACCGCTGC TGCAACTTGG CGTTGCGCTG AAAACCATGG GTGATGCAAC GCGTGGTGAA GAAGCGATTG TGCTGGCGCT GAAAACGCCG AGAAATAGTG ATGAGCGGAT ATGGCTGGGT GATTACGGTA GTCCACTGCG CGACAACGCG TTAATGCTCT CCTTGCTGGA AGAAAATAAA CTGCTACCCG ATGAGCAGTA CACTTTGCTG AACACACTTT CGCAGCAGGC GTTTGGTGAA CGCTGGCTAT CGACGCAGGA AAGTAACGCG GTGTTCCTGG CTGCCCGTAC GATTCAGGAT TTACCGGGTA AATGGCAGGC GCAAACCTCT TTCTCAGCTG AGCCGCTGAC AGGCGAGAAA ACGCTAAACA GCAATCTGAA TAGCGATCAA CTTGCCACCT TGCAGGTGAG AAACAGTGGC GATCAGCCGT TATGGTTGCG TATGGATGCC AGCGGTTATC CGCAATCCGC ACCTTTACCG GCGAACAATG TGCTGCAAAT CGAGCGTCAT ATTCTTGGTA CTGATGGTAA GAGCAAATCG CTGGACTCGT TACGTAGCGG CGATCTGGTG CTGGTCTGGT TGCAGGTAAA AGCCAGTAAC AGCGTGCCGG ATGCGTTAGT CGTGGATCTG CTGCCTGCGG GTCTGGAACT GGAAAATCAG AATCTGGCGA ACGGTAGCGC CAGCCTGGAG CAAAGTGGTG GCGAAGTGCA AAACTTACTG AACCAGATGC AGCAGGCGAG CATTAAGCAC ATTGAGTTCC GTGACGATCG CTTTGTGGCG GCGGTTGCCG TTGATGAATA CCAACCGGTA ACGCTGGTGT ATCTGGCGCG GGCGGTGACG CCGGGAACGT ATCAGGTACC GCAACCGATG GTGGAATCAA TGTATGTTCC CCAATGGCGG GCGACCGGCG CGGCTGAAGA TCTGCTGATT GTCCGACCGT AA
|
Protein sequence | MKKLRVAACM LMLALAGCDN NDNAPTAVKK DAPSEVTKAA SSENASSAKL SVPERQKLAQ QSAGKVLTLL DLSEVQLDGA ATLVLTFSIP LDPDQDFSRV IHVVDKKSGK VDGAWELSDN LKELRLRHLE PKRDLIVTIG KEVKALNNAT FSKDYEKTIT TRDIQPSVGF ASRGSLLPGK VVEGLPVMAL NVNNVDVNFF RVKPESLPAF ISQWEYRNSL ANWQSDKLLQ MADLVYTGRF DLNPARNTRE KLLLPLGDIK PLQQAGVYLA VMNQAGRYDY SNPATLFTLS DIGVSAHRYH NRLDIFTQSL ENGAAQQGIE ISLLNEKGQT LTQATSDAQG HVQLENDKNA ALLLARKDGQ TTLLDLKLPA LDLAEFNIAG APGYSKQFFM FGPRDLYRPG ETVILNGLLR DADGKALPNQ PIKLDVIKPD GQVLRSVVSQ PENGLYHFTW PLDSNAATGM WHIRANTGDN QYRMWDFHVE DFMPERMALN LTGEKTPLTP KDEVKFSVVG YYLYGAPANG NTLQGQLFLR PLREAVSALP GFEFGDIAAE NLSRTLDEVQ LTLDDKGRGE VSTESQWKET HSPLQVIFQG SLLESGGRPV TRRAEQAIWP ADALPGIRPQ FASKSVYDYR TDSTVKQPIV DEGSNAGFDI VYSDAQGVKK AVSGLQVRLI RERRDYYWNW SEDEGWQSQF DQKDLIENEQ TLDLQADETG KVSFPVEWGA YRLEVKAPNE AVSSVRFWAG YSWQDNSDGS GAVRPDRVTL KLDKASYRPG DTIKLHIAAP TAGKGYAMVE SSEGPLWWQE IDVPAQGLDL TIPVDKTWNR HDLYLSTLVV RPGDKSRSAT PKRAVGVLHL PLGDENRRLD LALETPAKMR PNQPLTVKIK ASTKNGEKPK QVNVLVSAVD SGVLNITDYV TPDPWQAFFG QKRYGADIYD IYGQVIEGQG RLAALRFGGD GDELKRGGKP PVNHVNIVAQ QALPVTLNEQ GEGSVTLPIG DFNGELRVMA QAWTADDFGS NESKVIVAAP VIAELNMPRF MASGDTSRLT LDITNLTDKP QKLNVALTAS GLLELVSDSP AAVELAPGVR TTLFIPVRAL PGYGDGEIQA TISGLALPGE TVADQHKQWK IGVRPAFPAQ TVNYGTALQP GETWAIPADG LQNFSPVTLE GQLLLSGKPP LNIARYIKEL KAYPYGCLEQ TASGLFPSLY TNAAQLQALG IKGDSDEKRR ASVDIGISRL LQMQRDNGGF ALWDKNGDEE YWLTAYVMDF LVRAGEQGYS VPTDAINRGN ERLLRYLQDP GMMSIPYADN LKASKFAVQS YAALVLARQQ KAPLGALREI WEHRADAASG LPLLQLGVAL KTMGDATRGE EAIVLALKTP RNSDERIWLG DYGSPLRDNA LMLSLLEENK LLPDEQYTLL NTLSQQAFGE RWLSTQESNA VFLAARTIQD LPGKWQAQTS FSAEPLTGEK TLNSNLNSDQ LATLQVRNSG DQPLWLRMDA SGYPQSAPLP ANNVLQIERH ILGTDGKSKS LDSLRSGDLV LVWLQVKASN SVPDALVVDL LPAGLELENQ NLANGSASLE QSGGEVQNLL NQMQQASIKH IEFRDDRFVA AVAVDEYQPV TLVYLARAVT PGTYQVPQPM VESMYVPQWR ATGAAEDLLI VRP
|
| |