Gene EcHS_A2671 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A2671 
Symbol 
ID5594984 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp2684965 
End bp2689926 
Gene Length4962 bp 
Protein Length1653 aa 
Translation table11 
GC content53% 
IMG OID640921787 
Productalpha-2-macroglobulin domain-containing protein 
Protein accessionYP_001459313 
Protein GI157161995 
COG category[R] General function prediction only 
COG ID[COG2373] Large extracellular alpha-helical protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAGT TACGCGTAGC CGCCTGCATG CTAATGCTGG CGCTGGCAGG GTGCGACAAC 
AACGATAACG CGCCAACAGC GGTGAAAAAA GATGCGCCTT CTGAAGTTAC TAAAGCGGCC
TCTTCAGAAA ACGCGAGTTC AGCAAAACTC TCCGCGCCAG AGCGACAAAA ACTGGCACAA
CAGAGTGCCG GTAAGGCGCT GACATTGCTG GATCTCTCTG AAGTCCAACT TGATGGTGCA
GCCACGCTGG TGCTGACGTT CTCCATCCCT CTCGACCCGG ATCAGGATTT CTCACGCGTT
ATTCATGTCG TCGATAAAAA AAGCGGCAAA GTGGATGGTG CCTGGGAGCT GTCAGATAAT
CTTAAAGAGC TGCGTTTACG CCACCTCGAA CCGAAACGTG ATTTGATCGT TACTATTGGC
AAGGAGGTCA AAGCACTCAA CAACGCAACC TTCAGTAAAG ATTACGAAAA AACTATAACT
ACCCGCGACA TCCAACCCAG CGTCGGTTTT GCCAGCCGTG GTTCGCTGCT GCCTGGTAAA
GTCGTTGAAG GGCTGCCGGT AATGGCGCTC AACGTTAATA ATGTCGATGT TAACTTCTTT
CGCGTTAAGC CAGAATCTCT GCCAGCATTC ATTAGCCAAT GGGAATACCG CAATTCGCTG
GCGAACTGGC AGTCAGACAA ACTGCTGCAG ATGGCGGATC TGGTCTACAC CGGACGGTTT
GATCTCAATC CTGCGCGTAA CACCCGTGAA AAATTATTGC TGCCGCTGGG CGATATCAAA
CCGCTTCAGC AGGCGGGCGT GTATCTGGCT GTGATGAATC AGGCTGGACG TTACGATTAC
AGTAATCCCG CGACGCTGTT TACGTTAAGT GATATCGGCG TTTCAGCTCA CCGTTATCAC
AATCGTCTTG ATATCTTTAC CCAAAGTCTG GAAAACGGCG CGGCCCAGCA AGGAATTGAA
GTCTCTTTAT TAAATGAGAA AGGGCAGACT CTGACTCAGG CAACCAGTGA CGCTCAGGGG
CATGTGCAGC TGGAAAATGA TAAAAACGCG GCATTACTGT TGGCGCGTAA AGACGGTCAG
ACAACGCTAC TCGATTTAAA ACTTCCGGCG CTGGACTTAG CAGAATTTAA CATTGCTGGC
GCGCCAGGCT ATAGCAAACA GTTTTTCATG TTTGGCCCGC GCGATCTTTA TCGCCCGGGT
GAAACGGTAA TCCTCAATGG TTTGCTGCGT GATGCAGACG GTAAAGCGTT GCCCGATCAA
CCCATCAAGT TAGACGTGAT TAAACCCGAC GGGCAGGTTC TCAGGAGCGT CGTTAGTCAG
CCGGAGAATG GCCTCTACCA CTTAACCTGG CCACTCGATA GCAATGCGGC AACCGGTATG
TGGCATATTC GCGCTAACAC GGGCGATAAC CAGTATCGGA TGTGGGATTT CCACGTCGAA
GATTTTATGC CAGAGCGTAT GGCGCTGAAT CTGACCGGTG AGAAAACCCC GCTTACGCCG
AATGATGAAG TGAAATTCTC CGTGGTGGGA TACTACCTGT ACGGTGCGCC TGCTAATGGT
AATACTTTGC AAGGGCAACT TTTCCTGCGC CCACTGCGTG AGGCTGTGTC AGCCTTACCT
GGTTTTGAAT TCGGCGATAT AGCTGCCGAA AACCTTTCCC GCACGCTGGA TGAAGTTCAG
TTGACGCTGG ATGATAAAGG GCGCGGCGAA GTTTCTACAG AAAGCCAGTG GAAGGAAACG
CATTCCCCAT TACAGGTTAT TTTCCAGGGT AGTTTGCTGG AATCGGGCGG TCGCCCGGTG
ACGCGCCGCG CTGAGCAGGC TATCTGGCCT GCCGATGCAT TGCCGGGGAT CCGTCCGCAG
TTCGCCTCGA AATCGGTTTA CGATTATCGT ACTGACAGCA CGGTGAAACA GCCCATTGTT
GATGAAGGCA GTAACGCCGC TTTTGACATC GTTTATAGCG ATGCGCAAGG CGTGAAAAAA
GCCGTGTCGG GCTTGCAGGT GCGCCTGATT CGCGAACGCC GCGATTACTA CTGGAACTGG
TCAGAAGATG AAGGCTGGCA GTCACAGTTT GATCAAAAAG ATCTGATCGA AAATGAACAA
ACTCTGGATC TGAAAGCGGA CGAAACCGGC AAGGTTAGTT TTCCGGTAGA GTGGGGTGCT
TATCGTCTGG AAGTCAAAGC GCCGAATGAA GCGGTCAGTA GTGTTCGTTT CTGGGCTGGC
TATAGCTGGC AGGACAACAG CGACGGGAGC GGTGCCGTGC GACCCGACCG TGTCACGCTG
AAACTGGATA AAGCCAGTTA TCGCCCTGGC GATACCATTA AGTTGCATAT TGCCGCGCCA
ACGGCGGGTA AAGGTTATGC GATGGTCGAG TCCAGTGAAG GGCCGCTGTG GTGGCAAGAG
ATTGATGTTC CGGCTCAAGG GCTGGATCTG ACGATTCCGG TCGATAAAAC CTGGAATCGT
CATGATCTTT ATTTGAGTAC GCTGGTGGTG CGTCCTGGCG ATAAATCTCG CTCCGCGACG
CCAAAACGCG CGGTTGGTGT GTTGCATCTG CCGCTTGGTG ATGAAAACCG TCGCCTCGAT
CTGGCGCTGG AAACACCAGC AAAAATGCGT CCCAATCAAC CATTAACCGT GAAAATTAAA
GCCAGCACTA AAAATGGCGA GAAGCCTAAA CAGGTGAATG TGCTGGTGTC TGCCGTTGAT
AGTGGTGTGC TGAATATTAC TGACTACGTC ACGCCAGATC CGTGGCAGGC GTTCTTTGGT
CAGAAACGCT ATGGCGCAGA CATTTACGAT ATTTACGGTC AGGTTATTGA AGGTCAGGGG
CGTCTGGCAG CTCTGCGTTT CGGTGGCGAT GGTGATGAGC TGAAACGTGG TGGTAAACCG
CCGGTCAATC ACGTCAATAT TGTCGCGCAG CAGGCGCTGC CGGTAACGCT CAACGAACAG
GGCGAAGGCT CGGTTACACT GCCGATTGGC GATTTTAACG GTGAATTGCG CGTCATGGCG
CAAGCCTGGA CGGCAGATGA CTTCGGTAGC AACGAAAGTA AAGTGATAGT TGCCGCACCG
GTGATTGCTG AACTGAACAT GCCGCGCTTT ATGGCGAGTG GCGATACCTC GCGTCTGACG
CTGGATATCA CTAATCTTAC CGATAAACCG CAAAAACTGA ACGTTGCCCT GACCGCCAGT
GGTTTGCTTG AACTGGTCAG CGATTCACCC GCAGCCGTTG AATTAGCGCC AGGTGTGCGT
ACTACGCTGT TTATCCCGGT GCGAGCATTG CCGGGTTATG GCGATGGAGA AATTCAGGCC
ACCATTAGCG GGTTAGCGTT ACCGGGTGAA ACCGTTGCCG ATCAGCATAA GCAGTGGAAA
ATCGGCGTCC GTCCGGCGTT CCCGGCACAA ACGGTTAATT ACGGTACGGC GTTACAGCCT
GGTGAGACAT GGGCGATTCC GGCGGATGGA TTGCAAAACT TCTCGCCTGT TACGCTGGAA
GGGCAATTGT TGTTGAGCGG CAAACCACCG CTGAACATCG CACGTTATAT CAAAGAGTTA
AAAGCGTATC CGTACGGCTG TCTTGAGCAA ACCGCCAGCG GCCTGTTTCC GTCACTTTAT
ACCAACGCAG CCCAACTGCA GGCGTTGGGC ATCAAAGGCG ACAGTGATGA GAAACGCCGT
GCATCGGTCG ATATCGGCAT TTCCCGTTTG CTGCAAATGC AACGTGATAA CGGCGGCTTT
GCGCTGTGGG ATAAAAACGG TGACGAAGAG TACTGGCTGA CGGCTTACGT GATGGATTTC
CTGGTCCGCG CAGGCGAACA GGGTTACAGC GTGCCGACAG ACGCCATTAA CCGGGGTAAT
GAGCGTCTGC TGCGCTATTT ACAAGATCCG GGCATGATGT CGATCCCGTA CGCGGATAAT
CTCAAAGCCA GTAAATTCGC CGTACAGTCT TACGCTGCGC TGGTGTTGGC CCGTCAGCAA
AAGGCTCCGC TGGGTGCGCT GCGTGAAATC TGGGAGCATC GTGCAGATGC CGCTTCTGGT
TTACCGCTGC TGCAACTTGG CGTTGCGCTG AAAACCATGG GTGATGCGAC GCGTGGTGAA
GAAGCGATTG CGCTGGCGCT GAAAACGCCG CGTAATAGTG ATGAGCGGAT ATGGCTGGGT
GATTACGGTA GTTCACTGCG CGACAACGCG TTAATGCTCT CCTTGCTGGA AGAAAATAAA
CTGCTACCCG ATGAGCAGTA CACTTTGCTG AACACACTTT CGCAGCAGGC GTTTGGTGAA
CGCTGGCTAT CGACGCAGGA AAGTAACGCG TTGTTCCTGG CTGCCCGTAC GATTCAGGAT
TTACCCGGTA AATGGCAGGC GCAAACCTCT TTCTCAGCTG AGCAGCTGAC AGGCGAGAAA
GCGCAAAACA GCAATCTGAA TAGCGATCAA CTTGTCACCT TGCAGGTGAG CAACAGTGGC
GATCAGCCGT TATGGTTGCG TATGGATGCC AGCGGTTATC CGCAATCCGC ACCTTTACCG
GCGAACAATG TGCTGCAAAT CGAGCGTCAT ATTCTTGGTA CTGATGGTAA GAGCAAATCG
CTGGACTCGT TACGTAGCGG CGATCTGGTG CTGGTGTGGT TGCAGGTAAA AGCCAGTAAC
AGCGTGCCGG ATGCGTTAGT CGTGGATCTG CTGCCTGCGG GTCTGGAACT GGAAAACCAG
AATCTGGCGA ACGGTAGCGC CAGCCTGGAG CAAAGTGGTG GCGAAGTGCA GAACTTACTG
AACCAGATGC AGCAGGCGAG CATTAAGCAC ATTGAGTTCC GTGACGATCG CTTTGTGGCG
GCGGTTGCCG TTGATGAATA CCAACCGGTA ACGCTGGTGT ATCTGGCGCG GGCGGTGACG
CCGGGAACGT ATCAGGTACC GCAACCGATG GTGGAATCAA TGTATGTTCC CCAATGGCGG
GCGACCGGCG CGGCTGAAGA TCTGCTGATT GTCAGACCGT AA
 
Protein sequence
MKKLRVAACM LMLALAGCDN NDNAPTAVKK DAPSEVTKAA SSENASSAKL SAPERQKLAQ 
QSAGKALTLL DLSEVQLDGA ATLVLTFSIP LDPDQDFSRV IHVVDKKSGK VDGAWELSDN
LKELRLRHLE PKRDLIVTIG KEVKALNNAT FSKDYEKTIT TRDIQPSVGF ASRGSLLPGK
VVEGLPVMAL NVNNVDVNFF RVKPESLPAF ISQWEYRNSL ANWQSDKLLQ MADLVYTGRF
DLNPARNTRE KLLLPLGDIK PLQQAGVYLA VMNQAGRYDY SNPATLFTLS DIGVSAHRYH
NRLDIFTQSL ENGAAQQGIE VSLLNEKGQT LTQATSDAQG HVQLENDKNA ALLLARKDGQ
TTLLDLKLPA LDLAEFNIAG APGYSKQFFM FGPRDLYRPG ETVILNGLLR DADGKALPDQ
PIKLDVIKPD GQVLRSVVSQ PENGLYHLTW PLDSNAATGM WHIRANTGDN QYRMWDFHVE
DFMPERMALN LTGEKTPLTP NDEVKFSVVG YYLYGAPANG NTLQGQLFLR PLREAVSALP
GFEFGDIAAE NLSRTLDEVQ LTLDDKGRGE VSTESQWKET HSPLQVIFQG SLLESGGRPV
TRRAEQAIWP ADALPGIRPQ FASKSVYDYR TDSTVKQPIV DEGSNAAFDI VYSDAQGVKK
AVSGLQVRLI RERRDYYWNW SEDEGWQSQF DQKDLIENEQ TLDLKADETG KVSFPVEWGA
YRLEVKAPNE AVSSVRFWAG YSWQDNSDGS GAVRPDRVTL KLDKASYRPG DTIKLHIAAP
TAGKGYAMVE SSEGPLWWQE IDVPAQGLDL TIPVDKTWNR HDLYLSTLVV RPGDKSRSAT
PKRAVGVLHL PLGDENRRLD LALETPAKMR PNQPLTVKIK ASTKNGEKPK QVNVLVSAVD
SGVLNITDYV TPDPWQAFFG QKRYGADIYD IYGQVIEGQG RLAALRFGGD GDELKRGGKP
PVNHVNIVAQ QALPVTLNEQ GEGSVTLPIG DFNGELRVMA QAWTADDFGS NESKVIVAAP
VIAELNMPRF MASGDTSRLT LDITNLTDKP QKLNVALTAS GLLELVSDSP AAVELAPGVR
TTLFIPVRAL PGYGDGEIQA TISGLALPGE TVADQHKQWK IGVRPAFPAQ TVNYGTALQP
GETWAIPADG LQNFSPVTLE GQLLLSGKPP LNIARYIKEL KAYPYGCLEQ TASGLFPSLY
TNAAQLQALG IKGDSDEKRR ASVDIGISRL LQMQRDNGGF ALWDKNGDEE YWLTAYVMDF
LVRAGEQGYS VPTDAINRGN ERLLRYLQDP GMMSIPYADN LKASKFAVQS YAALVLARQQ
KAPLGALREI WEHRADAASG LPLLQLGVAL KTMGDATRGE EAIALALKTP RNSDERIWLG
DYGSSLRDNA LMLSLLEENK LLPDEQYTLL NTLSQQAFGE RWLSTQESNA LFLAARTIQD
LPGKWQAQTS FSAEQLTGEK AQNSNLNSDQ LVTLQVSNSG DQPLWLRMDA SGYPQSAPLP
ANNVLQIERH ILGTDGKSKS LDSLRSGDLV LVWLQVKASN SVPDALVVDL LPAGLELENQ
NLANGSASLE QSGGEVQNLL NQMQQASIKH IEFRDDRFVA AVAVDEYQPV TLVYLARAVT
PGTYQVPQPM VESMYVPQWR ATGAAEDLLI VRP