Gene EcHS_A2368 details

Gene Information

Locus tag: EcHS_A2368
Symbol:
ID: 5594535
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 2370780
End bp: 2375294
Gene Length: 4515 bp
Protein Length: 1504 aa
Translation table: 11
GC content: 54%
IMG OID: 640921495
Product: alpha-2-macroglobulin family protein
Protein accession: YP_001459029
Protein GI: 157161711
COG category: [R] General function prediction only
COG ID: [COG2373] Large extracellular alpha-helical protein
TIGRFAM ID:
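
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the coordinates are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Start bp and End bp fields of the record.
start_bp = 2370780
end_bp = 2375294

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length)     # 4515
print(protein_length)  # 1504
```

Both values agree with the Gene Length and Protein Length fields.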


Plasmid Coverage information

Num covering plasmid clones: 41
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGGAACAG GGCTTGCTAA TGCTGATGAT TCGCTTCCTT CCAGTAACTA TGCGCCGCCC 
GCCGGGGGAA CATTCTTTTT GCTTGCTGAC AGCAGTTTTA GCAGCAGTGA AGAGGCGAAA
GTGCGACTGG AAGCGCCGGG GCGTGATTAT CGGCGCTATC AGATGGAAGA GTACGGCGGC
GTGGACGTTC GCCTGTATCG TATTCCTGAC CCGATGGCAT TTTTGCGCCA GCAGAAAAAC
CTGCATCGCA TTGTGGTGCA ACCGCAATAT CTGGGCGACG GGCTGAACAA TACGCTGACC
TGGCTGTGGG ATAACTGGTA CGGCAAATCT CGCCGCGTGA TGCAGCGTAC TTTCTCTTCT
CAGTCACGGC AGAATGTGAC TCAGGCATTA CCCGAATTAC ATCTCGGCAA TGCCATTATT
AAACCTTCCC GTTATGTACA GAACAACCAG TTTTCCCCGC TGAAAAAATA TCCCTTGGTG
GAACAGTTCC GTTATCCACT ATGGCAGGCT AAACCGGTCG AGCCGCAGCA AGGGGTAAAA
CTGGAAGGCG CATCCAGCAA TTTCATCTCG CCGCAGCCGG GTAACATTTA TATTCCTCTC
GGCCAACAAG AGCCGGGACT GTACCTCGTC GAGGCGATGG TTGGTGGGTA TCGGGCGACG
ACGGTGGTGT TTGTTTCCGA TACCGTGGCG CTTAGCAAAG TGTCAGGTAA CGAGCTTCTG
GTATGGACCG CGGGTAAAAA ACAGGGTGAA GCGAAGCCCG GCTCAGAGAT CTTGTGGACT
GACGGTCTTG GCGTGATGAC CCGCGGTGTG ACCGATGACA GCGGTACCTT GCAGTTACAA
CATATATCGC CAGAACGTTC ATACATTCTG GGTAAGGATG CTGAAGGCGG CGTTTTTGTC
TCCGAGAACT TCTTCTACGA AAGCGAAATC TACAACACCC GCTTGTATAT TTTTACCGAT
CGCCCGCTAT ATCGCGCAGG CGATCGTGTC GATGTTAAAG TGATGGGCCG CGAGTTCCAC
GATCCGTTGC ATTCATCCCC CATCGTCAGC GCCCCGGCGA AGCTTTCGGT GCTGGACGCT
AACGGCAGTC TGTTGCAAAC CGTCGATGTC ACGCTGGATG CGCGCAATGG CGGGCAGGGA
AGTTTCCGCC TGCCAGAAAA TGCCGTAGCC GGAGGTTATG AGTTACGTCT TGCTTACCGC
AATCAGGTCT ATAGCAGCAG TTTTCGCGTG GCAAACTACA TCAAGCCACA TTTCGAGATT
GGTTTAGCTC TCGACAAAAA AGAGTTCAAA ACTGGCGAAG CGGTCAGCGG CAAACTGCAA
CTTCTCTACC CGGATGGCGA GCCGGTAAAA AATGCCCGCG TGCAGTTAAG TTTGCGCGCT
CAGCAATTAT CAATGGTCGG TAACGATTTG CGTTATGCCG GACGTTTCCC CGTGTCGCTG
GAAGGCAGCG AAACGGTGTC CGACGCCAGC GGTCATGTGG CGTTAAATCT CCCCGCCGCC
GATAAACCGA GCCGCTATTT GTTAACCGTC TCCGCCAGTG ACGGCGCGGC GTATCGCGTC
ACCACCACCA AAGAGATCCT CATTGAACGC GGTCTGGCGC ATTACTCATT AAGTACTGCC
GCACAATACA GTAATAGCGG CGAGTCGGTT GTGTTCCGTT ATGCCGCGCT GGAATCTTCA
AAACAGGTTC CTGTTACGTA TGAATGGTTG CGTCTCGAAG ACCGCACGAG CCATAGCGGA
GAGCTACCGT CAGGCGGCAA ATCCTTTACC GTCAATTTCG CTAAACCTGG CAACTACAAT
CTGACATTAC GCGATAAAGA CGGCTTAATT CTCGCTGGGT TAAGTCATGC CGTCAGCGGT
AAGGGCAGCA CGGCGCATAC TGGTACGGTA GATATCGTGG CGGATAAAAC GCTGTACCAG
CCAGGCGAAA CCGCGAAGAT GCTGATTACC TTTCCGGAGC CAATTGATGA AGCATTATTG
ACGCTGGAAC GCGATCGCGT GGAACAGCAG TCGCTGCTTT CGCATCCGGC AAACTGGCTA
ACGCTACAAC GTTTAAACGA TACCCAGTAT GAAGCCCGGG TTCCAGTGAG CAATTCCTTT
GCGCCTAACA TCACTTTTTC GGTGCTGTAT ACCCGTAATG GTCAGTACAG TTTTCAGAAC
GCCGGGATCA AAGTTGCCGT TCCTCAGCTT GATATCCGGG TGAAAACGGA CAAAACCCAT
TACCAGCCTG GTGAACTGGT CAATGTCGAA TTAACCTCGT CGCTGAAAGG TAAACCTGTT
TCTGCGCAGC TAACGGTAGG CGTGGTCGAT GAAATGATCT ACGCGCTGCA ACCAGAAATC
GCGCCGAATA TCGGCAAATT TTTCTATCCG CTGGGGCGTA ACAATGTGCG TACCAGCTCC
AGTCTGTCGT TTATCAGCTA CGACCAGGCG CTCTCCAGCG AGCCGGTTGC GCCTGGCGCG
ACTAACCGCA GCGAGCGGCG AGTAAAAATG CTTGAACGTC CACGGCGTGA AGAGGTGGAT
ACCGCGGCAT GGATGCCGTC ACTCACAACC GATAAACAAG GCAAAGCGTA TTTCACGTTC
CTGATGCCTG ATTCGTTAAC CCGCTGGCGT ATCACCGCGC GTGGGATGAA CGGCGACGGG
CTGGTCGGGC AGGGGCGTGC TTATCTGCGT TCGGAAAAAA ATCTCTACAT GAAGTGGAGT
ATGCCAACGG TGTATCGCGT GGGCGACAAA CCGGCGGCAG GACTGTTTAT CTTCAGTCAG
CAGGATAACG AACCGGTGGC GCTGGTGACT AAATTTGCAG GCGCTGAGAT GCGCCAGACG
CTGACGCTGC ACAAAGGGGC GAATTATATT TCGCTGACGC AGAATATTCA GCAATCTGGC
TTGTTAAGTG CAGAACTGCA ACAAAATGGG CAAGTGCAGG ACAGCATTAG CACAAAACTG
TCTTTTGTGG ACAACAGCTG GCCCGTTGAA CAGCAGAAAA ATGTCATGCT CGGCGGTGGC
GAGAACGCGC TGATGTTGCC CGAGCAGGCG AGTAATATCC GGCTACAAAG TAGTGAAACG
CCGCAGGAGA TTTTCCGCAA CAATCTTGAT GCGTTAGTCG ATGAACCGTG GGGTGGGGTG
ATCAACACCG GTAGCCGTCT GATCCCGCTC AGTCTCGCCT GGCGTTCGCT TGCCGATCAT
CAAAGTGCCG CCGCTAACGA CATTCGTCAG ATGATTCAGG ATAACCGTCT GCGGCTGATG
CAACTGGCGG GGCCCGGAGC GCGCTTTACC TGGTGGGGTG AAGATGGCAA TGGTGACGCC
TTCCTTACGG CATGGGCATG GTACGCCGAC TGGCAGGCCA GCCAGGCGCT CGGCGTAACG
CAACAACCGG AATACTGGCA GCATATGCTC GACAGTTATG CCGAGCAGGC GGATAACATG
CCGTTATTGC ATCGGGCGCT GGTGCTGGCG TGGGCACAGG AGATGAATCT GCCGTGCAAA
ACGTTGTTGA AAGGGTTGGA TGAAGCTATC GCCCGGCGCG GAACTAAAGA TGAAGATTTC
TCTGAGGAAG ACATCCGCGA TATCAATGAC AGCCTGATCC TCGATACACC GGAATCTCCA
CTGGCAGATG CGGTGGCAAA CGTCTTAACC ATGACGTTGC TGAAAAAAGC GCAGTTGAAG
TCCACGGTGA TGCCACAGGT TCAGCAATAT GCGTGGGATA AAGCGGCAAA CAGCAATCAG
CCGCTGGCGC ACACGGTTGT GCTGCTCAAT AGCGGGGGCG ACGCTACCCA GGCGGCCGCT
ATTTTAAGTG GTTTGACCGC TGAGCAATCC ACTATTGAGC GCGCACTGGC CATGAACTGG
CTGGCGAAAT ATATGGCGAC AATGCCTCCG GTTGTGTTGC CTGCGCCTGC GGGCGCATGG
GCTAAACATA AGTTAACTGG AGGGGGCGAA GACTGGCGTT GGGTTGGTCA GGGTGTGCCG
GACATTCTCT CTTTTGGTGA CGAATTATCC CCGCAAAATG TGCAGGTCCG CTGGCGTGAA
CCGGCAAAAA CGGCTCAACA AAGTAACATT CCGGTGACCG TTGAACGCCA GTTGTATCGG
CTTATCCCTG GTGAAGAAGA GATGAGCTTT ACTCTGCAAC CGGTGACCAG CAATGAGATT
GACAGCGATG CGCTGTATCT CGATGAAATC ACGCTTACCA GCGAGCAGGA TGCAGTTCTG
CGCTATGGTC AGGTGGAAGT ACCGCTCCCA CCGGGAGCCG ACGTTGAGCG CACAACATGG
GGCATTTCGG TCAATAAACC CAACGCCGCG AAACAGCAGG GGCAATTGCT GGAAAAAGCG
CGAAATGAAA TGGGCGAACT GGCCTATATG GTGCCGGTGA AAGAACTGAC GGGAACGGTC
ACTTTCCGCC ATTTGCTGCG CTTCTCGCAA AAAGGGCAAT TCGTTCTGCC TCCTGCTCGT
TATGTGCGTT CCTATGCACC TGCACAGCAA AGTGTTGCGG CAGGGAGTGA ATGGACCGGG
ATGCAGGTGA AATAA
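
A GC content figure such as the one in the Gene Information section can be computed from a sequence in this layout. The following is a minimal sketch, not the database's own method: `gc_content` is a hypothetical helper that strips the spaces between the 10-base groups and counts G and C. It is applied here only to the first row of the gene sequence, so the printed value is not expected to equal the whole-gene figure.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First row of the gene sequence above (60 bases in six groups of 10).
first_row = "GTGGGAACAG GGCTTGCTAA TGCTGATGAT TCGCTTCCTT CCAGTAACTA TGCGCCGCCC"
print(round(gc_content(first_row), 1))
```

Passing the full sequence, with rows joined by newlines or spaces, gives the value for the whole gene.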
 
Protein sequence
MGTGLANADD SLPSSNYAPP AGGTFFLLAD SSFSSSEEAK VRLEAPGRDY RRYQMEEYGG 
VDVRLYRIPD PMAFLRQQKN LHRIVVQPQY LGDGLNNTLT WLWDNWYGKS RRVMQRTFSS
QSRQNVTQAL PELHLGNAII KPSRYVQNNQ FSPLKKYPLV EQFRYPLWQA KPVEPQQGVK
LEGASSNFIS PQPGNIYIPL GQQEPGLYLV EAMVGGYRAT TVVFVSDTVA LSKVSGNELL
VWTAGKKQGE AKPGSEILWT DGLGVMTRGV TDDSGTLQLQ HISPERSYIL GKDAEGGVFV
SENFFYESEI YNTRLYIFTD RPLYRAGDRV DVKVMGREFH DPLHSSPIVS APAKLSVLDA
NGSLLQTVDV TLDARNGGQG SFRLPENAVA GGYELRLAYR NQVYSSSFRV ANYIKPHFEI
GLALDKKEFK TGEAVSGKLQ LLYPDGEPVK NARVQLSLRA QQLSMVGNDL RYAGRFPVSL
EGSETVSDAS GHVALNLPAA DKPSRYLLTV SASDGAAYRV TTTKEILIER GLAHYSLSTA
AQYSNSGESV VFRYAALESS KQVPVTYEWL RLEDRTSHSG ELPSGGKSFT VNFAKPGNYN
LTLRDKDGLI LAGLSHAVSG KGSTAHTGTV DIVADKTLYQ PGETAKMLIT FPEPIDEALL
TLERDRVEQQ SLLSHPANWL TLQRLNDTQY EARVPVSNSF APNITFSVLY TRNGQYSFQN
AGIKVAVPQL DIRVKTDKTH YQPGELVNVE LTSSLKGKPV SAQLTVGVVD EMIYALQPEI
APNIGKFFYP LGRNNVRTSS SLSFISYDQA LSSEPVAPGA TNRSERRVKM LERPRREEVD
TAAWMPSLTT DKQGKAYFTF LMPDSLTRWR ITARGMNGDG LVGQGRAYLR SEKNLYMKWS
MPTVYRVGDK PAAGLFIFSQ QDNEPVALVT KFAGAEMRQT LTLHKGANYI SLTQNIQQSG
LLSAELQQNG QVQDSISTKL SFVDNSWPVE QQKNVMLGGG ENALMLPEQA SNIRLQSSET
PQEIFRNNLD ALVDEPWGGV INTGSRLIPL SLAWRSLADH QSAAANDIRQ MIQDNRLRLM
QLAGPGARFT WWGEDGNGDA FLTAWAWYAD WQASQALGVT QQPEYWQHML DSYAEQADNM
PLLHRALVLA WAQEMNLPCK TLLKGLDEAI ARRGTKDEDF SEEDIRDIND SLILDTPESP
LADAVANVLT MTLLKKAQLK STVMPQVQQY AWDKAANSNQ PLAHTVVLLN SGGDATQAAA
ILSGLTAEQS TIERALAMNW LAKYMATMPP VVLPAPAGAW AKHKLTGGGE DWRWVGQGVP
DILSFGDELS PQNVQVRWRE PAKTAQQSNI PVTVERQLYR LIPGEEEMSF TLQPVTSNEI
DSDALYLDEI TLTSEQDAVL RYGQVEVPLP PGADVERTTW GISVNKPNAA KQQGQLLEKA
RNEMGELAYM VPVKELTGTV TFRHLLRFSQ KGQFVLPPAR YVRSYAPAQQ SVAAGSEWTG
MQVK
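
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. The sketch below is a minimal illustration: `translate_cds` is a hypothetical helper, and `START_CODONS` lists only a subset of the initiators allowed by table 11. The amino acid assignments of table 11 match the standard code; the difference shown here is that a GTG initiator, as in this gene, is rendered as M. The last row can be translated on its own because the 75 preceding rows hold 4500 bases, a multiple of three.

```python
# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Assumption: a subset of the initiator codons permitted by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(seq: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        aa = CODON_TABLE[codon]
        if pos == 0 and codon in START_CODONS:
            aa = "M"  # an initiator codon is translated as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last rows of the gene sequence, copied from the record.
first_row = "GTGGGAACAG GGCTTGCTAA TGCTGATGAT TCGCTTCCTT CCAGTAACTA TGCGCCGCCC"
last_row = "ATGCAGGTGA AATAA"

print(translate_cds(first_row))  # MGTGLANADDSLPSSNYAPP
print(translate_cds(last_row))   # MQVK
```

The two outputs match the first 20 and last 4 residues of the protein sequence. Note that `translate_cds` treats the first codon of any input as a potential initiator, which happens to be harmless for the last row because ATG already encodes M.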