Gene EcolC_1423 details

Gene Information

Locus tag: EcolC_1423
Symbol:
ID: 6067710
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 1562208
End bp: 1566722
Gene Length: 4515 bp
Protein Length: 1504 aa
Translation table: 11
GC content: 54%
IMG OID: 641600842
Product: alpha-2-macroglobulin domain-containing protein
Protein accession: YP_001724413
Protein GI: 170019459
COG category: [R] General function prediction only
COG ID: [COG2373] Large extracellular alpha-helical protein
TIGRFAM ID:
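
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the gene length includes the stop codon. Both assumptions are consistent with the values listed but are not stated by the record. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(1562208, 1566722)
print(length)                           # 4515
print(expected_protein_length(length))  # 1504
```

Both results agree with the Gene Length and Protein Length fields.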


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.427509
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000103653
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
GTGGGAACAG GGCTTGCTAA TGCTGATGAT TCGCTTCCTT CCAGCAACTA TGCGCCGCCT 
GCCGGGGGAA CATTCTTTTT GCTTGCTGAC AGCAGTTTTA GCAGCAGTGA AGAGGCGAAA
GTGCGACTGG AAGCGCCGGG GCGTGATTAT CGGCGCTATC AGATGGAAGA GTACGGCGGC
GTGGACGTTC GCCTGTATCG TATTCCTGAC CCGATGGCAT TTTTGCGCCA GCAGAAAAAC
CTGCATCGCA TTGTGGTGCA ACCGCAATAT CTGGGCGACG GGCTGAACAA TACGCTAACC
TGGCTGTGGG ATAACTGGTA CGGCAAATCT CGCCGCGTGA TGCAGCGTAC TTTCTCTTCT
CAGTCACGGC AGAATGTGAC TCAGGCGTTA CCCGAATTAC AGCTCGGCAA TGCCATTATT
AAACCTTCCC GTTATGTACA GAACAACCAG TTTTCCCCGC TGAAAAAATA TCCCCTGGTG
GAACAGTTCC GTTATCCACT ATGGCAGGCT AAACCGTTCG AGCCGCAGCA AGGGGTAAAA
CTGGAAGGCG CATCCAGCAA TTTCATCTCG CCGCAGCCGG GTAACATTTA TATTCCTCTC
GGCCAACAAG AGCCGGGACT GTACCTCGTC GAGGCGATGG TTGGTGGGTA TCGGGCGACG
ACGGTGGTGT TTGTTTCCGA TACCGTGGCG CTTAGCAAAG TGTCAGGCAA AGAGCTTCTG
GTGTGGACCG CGGGTAAAAA ACAGGGTGAA GCGAAGCCCG GCTCAGAGAT CTTGTGGACT
GACGGTCTTG GCGTGATGAC CCGCGGTGTG ACCGATGACA GCGGTACCTT GCAGTTACAA
CATATATCGC CAGAACGTTC ATACATTCTG GGTAAGGATG CTGAAGGCGG CGTTTTTGTC
TCCGAGAACT TCTTCTACGA AAGCGAAATC TACAACACCC GCTTGTATAT TTTTACCGAT
CGCCCGCTAT ATCGCGCAGG CGATCGTGTC GATGTTAAAG TGATCGGCCG CGAGTTCCAC
GATCCGTTGC ATTCATCCCC CATCGTCAGC GCCCCGGCGA AGCTTTCGGT GCTGGACGCC
AACGGCAGTC TGTTGCAAAC CGTCAATGTC ACGCTGGATG CGCGCAATGG CGGGCAGGGA
AGTTTCCGCC TGCCAGAAAA TGCCGTAGCC GGAGGTTATG AGTTACGTCT TGCTTACCGC
AATCAGGTCT ATAGCAGCAG TTTTCGCGTG GCAAACTACA TCAAGCCACA TTTCGAGATT
GGTTTAGCTC TCGACAAAAA AGAGTTCAAA ACTGGCGAAG CGGTCAGCGG CAAACTGCAA
CTGCTCTACC CGGATGGCGA GCCGGTAAAA AATGCCCGCG TGCAGTTAAG TTTGCGCGCT
CAGCAATTAT CAATGGTCGG TAACGATTTG CGTTATGCCG GACGTTTCCC CGTGTCGCTG
GAAGGCAGCG AAACGGTGTC CGACGCCAGC GGTCATGTGA CGTTAAATCT CCCCGCCGCC
GATAAACCGA GCCGCTATTT GTTAACCGTC TCCGCCAGTG ACGGCGCGGC GTATCGCGTC
ACCACCACCA AAGAGATCCT CATTGAACGC GGCCTGGCGC ATTACTCTTT AAGTACCGCC
GCACAATACA GTAATAGCGG CGAGTCGGTT GTGTTCCGTT ATGCCGCGCT GGAATCTTCA
AAACAGGTTC CTGTTACGTA TGAATGGTTG CGTCTCGAAG ACCGCACGAG CCATAGCGGA
GAGCTACCGT CAGGCGGCAA ATCCTTTACC GTCAATTTCG CTAAACCTGG CAACTACAAT
CTGACATTAC GCGATAAAGA CGGCTTAATT CTCGCTGGGT TAAGTCATGC CGTCAGCGGT
AAGGGCAGCA CGGCGCATAC TGGTACGGTA GATATCGTGG CGGATAAAAC GCTGTACCAG
CCAGGCGAAA CCGCGAAGAT GCTGATTACC TTTCCGGAGC CAATTGATGA AGCATTATTG
ACGCTGGAAC GCGATCGCGT GGAACAGCAG TCGCTGCTTT CGCATCCGGC AAACTGGCTA
ACGCTACAAC GTTTAAACGA TACCCAGTAT GAAGCCCGGG TTCCAGTGAG CAATTCCTTT
GCGCCTAACA TCACTTTTTC GGTGCTGTAT ACCCGTAATG GTCAGTACAG TTTTCAGAAC
GCCGGGATCA AAGTTGCCGT TCCTCAGCTT GATATCCGGG TGAAAACGGA CAAAACCCAT
TACCAGCCTG GTGAACTGGT CAATGTCGAA TTAACCTCGT CGCTGAAAGG TAAACCTGTT
TCTGCGCAGC TAACGGTAGG CGTGGTCGAT GAAATGATCT ACGCGCTGCA ACCAGAAATC
GCGCCGAATA TCGGCAAATT TTTCTATCCG CTGGGGCGTA ACAATGTGCG TACCAGCTCC
AGTCTGTCGT TTATCAGCTA CGACCAGGCG CTCTCCAGCG AGCCGGTTGC GCCTGGCGCA
ACTAACCGCA GCGAGCGGCG AGTAAAAATG CTTGAACGTC CACGGCGTGA AGAGGTGGAT
ACCGCGGCAT GGATGCCGTC ACTCACAACC GATAAACAAG GCAAAGCGTA TTTCACGTTC
CTGATGCCTG ATTCGTTAAC CCGCTGGCGT ATCACCGCGC GTGGGATGAA CGGCGACGGG
CTGGTCGGGC AGGGGCGTGC TTATCTGCGT TCGGAAAAAA ATCTCTACAT GAAGTGGAGT
ATGCCAACGG TGTATCGCGT GGGCGACAAA CCGGCGGCAG GACTGTTTAT CTTCAGTCAG
CAGGATAACG AACCGGTGGC GCTGGTGACT AAATTTGCAG GCGCTGAGAT GCGCCAGACG
CTGACGCTGC ACAAAGGGGC GAATTATATT TCGCTGACGC AGAATATTCA GCAATCTGGC
TTGTTAAGTG CAGAACTGCA ACAAAATGGG CAAGTGCAGG ACAGCATTAG CACAAAACTG
TCTTTTGTGG ATAACAGCTG GCCCGTTGAA CAGCAGAAAA ATGTCATGCT CGGTGGTGGC
GATAACGCGC TGATGTTGCC CGAGCAGGCG AGCAATATCC GGCTACAAAG TAGTGAAACG
CCGCAGGAGA TTTTCCGCAA CAATCTTGAT GCGTTAGTCG ATGAACCGTG GGGTGGCGTA
ATCAACACCG GTAGCCGTCT GATCCCGCTC AGTCTCGCCT GGCGTTCGCT TGCCGATCAT
CAAAGTGCCG CCGCTAACGA CATTCGTCAG ATGATTCAGG ATAACCGTCT GCGGCTGATG
CAACTGGCGG GGCCCGGAGC GCGCTTTACC TGGTGGGGTG AAGATGGCAA TGGTGACGCC
TTCCTTACGG CATGGGCATG GTACGCCGAC TGGCAGGCCA GCCAGGCGAT CGGCGTAACG
CAACAACCGG AATACTGGCA GCATATGCTC GACAGCTACG CGGAGCAGGC AGATAACATG
CCGTTATTGC ATCGGGCGCT GGTGCTGGCA TGGGCGCAGG AGATGAATTT GCCGTGCAAA
ACGTTGTTGA AAGGGTTGGA TGAAGCTATC GCCCGGCGCG GAACTAAAAC TGAAGATTTC
TCTGAGGAAG ACACCCGCGA TATCAATGAT AGCCTGATCC TCGATACACC GGAGTCTCCA
CTGGCAGATG CGGTGGCAAA CGTCTTAACC ATGACGTTGC TGAAAAAAGC GCAGTTGAAG
TCCACGGTGA TGCCACAGGT TCAGCAATAT GCGTGGGATA AAGCGGCAAA CAGCAATCAG
CCGCTGGCGC ACACGGTTGT GCTGCTTAAT AGCGGTGGCG ACGCTACCCA GACGGCCGCT
ATTTTAAGTG GTTTGACCGC TGAGCAATCC ACTATTGAGC GCGCGCTGGC CATGAACTGG
CTGGCGAAAT ATATGGCGAC AATGCCTCCA GTTGTTTTGC CTGCGCCTGC GGGCGCATGG
GCTAAACATA AGTTAACTGG AGGGGGCGAA GACTGGCGTT GGGTTGGTCA GGGTGTGCCG
GACATTCTCT CTTTTGGTGA CGAATTATCG CCGCAAAATG TGCAGGTCCG CTGGCGTGAA
CCGGCAAAAA CGGCTCAACA AAGTAACATT CCGGTGACCG TTGAACGCCA GTTGTATCGG
CTTATCCCCG GTGAAGAAGA GATGAGCTTT ACTCTGCAGC CGGTGACCAG CAATGAGATT
GACAGCGATG CGCTGTATCT CGATGAAATT ACGCTTACCA GCGAGCAGGA TGCAGTTCTG
CGCTATGGTC AGGTGGAAGT ACCGCTCCCG CCGGGAGCCG ACGTTGAGCG CACAACATGG
GGCATTTCAG TCAATAAACC CAACGCCGCG AAACAGCAGG GGCAATTGCT GGAAAAAGCG
CGTAATGAAA TGGGCGAACT GGCCTATATG GTGCCGGTGA AAGAACTGAC GGGAACGGTC
ACTTTCCGCC ATTTGCTGCG CTTCTCGCAA AAAGGGCAAT TCGTTCTGCC TCCTGCTCGT
TATGTGCGTT CCTATGCACC TGCGCAGCAA AGTGTTGCGG CAGGGAGTGA ATGGACCGGG
ATGCAGGTGA AATAA
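
The gene sequence is laid out in space-separated blocks of 10 bases, 60 bases per line. The following is a minimal sketch of joining that layout back into one string and computing a GC fraction from it. The helper names are hypothetical, and how the record's own "GC content" value was computed or rounded is not stated, so any comparison with it is only approximate.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_content(text: str) -> float:
    """Fraction of G and C bases in a sequence (a value between 0.0 and 1.0)."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, copied as displayed.
first_line = "GTGGGAACAG GGCTTGCTAA TGCTGATGAT TCGCTTCCTT CCAGCAACTA TGCGCCGCCT"
print(len(clean_sequence(first_line)))  # 60
```

Passing the full sequence text to `gc_content` and multiplying by 100 gives a percentage that can be compared with the GC content field.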
 
Protein sequence
MGTGLANADD SLPSSNYAPP AGGTFFLLAD SSFSSSEEAK VRLEAPGRDY RRYQMEEYGG 
VDVRLYRIPD PMAFLRQQKN LHRIVVQPQY LGDGLNNTLT WLWDNWYGKS RRVMQRTFSS
QSRQNVTQAL PELQLGNAII KPSRYVQNNQ FSPLKKYPLV EQFRYPLWQA KPFEPQQGVK
LEGASSNFIS PQPGNIYIPL GQQEPGLYLV EAMVGGYRAT TVVFVSDTVA LSKVSGKELL
VWTAGKKQGE AKPGSEILWT DGLGVMTRGV TDDSGTLQLQ HISPERSYIL GKDAEGGVFV
SENFFYESEI YNTRLYIFTD RPLYRAGDRV DVKVIGREFH DPLHSSPIVS APAKLSVLDA
NGSLLQTVNV TLDARNGGQG SFRLPENAVA GGYELRLAYR NQVYSSSFRV ANYIKPHFEI
GLALDKKEFK TGEAVSGKLQ LLYPDGEPVK NARVQLSLRA QQLSMVGNDL RYAGRFPVSL
EGSETVSDAS GHVTLNLPAA DKPSRYLLTV SASDGAAYRV TTTKEILIER GLAHYSLSTA
AQYSNSGESV VFRYAALESS KQVPVTYEWL RLEDRTSHSG ELPSGGKSFT VNFAKPGNYN
LTLRDKDGLI LAGLSHAVSG KGSTAHTGTV DIVADKTLYQ PGETAKMLIT FPEPIDEALL
TLERDRVEQQ SLLSHPANWL TLQRLNDTQY EARVPVSNSF APNITFSVLY TRNGQYSFQN
AGIKVAVPQL DIRVKTDKTH YQPGELVNVE LTSSLKGKPV SAQLTVGVVD EMIYALQPEI
APNIGKFFYP LGRNNVRTSS SLSFISYDQA LSSEPVAPGA TNRSERRVKM LERPRREEVD
TAAWMPSLTT DKQGKAYFTF LMPDSLTRWR ITARGMNGDG LVGQGRAYLR SEKNLYMKWS
MPTVYRVGDK PAAGLFIFSQ QDNEPVALVT KFAGAEMRQT LTLHKGANYI SLTQNIQQSG
LLSAELQQNG QVQDSISTKL SFVDNSWPVE QQKNVMLGGG DNALMLPEQA SNIRLQSSET
PQEIFRNNLD ALVDEPWGGV INTGSRLIPL SLAWRSLADH QSAAANDIRQ MIQDNRLRLM
QLAGPGARFT WWGEDGNGDA FLTAWAWYAD WQASQAIGVT QQPEYWQHML DSYAEQADNM
PLLHRALVLA WAQEMNLPCK TLLKGLDEAI ARRGTKTEDF SEEDTRDIND SLILDTPESP
LADAVANVLT MTLLKKAQLK STVMPQVQQY AWDKAANSNQ PLAHTVVLLN SGGDATQTAA
ILSGLTAEQS TIERALAMNW LAKYMATMPP VVLPAPAGAW AKHKLTGGGE DWRWVGQGVP
DILSFGDELS PQNVQVRWRE PAKTAQQSNI PVTVERQLYR LIPGEEEMSF TLQPVTSNEI
DSDALYLDEI TLTSEQDAVL RYGQVEVPLP PGADVERTTW GISVNKPNAA KQQGQLLEKA
RNEMGELAYM VPVKELTGTV TFRHLLRFSQ KGQFVLPPAR YVRSYAPAQQ SVAAGSEWTG
MQVK
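
The protein sequence follows from the gene sequence under translation table 11 (the bacterial, archaeal and plant plastid code). In that table the codon-to-amino-acid assignments match the standard code, and alternative start codons such as GTG are read as methionine when they initiate translation. The following is a minimal standard-library sketch. The function name is hypothetical, and the input is assumed to be a complete, unspliced CDS on its coding strand containing only A, C, G and T.

```python
from itertools import product

# NCBI codon order: bases T, C, A, G with the third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading the initiating codon as M and stopping at the first stop."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    if not codons or codons[0] not in START_CODONS:
        raise ValueError("sequence does not begin with a table 11 start codon")
    protein = ["M"]  # the initiator codon is translated as methionine
    for codon in codons[1:]:
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGGGAACAG GGCTTGCTAA TGCTGATGAT"))  # MGTGLANADD
```

The output matches the first block of the protein sequence, including the leading M that comes from the GTG start codon.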