Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcDH1_1657 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli DH1 |
Kingdom | Bacteria |
Replicon accession | CP001637 |
Strand | - |
Start bp | 1803397 |
End bp | 1806516 |
Gene Length | 3120 bp |
Protein Length | 1039 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | |
Product | outer membrane autotransporter barrel domain protein |
Protein accession | ACX39321 |
Protein GI | 260448899 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.656845 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACGAC ATCTGAATAC CTGCTACAGG CTGGTATGGA ATCACATGAC GGGCGCTTTC GTGGTTGCCT CCGAACTGGC CCGCGCACGG GGTAAACGTG GCGGTGTGGC GGTTGCACTG TCTCTTGCCG CAGTCACGTC ACTCCCGGTG CTGGCTGCTG ACATCGTTGT GCACCCGGGA GAAACCGTGA ACGGCGGAAC ACTGGCAAAT CATGACAACC AGATTGTCTT CGGTACGACC AACGGAATGA CCATCAGTAC CGGGCTGGAG TATGGGCCGG ATAACGAGGC CAATACCGGC GGGCAATGGG TACAGGATGG CGGAACAGCC AACAAAACGA CTGTCACCAG TGGTGGTCTT CAGAGAGTGA ACCCCGGTGG AAGTGTCTCA GACACGGTTA TCAGTGCCGG AGGCGGACAG AGCCTTCAGG GACGGGCTGT GAACACCACG CTGAATGGTG GCGAACAGTG GATGCATGAG GGGGCGATAG CCACAGGAAC CGTCATTAAT GATAAGGGCT GGCAGGTCGT CAAGCCCGGT ACAGTGGCAA CGGATACCGT TGTTAATACC GGGGCGGAAG GGGGACCGGA TGCAGAAAAC GGTGATACCG GGCAGTTTGT TCGCGGGGAT GCCGTACGCA CAACCATCAA TAAAAACGGT CGCCAGATTG TGAGAGCTGA AGGAACGGCA AATACCACTG TGGTTTATGC CGGCGGCGAC CAGACTGTAC ATGGTCACGC ACTGGATACC ACGCTGAATG GGGGATACCA GTATGTGCAC AACGGCGGTA CAGCGTCTGA CACTGTTGTG AACAGTGACG GCTGGCAGAT TGTCAAAAAC GGGGGTGTGG CCGGGAATAC CACCGTTAAT CAGAAGGGCA GACTGCAGGT GGACGCCGGT GGTACAGCCA CGAATGTCAC CCTGAAGCAG GGCGGCGCAC TGGTTACCAG TACGGCTGCA ACCGTTACCG GCATAAACCG CCTGGGAGCA TTCTCTGTTG TGGAGGGTAA AGCTGATAAT GTCGTACTGG AAAATGGCGG ACGCCTGGAT GTGCTGACCG GACACACAGC CACTAATACC CGCGTGGATG ATGGCGGAAC GCTGGATGTC CGCAACGGTG GCACCGCCAC CACCGTATCC ATGGGAAATG GCGGTGTACT GCTGGCCGAT TCCGGTGCCG CTGTCAGTGG TACCCGGAGC GACGGAAAGG CATTCAGTAT CGGAGGCGGT CAGGCGGATG CCCTGATGCT GGAAAAAGGC AGTTCATTCA CGCTGAACGC CGGTGATACG GCCACGGATA CCACGGTAAA TGGCGGACTG TTCACCGCCA GGGGCGGCAC ACTGGCGGGC ACCACCACGC TGAATAACGG CGCCATACTT ACCCTTTCCG GGAAGACGGT GAACAACGAT ACCCTGACCA TCCGTGAAGG CGATGCACTC CTGCAGGGAG GCTCTCTCAC CGGTAACGGC AGCGTGGAAA AATCAGGAAG TGGCACACTC ACTGTCAGCA ACACCACACT CACCCAGAAA GCCGTCAACC TGAATGAAGG CACGCTGACG CTGAACGACA GTACCGTCAC CACGGATGTC ATTGCTCAGC GCGGTACAGC CCTGAAGCTG ACCGGCAGCA CTGTGCTGAA CGGTGCCATT GACCCCACGA ATGTCACTCT CGCCTCCGGT GCCACCTGGA ATATCCCCGA TAACGCCACG GTGCAGTCGG TGGTGGATGA CCTCAGCCAT GCCGGACAGA TTCATTTCAC CTCCACCCGC ACAGGGAAGT TCGTACCGGC AACCCTGAAA GTGAAAAACC TGAACGGACA GAATGGCACC ATCAGCCTGC GTGTACGCCC GGATATGGCA CAGAACAATG CTGACAGACT GGTCATTGAC GGCGGCAGGG CAACCGGAAA AACCATCCTG AACCTGGTGA ACGCCGGCAA CAGTGCGTCG GGGCTGGCGA CCAGCGGTAA GGGTATTCAG GTGGTGGAAG CCATTAACGG TGCCACCACG GAGGAAGGGG CCTTTGTCCA GGGGAACAGG CTGCAGGCCG GTGCCTTTAA CTACTCCCTC AACCGGGACA GTGATGAGAG CTGGTATCTG CGCAGTGAAA ATGCTTATCG TGCAGAAGTC CCCCTGTATG CCTCCATGCT GACACAGGCA ATGGACTATG ACCGGATTGT GGCAGGCTCC CGCAGCCATC AGACCGGTGT AAATGGTGAA AACAACAGCG TCCGTCTCAG CATTCAGGGC GGTCATCTCG GTCACGATAA CAATGGCGGT ATTGCCCGTG GGGCCACGCC GGAAAGCAGC GGCAGCTATG GATTCGTCCG TCTGGAGGGT GACCTGATGA GAACAGAGGT TGCCGGTATG TCTGTGACCG CGGGGGTATA TGGTGCTGCT GGCCATTCTT CCGTTGATGT TAAGGATGAT GACGGCTCCC GTGCCGGCAC GGTCCGGGAT GATGCCGGCA GCCTGGGCGG ATACCTGAAT CTGGTACACA CGTCCTCCGG CCTGTGGGCT GACATTGTGG CACAGGGAAC CCGCCACAGC ATGAAAGCGT CATCGGACAA TAACGACTTC CGCGCCCGGG GCTGGGGCTG GCTGGGCTCA CTGGAAACCG GTCTGCCCTT CAGTATCACT GACAACCTGA TGCTGGAGCC ACAACTGCAG TATACCTGGC AGGGACTTTC CCTGGATGAC GGTAAGGACA ACGCCGGTTA TGTGAAGTTC GGGCATGGCA GTGCACAACA TGTGCGTGCC GGTTTCCGTC TGGGCAGCCA CAACGATATG ACCTTTGGCG AAGGCACCTC ATCCCGTGCC CCCCTGCGTG ACAGTGCAAA ACACAGTGTG AGTGAATTAC CGGTGAACTG GTGGGTACAG CCTTCTGTTA TCCGCACCTT CAGCTCCCGG GGAGATATGC GTGTGGGGAC TTCCACTGCA GGCAGCGGGA TGACGTTCTC TCCCTCACAG AATGGCACAT CACTGGACCT GCAGGCCGGA CTGGAAGCCC GTGTCCGGGA AAATATCACC CTGGGCGTTC AGGCCGGTTA TGCCCACAGC GTCAGCGGCA GCAGCGCTGA AGGGTATAAC GGTCAGGCCA CACTGAATGT GACCTTCTGA
|
Protein sequence | MKRHLNTCYR LVWNHMTGAF VVASELARAR GKRGGVAVAL SLAAVTSLPV LAADIVVHPG ETVNGGTLAN HDNQIVFGTT NGMTISTGLE YGPDNEANTG GQWVQDGGTA NKTTVTSGGL QRVNPGGSVS DTVISAGGGQ SLQGRAVNTT LNGGEQWMHE GAIATGTVIN DKGWQVVKPG TVATDTVVNT GAEGGPDAEN GDTGQFVRGD AVRTTINKNG RQIVRAEGTA NTTVVYAGGD QTVHGHALDT TLNGGYQYVH NGGTASDTVV NSDGWQIVKN GGVAGNTTVN QKGRLQVDAG GTATNVTLKQ GGALVTSTAA TVTGINRLGA FSVVEGKADN VVLENGGRLD VLTGHTATNT RVDDGGTLDV RNGGTATTVS MGNGGVLLAD SGAAVSGTRS DGKAFSIGGG QADALMLEKG SSFTLNAGDT ATDTTVNGGL FTARGGTLAG TTTLNNGAIL TLSGKTVNND TLTIREGDAL LQGGSLTGNG SVEKSGSGTL TVSNTTLTQK AVNLNEGTLT LNDSTVTTDV IAQRGTALKL TGSTVLNGAI DPTNVTLASG ATWNIPDNAT VQSVVDDLSH AGQIHFTSTR TGKFVPATLK VKNLNGQNGT ISLRVRPDMA QNNADRLVID GGRATGKTIL NLVNAGNSAS GLATSGKGIQ VVEAINGATT EEGAFVQGNR LQAGAFNYSL NRDSDESWYL RSENAYRAEV PLYASMLTQA MDYDRIVAGS RSHQTGVNGE NNSVRLSIQG GHLGHDNNGG IARGATPESS GSYGFVRLEG DLMRTEVAGM SVTAGVYGAA GHSSVDVKDD DGSRAGTVRD DAGSLGGYLN LVHTSSGLWA DIVAQGTRHS MKASSDNNDF RARGWGWLGS LETGLPFSIT DNLMLEPQLQ YTWQGLSLDD GKDNAGYVKF GHGSAQHVRA GFRLGSHNDM TFGEGTSSRA PLRDSAKHSV SELPVNWWVQ PSVIRTFSSR GDMRVGTSTA GSGMTFSPSQ NGTSLDLQAG LEARVRENIT LGVQAGYAHS VSGSSAEGYN GQATLNVTF
|
| |