Gene Information |
Locus tag | EcDH1_3061 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli DH1 |
Kingdom | Bacteria |
Replicon accession | CP001637 |
Strand | + |
Start bp | 3290165 |
End bp | 3293137 |
Gene Length | 2973 bp |
Protein Length | 990 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | |
Product | TPR repeat-containing protein |
Protein accession | ACX40689 |
Protein GI | 260450267 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
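The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the original record: it assumes the start and end positions are 1-based and inclusive, and that the CDS is complete and ends in a single stop codon (the listed gene sequence ends in TAA). The function names are hypothetical helpers introduced for illustration.

```python
# Values copied from the Gene Information table.
START_BP = 3290165
END_BP = 3293137
GENE_LENGTH_BP = 2973
PROTEIN_LENGTH_AA = 990


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def protein_length_from_cds(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS: number of codons minus the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both checks pass for this record under the stated assumptions.
    assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
    assert protein_length_from_cds(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
    print("coordinates, gene length, and protein length are consistent")
```

Under these assumptions the 2973 bp span corresponds to 991 codons, of which the last is the stop codon, giving the 990 aa protein listed in the table.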
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGGAGA ATAACCTTAA TCGCGTCATC GGATGGTCTG GTTTACTGCT GACGTCTTTA TTGAGTACCA GCGCACTCGC AGACAATATC GGCACCAGCG CAGAAGAGCT GGGGCTGAGC GATTATCGCC ATTTTGTTAT TTATCCCCGT CTCGATAAAG CGCTGAAGGC ACAGAAAAAT AACGACGAAG CAACCGCCAT CCGCGAATTT GAATATATAC ACCAGCAGGT GCCGGATAAT ATTCCGCTGA CTTTATACCT TGCGGAAGCC TATCGCCATT TTGGTCATGA TGACCGGGCG CGGCTGTTGC TTGAGGATCA ACTGAAACGT CACCCAGGAG ATGCCCGACT TGAGCGCAGT CTGGCGGCTA TTCCGGTTGA AGTGAAAAGC GTTACGACAG TTGAAGAACT GCTTGCCCAG CAAAAAGCGT GCGATGCTGC GCCGACCCTG CGTTGTCGCA GTGAAGTCGG GCAGAATGCC CTGCGGCTGG CACAGTTACC TGTCGCCAGA GCGCAACTGA ACGATGCGAC GTTTGCTGCA TCGCCGGAAG GAAAAACGCT GCGAACCGAT CTGCTGCAAC GGGCAATCTA CCTGAAACAA TGGTCCCAGG CAGATACGCT ATACAATGAA GCACGCCAGC AGAACACATT AAGCGCGGCA GAACGCCGTC AGTGGTTTGA CGTGCTTCTT GCCGGGCAGC TGGACGATCG GATCCTGGCA CTGCAATCAC AGGGGATCTT CACCGATCCT CAGTCATATA TTACTTACGC GACCGCGCTG GCTTATCGTG GCGAAAAAGC ACGCCTCCAG CATTATCTCA TTGAAAATAA GCCACTATTT ACCACGGACG CACAAGAGAA AAGTTGGCTC TATCTGTTAT CTAAATACAG CGCTAACCCC GTTCAGGCGT TGGCGAATTA TACGGTACAG TTTGCCGACA ACCGCCAGTA TGTTGTTGGC GCGACGCTAC CGGTGCTGTT AAAAGAAGGT CAGTACGACG CAGCGCAAAA ACTGCTCGCC ACCCTCCCCG CCAATGAAAT GCTTGAGGAG CGTTATGCTG TCAGCGTGGC GACCCGTAAC AAGGCTGAAG CTCTGCGTCT GGCACGATTG CTGTATCAGC AAGAACCGGC AAATCTTACC CGCCTGGATC AACTAACCTG GCAACTGATG CAGAACGAGC AGTCACGCGA AGCTGCCGAT TTATTGCTGC AACGCTATCC TTTCCAGGGC GATGCGCGTG TCAGCCAGAC TTTAATGGCG CGACTGGCGT CTCTGCTGGA AAGTCATCCT TACCTGGCAA CGCCGGCGAA GGTGGCGATT TTATCGAAAC CCTTACCGCT GGCGGAGCAA CGTCAGTGGC AAAGTCAGTT GCCGGGTATT GCAGATAATT GCCCGGCAAT AGTTCGCTTG CTGGGCGATA TGTCGCCTTC CTACGATGCC GCCGCCTGGA ACCGTCTGGC AAAGTGTTAT CGGGACACGC TACCCGGTGT GGCGTTGTAT GCATGGCTTC AGGCCGAACA ACGACAACCG AGCGCCTGGC AACATCGTGC GGTAGCCTAT CAGGCGTATC AGGTTGAGGA CTACGCCACC GCACTGGCGG CCTGGCAGAA AATCAGTCTT CACGACATGA GCAATGAGGA TCTGCTTGCT GCTGCCAATA CCGCCCAGGC GGCAGGAAAT GGTGCGGCTC GCGATCGCTG GCTACAACAG GCAGAAAAAC GTGGACTGGG AAGCAATGCC CTCTACTGGT GGCTGCATGC GCAACGTTAC ATTCCTGGTC AGCCGGAACT CGCACTGAAC 
GATCTCACGC GCTCAATCAA TATTGCGCCT TCTGCCAACG CTTACGTTGC GCGGGCGACA ATTTATCGCC AACGTCATAA TGTCCCGGCC GCGGTGAGTG ATTTGCGCGC CGCGCTGGAA CTGGAACCGA ATAATAGCAA CACCCAGGCA GCGCTTGGTT ACGCCTTGTG GGATAGCGGT GATATCGCAC AGTCGCGGGA AATGCTCGAA CCGGCGCATA AAGGGCTTCC GGACGATCCG GCACTGATCC GACAACTGGC CTACGTGAAC CAGCGTCTGG ATGACATGCC TGCGACGCAG CACTACGCCC GGCTGGTGAT TGATGACATT GATAATCAGG CGCTGATAAC CCCACTGACC CCAGAACAAA ATCAACAACG CTTCAATTTC CGCCGTTTGC ATGAGGAGGT CGGTCGCCGC TGGACGTTCA GTTTCGATTC TTCCATCGGC TTGCGTTCCG GCGCAATGAG TACCGCTAAC AATAATGTCG GCGGCGCAGC GCCAGGGAAA AGCTATCGTA GCTACGGACA ACTGGAAGCC GAGTACCGCA TCGGACGCAA TATGCTGCTG GAAGGCGACC TGCTCTCAGT TTATAGCCGC GTCTTTGCCG ATACCGGAGA AAACGGGGTG ATGATGCCGG TGAAAAATCC GATGTCCGGC ACCGGTCTGC GCTGGAAGCC GCTGCGCGAT CAGATCTTTT TCATCGCCGT CGAACAGCAG TTGCCGCTGA ACGGCCAAAA TGGCGCATCC GATACCATGC TGCGCGCCAG CGCCTCATTC TTTAATGGCG GCAAATACAG CGACGAATGG CACCCGAACG GTTCAGGCTG GTTTGCCCAA AACCTGTACC TCGATGCGGC GCAATATATC CGCCAGGATA TTCAGGCGTG GACGGCAGAT TATCGCGTCA GCTGGCATCA GAAGGTAGCT AACGGACAGA CTATTGAGCC TTACGCTCAC GTTCAGGACA ACGGCTATCG TGATAAAGGC ACTCAGGGCG CGCAGCTTGG CGGAGTCGGG GTCCGCTGGA ATATCTGGAC CGGCGAGACG CACTACGACG CCTGGCCGCA CAAAGTCAGT CTCGGCGTCG AGTATCAACA TACCTTTAAG GCGATTAATC AACGTAACGG AGAGCGCAAC AACGCGTTTC TCACCATTGG AGTGCACTGG TAA
|
Protein sequence | MKENNLNRVI GWSGLLLTSL LSTSALADNI GTSAEELGLS DYRHFVIYPR LDKALKAQKN NDEATAIREF EYIHQQVPDN IPLTLYLAEA YRHFGHDDRA RLLLEDQLKR HPGDARLERS LAAIPVEVKS VTTVEELLAQ QKACDAAPTL RCRSEVGQNA LRLAQLPVAR AQLNDATFAA SPEGKTLRTD LLQRAIYLKQ WSQADTLYNE ARQQNTLSAA ERRQWFDVLL AGQLDDRILA LQSQGIFTDP QSYITYATAL AYRGEKARLQ HYLIENKPLF TTDAQEKSWL YLLSKYSANP VQALANYTVQ FADNRQYVVG ATLPVLLKEG QYDAAQKLLA TLPANEMLEE RYAVSVATRN KAEALRLARL LYQQEPANLT RLDQLTWQLM QNEQSREAAD LLLQRYPFQG DARVSQTLMA RLASLLESHP YLATPAKVAI LSKPLPLAEQ RQWQSQLPGI ADNCPAIVRL LGDMSPSYDA AAWNRLAKCY RDTLPGVALY AWLQAEQRQP SAWQHRAVAY QAYQVEDYAT ALAAWQKISL HDMSNEDLLA AANTAQAAGN GAARDRWLQQ AEKRGLGSNA LYWWLHAQRY IPGQPELALN DLTRSINIAP SANAYVARAT IYRQRHNVPA AVSDLRAALE LEPNNSNTQA ALGYALWDSG DIAQSREMLE PAHKGLPDDP ALIRQLAYVN QRLDDMPATQ HYARLVIDDI DNQALITPLT PEQNQQRFNF RRLHEEVGRR WTFSFDSSIG LRSGAMSTAN NNVGGAAPGK SYRSYGQLEA EYRIGRNMLL EGDLLSVYSR VFADTGENGV MMPVKNPMSG TGLRWKPLRD QIFFIAVEQQ LPLNGQNGAS DTMLRASASF FNGGKYSDEW HPNGSGWFAQ NLYLDAAQYI RQDIQAWTAD YRVSWHQKVA NGQTIEPYAH VQDNGYRDKG TQGAQLGGVG VRWNIWTGET HYDAWPHKVS LGVEYQHTFK AINQRNGERN NAFLTIGVHW