Gene Information |
Locus tag | EcSMS35_0667 |
Symbol | |
ID | 6144160 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 678020 |
End bp | 679447 |
Gene Length | 1428 bp |
Protein Length | 475 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 641615557 |
Product | DnaJ domain-containing protein |
Protein accession | YP_001742763 |
Protein GI | 170681689 |
COG category | |
COG ID | |
TIGRFAM ID | |
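The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The helper names `span_length` and `coding_codons` are hypothetical.

```python
# Values copied from the Gene Information table above.
start_bp = 678020
end_bp = 679447
gene_length_bp = 1428
protein_length_aa = 475


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = coding_codons(computed_length)

# Both comparisons hold for this record under the stated assumptions.
print(computed_length == gene_length_bp)       # True
print(computed_protein == protein_length_aa)   # True
```

Under these assumptions, 679447 - 678020 + 1 gives 1428 bp, and 1428 / 3 - 1 gives 475 aa, matching the listed Gene Length and Protein Length.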
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.910417 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 71 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAATT GCTGGAAGAT TCTCGAAATA GAGGAAACGA CTGACGTCGA TATTATCCGC CGCGCTTATC TGGCGCTGTT ACCGTCCTTT CATCCAGAAA CCGATCCGCA GGGTTTTAAA CAACTTCGTC AGGCGTATGA GGACGCGCTA CGGATTGCGC AGTCGCCTGC TAAATCTGTT TGGCAACCAG AAGAACATGA AGTTGCAGAA CATGAAATTC TGCTCGCCTT TCGTGCGTTA CTCGCCTCTG ATAGCGAACG TTTTCTGCCC TCTGCCTGGC AGCGATTCAT TCAGCAATTA AATTATTGCT CGATGGAAGA TATTGATGAA TTACGCTGGT CGCTGTGCAC AATAGCCATG AACACTGCCC ATTTATCCTT CGAGTGCGTG GTGTTATTAG CAGAAAGATT GCGGTGGTTG CAGGAGGAAA ACGTCGGGGA AATAGACGAA GAAGAACTGG AATCCTTTTT ATATGCCATT GCGAAGGGGA ATGTTTTTAA CTTCCAGACC ATTCTGCATC TGCCCGTTGC CGTACAAAAT GACACCATTG ATTTTTACCA AATGTTCGCT CGGATTTGGT CATCGCATCC AGAATGGCTG ACATTGTATT TAGCGCAACA TCGCGCAGTG ATTATCCCCG ATGATGCAAA TTTGCACAGA AATTTACTCC GCTGGTATAG CACAAGCCGT CTGGATATCC CCGAACTTCT GGATTACGCC CGGTCGTGGC GGGAAGCTGA ACCTGATAAT GAAGATGCGC GTTATTATGA ATACGCGCAA CGCGTCTATT GTGGAGAAGG CGAAAGCCTG CTGGCAGAAC TTTGTGACTA CTGGCGCGAG TATCCCTCCA CCCAGGCGGA TGCTTTAATG TTGCAATGGT GCCGTCAGCA TCGGGTCGAT TATTACCCGT TAGTGGTGAT GATGATTGAA GCGCGTGATC TGGTTAACGA TCAGGGAAAA CCGCTACTTT ATGTCCCTGG TGACAGCGCT CGTACGCGTT TTCATTTATA CGAAATACTC AGTGATGAAA AACTCTCTGC GCTGGGGCGT TCTCTGGTCG AGATGGTTTT GCACAAAGGA CGTAAGCCGC GGATCTCACT CACGCGTGAT ACAGAACATC CCTTATGGCC ATTATATTTA GTTGCTAAAC AATTAGTTCA GGCCAGCCAA CCGACAGAAG AATCATTAAT GCCGATCGTC AGCCGCCTTG ATGCAGAAGA TCGTTGTCCA CTGGAAGCAT TAATGATTCG TCGATTATTA ATTCAGGCGG CAAATTTTAC CGAGAAGCAA ACCGTCGAAC CGGAGCCGCA ACCGCAGTCG ATGCCCGTTG ACGATGGTGG GCCGGGCTGC CTGGGCGTCA TCAAAATTAT TTTCTACATT TTTATCTTTG CCGGTTTGAT AGGGAAAATA CTCCATCTGT TCGGGTGA
|
Protein sequence | MKNCWKILEI EETTDVDIIR RAYLALLPSF HPETDPQGFK QLRQAYEDAL RIAQSPAKSV WQPEEHEVAE HEILLAFRAL LASDSERFLP SAWQRFIQQL NYCSMEDIDE LRWSLCTIAM NTAHLSFECV VLLAERLRWL QEENVGEIDE EELESFLYAI AKGNVFNFQT ILHLPVAVQN DTIDFYQMFA RIWSSHPEWL TLYLAQHRAV IIPDDANLHR NLLRWYSTSR LDIPELLDYA RSWREAEPDN EDARYYEYAQ RVYCGEGESL LAELCDYWRE YPSTQADALM LQWCRQHRVD YYPLVVMMIE ARDLVNDQGK PLLYVPGDSA RTRFHLYEIL SDEKLSALGR SLVEMVLHKG RKPRISLTRD TEHPLWPLYL VAKQLVQASQ PTEESLMPIV SRLDAEDRCP LEALMIRRLL IQAANFTEKQ TVEPEPQPQS MPVDDGGPGC LGVIKIIFYI FIFAGLIGKI LHLFG
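The relationship between the gene sequence and the protein sequence above can be sketched in a few lines of Python. This is an illustrative sketch only, not the tool used to produce the record. It assumes the space-separated 10-character blocks are purely a display convention, and it uses the codon assignments of translation table 11, which for elongation codons match the standard genetic code. The function names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; first base varies slowest).
# Table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the whitespace between display blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base in frame 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(dna.count(base) for base in "GC") / len(dna)


# First three display blocks of the gene sequence above.
print(translate("ATGAAAAATT GCTGGAAGAT TCTCGAAATA"))  # MKNCWKILEI
```

Passing the full Gene sequence value to `translate` and `gc_percent` is one way to compare the result against the listed Protein sequence and GC content fields.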