Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2207 |
Symbol | |
ID | 6144268 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2220673 |
End bp | 2222937 |
Gene Length | 2265 bp |
Protein Length | 754 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641617083 |
Product | hypothetical protein |
Protein accession | YP_001744257 |
Protein GI | 170683776 |
COG category | [R] General function prediction only |
COG ID | [COG2333] Predicted hydrolase (metallo-beta-lactamase superfamily) |
TIGRFAM ID | [TIGR00360] ComEC/Rec2-related protein [TIGR00361] DNA internalization-related competence protein ComEC/Rec2 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.01066 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 0.239269 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAATAA CGACAGTCGG CGTATGCATA ATTTGCGGAA TTTTTCCGTT GCTGATTTTG CCCCAATTGC CAGGGACAGT AACCCTTGCG TTTCTGACTC TCTTCGCCTG TGTACTGGCA TTTATCCCTG TTAAAACCGT CCGTTATATC GCGCTGACGT TGCTGTTTTT CGTTTGGGGC ATATTAGCAG CAAAGCAAAT TTTGTGGGCA GGAGAAACCT TAACTGGCGC GACGCAGGAT GCAATAGTTG AGATCACTGC TACTGACGGC ATGACCACTC ATTACGGTCA AATCACTCAT CTACAAAGTC GACGTATATT CCCTGCGCCA GGCCTCGTAC TGTATGGCGA ATATCTTCCG CAAGCGGTTT GTGCCGGACA AGTATGGTCA ATGAAACTCA AAGTTCGTGC AGTTCATGGT CAACTTAATG ATGGCGGCTT TGATAGCCAG CGTTATGCCA TTGCCCAGCA TCAACCGCTC ACCGGCCGTT TTCTGCAGGC AAGTGTCATT GAACCGAATT GTAGCCTGCG TGCACAGTAT CTGGCGTCAT TACAAACAAC GCTGCAACCC TATCCGTGGA ATGCGGTTAT TCTTGGTTTA GGTATGGGGG AACGGTTATC CGTTCCCAAA GAAATCAAAA ATATCATGCG CGATACTGGA ACGGCGCATT TAATGGCGAT ATCGGGATTG CATATCGCTT TTGCGGCGTT GCTGGCTGCC GGACTCATTC GCGGTGGGCA AGTTTTTCTG CCTGGGCGCT GGATCCACTG GCAAATGCCA TTAATTGGCG GAATCTGCTG TGCTGCTTTT TATGCCTGGC TGACTGGGAT GCAACCTCCT GCATTGCGTA CCGTGGTGGC GCTTGCTACG TGGGGAATGC TTAAGTTAAG TGGGCGACAA TGGAGTGGCT GGGATGTATG GATATGTTGT CTGGCGGCAA TTTTGCTGAT GGATCCTGTT GCCATTCTCT CGCAAAGTTT ATGGCTCTCT GCCGCTGCGG TCGCGGCACT GATTTTTTGG TATCAGTGGT TTCCCGGTCC TGAGTGGCAA CTGCCGCCGG TATTGCGTGC ACTTGTTTCC CTCATCCATC TGCAACTGGG AATCACACTC CTGCTTATGC CCGTGCAAAT CGTCATATTT CATGGCATTA GTCTGACCTC GTTTATTGCA AATCTATTTG CAATTCCCCT GGTGACATTT ATCACGGTTC CGTTGATCCT CGCCGCTATG GTTGTGCATT TAAGCGGGCC GTTAATCCTG GAAGAGGGAT TATGGTTTCT TGCCGACCGG TCTTTGGCTT TACTTTTCTG GGGGTTAAAG AGTTTGCCGG AAGGGTGGAT CAACATTGCT GAACGTTGGC AATGGCTATC ATTTTCCCCA TGGTTCTTAC TGGTGGTATG GCGATTAAAC GCCTGGCGAA CGTTGCCAGC AATGTGTGTG GCTGGAGGCT TGCTGATGTG CTGGCCGCTG TGGCAAAAAC CTCGACCTGA CGAGTGGCAA GTGTACATGC TTGATGTCGG GCAAGGGCTG GCAATGGTGA TAGCCAGAAA CGGCAAAGCG ATTCTCTATG ACACGGGACT GGCCTGGCCT GAAGGGGATA GTGGGCAACA ACTGATTATC CCCTGGCTCC ACTGGCATAA TCTTGAACCG GAAGGCGTTA TTCTGAGTCA TGAACATCTG GATCACCGGG GAGGGCTGGA CTCAATATTG CATACATGGC CGATGTTATG GATCAGAAGT CCGTTAAACT GGGAACACCA TCAGCCCTGT GTGCGTGGCG AAGCGTGGCA ATGGCAAGGA TTGCGTTTCA GCGTGCACTG GCCTTTACAA GCTAGCAACG ATAAAGGAAA TAACCATTCC TGTGTGGTTA AGGTTGATGA CGGGACGAAT AGCATTCTTC TAACCGGTGA TATTGAAGCC CCCGCTGAAC AAAAGATGCT AAGCCGTTAC TGGCAGCAAG TGCAGGCAAC ATTGCTTCAG GTACCTCACC ATGGCAGTAA TACCTCATCA TCGTTGCCAT TAATTCAGCG AGTGAATGGA AAAGTGGCAC TCGCATCGGC ATCGCGCTAT AACGCATGGC GACTGCCCTC AAGCAAAGTT AAACATCGCT ATCAACAACA GGGTTATAAG TGGCTTGATA CTCCACATCA GGGTCAAATA ACGGTCGATT TTTCAGCGCA AGGCTGGCGG ATTAGCAGCC TCAGAGAGCA AATTTTACCT CGTTGGTATC ATCAGTGGTT TGGCGTGCCA GTGGATAACG GGTAG
|
Protein sequence | MKITTVGVCI ICGIFPLLIL PQLPGTVTLA FLTLFACVLA FIPVKTVRYI ALTLLFFVWG ILAAKQILWA GETLTGATQD AIVEITATDG MTTHYGQITH LQSRRIFPAP GLVLYGEYLP QAVCAGQVWS MKLKVRAVHG QLNDGGFDSQ RYAIAQHQPL TGRFLQASVI EPNCSLRAQY LASLQTTLQP YPWNAVILGL GMGERLSVPK EIKNIMRDTG TAHLMAISGL HIAFAALLAA GLIRGGQVFL PGRWIHWQMP LIGGICCAAF YAWLTGMQPP ALRTVVALAT WGMLKLSGRQ WSGWDVWICC LAAILLMDPV AILSQSLWLS AAAVAALIFW YQWFPGPEWQ LPPVLRALVS LIHLQLGITL LLMPVQIVIF HGISLTSFIA NLFAIPLVTF ITVPLILAAM VVHLSGPLIL EEGLWFLADR SLALLFWGLK SLPEGWINIA ERWQWLSFSP WFLLVVWRLN AWRTLPAMCV AGGLLMCWPL WQKPRPDEWQ VYMLDVGQGL AMVIARNGKA ILYDTGLAWP EGDSGQQLII PWLHWHNLEP EGVILSHEHL DHRGGLDSIL HTWPMLWIRS PLNWEHHQPC VRGEAWQWQG LRFSVHWPLQ ASNDKGNNHS CVVKVDDGTN SILLTGDIEA PAEQKMLSRY WQQVQATLLQ VPHHGSNTSS SLPLIQRVNG KVALASASRY NAWRLPSSKV KHRYQQQGYK WLDTPHQGQI TVDFSAQGWR ISSLREQILP RWYHQWFGVP VDNG
|
| |