Gene EcSMS35_2207 details


Gene Information

Locus tag: EcSMS35_2207
Symbol:
ID: 6144268
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 2220673
End bp: 2222937
Gene Length: 2265 bp
Protein Length: 754 aa
Translation table: 11
GC content: 50%
IMG OID: 641617083
Product: hypothetical protein
Protein accession: YP_001744257
Protein GI: 170683776
COG category: [R] General function prediction only
COG ID: [COG2333] Predicted hydrolase (metallo-beta-lactamase superfamily)
TIGRFAM ID: [TIGR00360] ComEC/Rec2-related protein
            [TIGR00361] DNA internalization-related competence protein ComEC/Rec2
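
The start, end, gene length, and protein length listed above agree with one another if the coordinates are read as 1-based and inclusive and the coding sequence ends in a single stop codon. Both readings are assumptions rather than something the record states. A minimal Python sketch of that arithmetic, with illustrative function names:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded, assuming an unspliced CDS with one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(2220673, 2222937)
print(length_bp)                  # 2265
print(protein_length(length_bp))  # 754
```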


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.01066
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 40
Fosmid unclonability p-value: 0.239269
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAATAA CGACAGTCGG CGTATGCATA ATTTGCGGAA TTTTTCCGTT GCTGATTTTG 
CCCCAATTGC CAGGGACAGT AACCCTTGCG TTTCTGACTC TCTTCGCCTG TGTACTGGCA
TTTATCCCTG TTAAAACCGT CCGTTATATC GCGCTGACGT TGCTGTTTTT CGTTTGGGGC
ATATTAGCAG CAAAGCAAAT TTTGTGGGCA GGAGAAACCT TAACTGGCGC GACGCAGGAT
GCAATAGTTG AGATCACTGC TACTGACGGC ATGACCACTC ATTACGGTCA AATCACTCAT
CTACAAAGTC GACGTATATT CCCTGCGCCA GGCCTCGTAC TGTATGGCGA ATATCTTCCG
CAAGCGGTTT GTGCCGGACA AGTATGGTCA ATGAAACTCA AAGTTCGTGC AGTTCATGGT
CAACTTAATG ATGGCGGCTT TGATAGCCAG CGTTATGCCA TTGCCCAGCA TCAACCGCTC
ACCGGCCGTT TTCTGCAGGC AAGTGTCATT GAACCGAATT GTAGCCTGCG TGCACAGTAT
CTGGCGTCAT TACAAACAAC GCTGCAACCC TATCCGTGGA ATGCGGTTAT TCTTGGTTTA
GGTATGGGGG AACGGTTATC CGTTCCCAAA GAAATCAAAA ATATCATGCG CGATACTGGA
ACGGCGCATT TAATGGCGAT ATCGGGATTG CATATCGCTT TTGCGGCGTT GCTGGCTGCC
GGACTCATTC GCGGTGGGCA AGTTTTTCTG CCTGGGCGCT GGATCCACTG GCAAATGCCA
TTAATTGGCG GAATCTGCTG TGCTGCTTTT TATGCCTGGC TGACTGGGAT GCAACCTCCT
GCATTGCGTA CCGTGGTGGC GCTTGCTACG TGGGGAATGC TTAAGTTAAG TGGGCGACAA
TGGAGTGGCT GGGATGTATG GATATGTTGT CTGGCGGCAA TTTTGCTGAT GGATCCTGTT
GCCATTCTCT CGCAAAGTTT ATGGCTCTCT GCCGCTGCGG TCGCGGCACT GATTTTTTGG
TATCAGTGGT TTCCCGGTCC TGAGTGGCAA CTGCCGCCGG TATTGCGTGC ACTTGTTTCC
CTCATCCATC TGCAACTGGG AATCACACTC CTGCTTATGC CCGTGCAAAT CGTCATATTT
CATGGCATTA GTCTGACCTC GTTTATTGCA AATCTATTTG CAATTCCCCT GGTGACATTT
ATCACGGTTC CGTTGATCCT CGCCGCTATG GTTGTGCATT TAAGCGGGCC GTTAATCCTG
GAAGAGGGAT TATGGTTTCT TGCCGACCGG TCTTTGGCTT TACTTTTCTG GGGGTTAAAG
AGTTTGCCGG AAGGGTGGAT CAACATTGCT GAACGTTGGC AATGGCTATC ATTTTCCCCA
TGGTTCTTAC TGGTGGTATG GCGATTAAAC GCCTGGCGAA CGTTGCCAGC AATGTGTGTG
GCTGGAGGCT TGCTGATGTG CTGGCCGCTG TGGCAAAAAC CTCGACCTGA CGAGTGGCAA
GTGTACATGC TTGATGTCGG GCAAGGGCTG GCAATGGTGA TAGCCAGAAA CGGCAAAGCG
ATTCTCTATG ACACGGGACT GGCCTGGCCT GAAGGGGATA GTGGGCAACA ACTGATTATC
CCCTGGCTCC ACTGGCATAA TCTTGAACCG GAAGGCGTTA TTCTGAGTCA TGAACATCTG
GATCACCGGG GAGGGCTGGA CTCAATATTG CATACATGGC CGATGTTATG GATCAGAAGT
CCGTTAAACT GGGAACACCA TCAGCCCTGT GTGCGTGGCG AAGCGTGGCA ATGGCAAGGA
TTGCGTTTCA GCGTGCACTG GCCTTTACAA GCTAGCAACG ATAAAGGAAA TAACCATTCC
TGTGTGGTTA AGGTTGATGA CGGGACGAAT AGCATTCTTC TAACCGGTGA TATTGAAGCC
CCCGCTGAAC AAAAGATGCT AAGCCGTTAC TGGCAGCAAG TGCAGGCAAC ATTGCTTCAG
GTACCTCACC ATGGCAGTAA TACCTCATCA TCGTTGCCAT TAATTCAGCG AGTGAATGGA
AAAGTGGCAC TCGCATCGGC ATCGCGCTAT AACGCATGGC GACTGCCCTC AAGCAAAGTT
AAACATCGCT ATCAACAACA GGGTTATAAG TGGCTTGATA CTCCACATCA GGGTCAAATA
ACGGTCGATT TTTCAGCGCA AGGCTGGCGG ATTAGCAGCC TCAGAGAGCA AATTTTACCT
CGTTGGTATC ATCAGTGGTT TGGCGTGCCA GTGGATAACG GGTAG
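
The gene sequence is laid out in blocks of 10 bases, 60 bases per line. A minimal sketch of joining those blocks and computing a GC fraction to compare against the GC content field, assuming that whitespace is layout only. The helper names are hypothetical, and the record does not say how its 50% figure was computed or rounded. Only the first line of the sequence is used as sample input here; paste the full sequence to check the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGAAAATAA CGACAGTCGG CGTATGCATA ATTTGCGGAA TTTTTCCGTT GCTGATTTTG"
dna = clean_sequence(first_line)
print(len(dna))  # 60
print(f"GC fraction of this line: {gc_fraction(dna):.3f}")
```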
 
Protein sequence
MKITTVGVCI ICGIFPLLIL PQLPGTVTLA FLTLFACVLA FIPVKTVRYI ALTLLFFVWG 
ILAAKQILWA GETLTGATQD AIVEITATDG MTTHYGQITH LQSRRIFPAP GLVLYGEYLP
QAVCAGQVWS MKLKVRAVHG QLNDGGFDSQ RYAIAQHQPL TGRFLQASVI EPNCSLRAQY
LASLQTTLQP YPWNAVILGL GMGERLSVPK EIKNIMRDTG TAHLMAISGL HIAFAALLAA
GLIRGGQVFL PGRWIHWQMP LIGGICCAAF YAWLTGMQPP ALRTVVALAT WGMLKLSGRQ
WSGWDVWICC LAAILLMDPV AILSQSLWLS AAAVAALIFW YQWFPGPEWQ LPPVLRALVS
LIHLQLGITL LLMPVQIVIF HGISLTSFIA NLFAIPLVTF ITVPLILAAM VVHLSGPLIL
EEGLWFLADR SLALLFWGLK SLPEGWINIA ERWQWLSFSP WFLLVVWRLN AWRTLPAMCV
AGGLLMCWPL WQKPRPDEWQ VYMLDVGQGL AMVIARNGKA ILYDTGLAWP EGDSGQQLII
PWLHWHNLEP EGVILSHEHL DHRGGLDSIL HTWPMLWIRS PLNWEHHQPC VRGEAWQWQG
LRFSVHWPLQ ASNDKGNNHS CVVKVDDGTN SILLTGDIEA PAEQKMLSRY WQQVQATLLQ
VPHHGSNTSS SLPLIQRVNG KVALASASRY NAWRLPSSKV KHRYQQQGYK WLDTPHQGQI
TVDFSAQGWR ISSLREQILP RWYHQWFGVP VDNG
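
The protein sequence can be checked against the gene sequence by translating codon by codon. The sketch below is an illustration, not the database's own procedure. It assumes the standard amino-acid assignments, which translation table 11 shares with the standard code (the two differ in which codons may act as start codons; this gene starts with ATG). All names are hypothetical, and only the first and last lines of the gene sequence are used as input.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order (first, second, third base); '*' marks a stop.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(block_text: str) -> str:
    """Translate a block-formatted DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(block_text.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last lines of the gene sequence; both begin on a codon boundary
# because every full line holds 60 bases.
first_line = "ATGAAAATAA CGACAGTCGG CGTATGCATA ATTTGCGGAA TTTTTCCGTT GCTGATTTTG"
last_line = "CGTTGGTATC ATCAGTGGTT TGGCGTGCCA GTGGATAACG GGTAG"

print(translate(first_line))  # MKITTVGVCIICGIFPLLIL
print(translate(last_line))   # RWYHQWFGVPVDNG
```

The two outputs match the first 20 and last 14 residues of the protein sequence listed above.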