Gene EcE24377A_1011 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcE24377A_1011 
Symbol 
ID5587004 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli E24377A 
KingdomBacteria 
Replicon accessionNC_009801 
Strand
Start bp1030233 
End bp1032497 
Gene Length2265 bp 
Protein Length754 aa 
Translation table11 
GC content50% 
IMG OID640924717 
Producthypothetical protein 
Protein accessionYP_001462131 
Protein GI157155375 
COG category[R] General function prediction only 
COG ID[COG2333] Predicted hydrolase (metallo-beta-lactamase superfamily) 
TIGRFAM ID[TIGR00360] ComEC/Rec2-related protein
[TIGR00361] DNA internalization-related competence protein ComEC/Rec2 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.00460048 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAATAA CGACAGTCGG CGTATGCATA ATTTGCGGAA TTTTTCCGTT GCTGATTTTG 
CCCCAATTGC CTGGGACATT AACCCTTGCG TTTCTGACTC TCTTCGCCTG CGTACTGGCA
TTTATCCCTG TTAAAACCGT CCGTTATATC GCGCTGACGT TGCTGTTTTT CGTTTGGGGC
ATATTATCAG CAAAGCAAAT TTTGTGGGCA GGAGAAACCT TAACTGGCGC GACGCAGGAT
GCAATTGTTG AGATCACTGC AACTGACGGC ATGACCACTC ATTACGGTCA AATTACTCAT
CTACAAGGTC GACGTATATT CCCTGCGCCA GGCCTTGTGC TGTATGGCGA ATATCTTCCG
CAAGCGGTTT GTGCTGGACA ACAATGGTCA ATGAAACTCA AAGTTCGTGC AGTTCATGGC
CAACTTAATG ATGGCGGCTT TGATAGCCAG CGTTATGCCA TTGCCCAGCA TCAGCCGCTC
ACCGGCCGCT TTCTGCAGGC AAGTGTCATT GAACCGAATT GTAGCCTGCG TGCACAGTAT
CTGGCGTCAC TACAAACAAC GCTGCAACCC TATATGTGGA ATGCGGTTAT TCTTGGTTTA
GGTATGGGGG AACGGTTATC CGTCCCCAAA GAAATCAAAA ATATCATGCG TGATACTGGA
ACGGCGCATT TAATGGCGAT ATCGGGATTG CACATCGCTT TTGCGGCGTT GTTGGCTGCC
GGACTCATTC GCGGTGGGCA AGTTTATCTG CCTGGGCGCT GGATCCACTG GCAAATGCCA
TTAATTGGTG GAATCTGCTG TGCTGCTTTT TATGCCTGGT TGACGGGAAT GCAACCTCCT
GCATTGCGTA CCGTGGTGGC GCTTGCTACG TGGGGAATGC TTAAGTTAAG TGGGCGACAG
TGGAGTGGCT GGGATGTATG GATATGTTGT CTGGCGGCAA TTTTGCTGAT GGATCCTGTT
GCCATTCTCT CGCAAAGTTT ATGGCTCTCT GCCGCTGCGG TCGCGGCACT GATTTTTTGG
TACCAGTGGT TTCCCTGTCC TGAGTGGCAA CTGCCGCCGG TATTGCGTGC AGTTGTTTCC
CTCATTCATT TGCAACTGGG AATCACACTC CTGCTTATGC CTGTGCAAAT CGTCATATTT
CATGGCATTA GTCTGACCTC GTTTATTGCA AATCTATTAG CAATTCCCTT GGTGACATTT
ATCACGGTTC CGTTGATCCT CGCCGCGATG GTTGTGCATT TAAGCGGGCC GTTAATCCTG
GAGCAAGGGT TATGGTTTCT TGCCGACCGG TCTTTGGCTT TACTTTTCTG GGGGTTAAAG
AGTTTGCCAG AAGGGTGGAT CAACATTGCT GAACGTTGGC AATGGCTATC ATTTTCCCCA
TGGTTCTTAC TGGTGGTATG GCGATTAAAC GCCTGGCGAA CGTTGCCAGC AATGTGTGTG
GCTGTAGGCT TGCTGATGTG CTGGCCGCTG TGGCAAAAAC CTCGACCTGA CGAGTGGCAG
GTGTACATGC TTGATGTCGG GCAAGGGCTG GCAATGGTGA TAGCCAGAAA CGGCAAAGCG
ATTCTCTATG ACACAGGACT GGCCTGGCCC GAAGGGGATA GTGGGCAACA ACTGATTATC
CCCTGGCTCC ACTGGCATAA TCTTGAACCG GAAGGCGTTA TTCTGAGTCA TGAACATCTG
GATCACCGGG GAGGGCTGGA CTCAATATTG CACACATGGC CGATGTTATG GATCAGAAGT
CCGTTAAACT GGGAACATCA TCAGCCCTGT GTGCGTGGCG AAGCGTGGCA ATGGCAAGGA
TTGCGTTTCA GCGTGCACTG GCCTTTACAA GCTAGCAACG ATAAAGGAAA TAACCATTCC
TGTGTGGTTA AGGTTGATGA CGGGACGAAT AGCATTCTTC TAACCGGTGA TATTGAAGTC
CCCGCTGAAC AAAAGATGCT AAGCCGTTAC TGGCAGCAAG TGCAGACAAC ATTGCTTCAG
GTACCTCACC ATGGCAGTAA TACCTCATCA TCGTTGCCAT TAATTCAGCG AGTGAATGGA
AAAGTGGCAC TCGCATCGGC ATCGCGCTAT AACGCATGGC GATTGCCCTC TAATAAAGTT
AAGCATCGCT ATCAACAGCA AGGATATCAA TGGCTTGATA CTCCACATCA GGGTCAAGTG
ACGGTCAATT TTTCAGCGCA AGGCTGGCGG ATTAGCAGCC TCAGAGAGCA AATTTTACCT
CGTTGGTATC ATCAGTGGTT TGGCGTGCCA GTGGATAACG GGTAG
 
Protein sequence
MKITTVGVCI ICGIFPLLIL PQLPGTLTLA FLTLFACVLA FIPVKTVRYI ALTLLFFVWG 
ILSAKQILWA GETLTGATQD AIVEITATDG MTTHYGQITH LQGRRIFPAP GLVLYGEYLP
QAVCAGQQWS MKLKVRAVHG QLNDGGFDSQ RYAIAQHQPL TGRFLQASVI EPNCSLRAQY
LASLQTTLQP YMWNAVILGL GMGERLSVPK EIKNIMRDTG TAHLMAISGL HIAFAALLAA
GLIRGGQVYL PGRWIHWQMP LIGGICCAAF YAWLTGMQPP ALRTVVALAT WGMLKLSGRQ
WSGWDVWICC LAAILLMDPV AILSQSLWLS AAAVAALIFW YQWFPCPEWQ LPPVLRAVVS
LIHLQLGITL LLMPVQIVIF HGISLTSFIA NLLAIPLVTF ITVPLILAAM VVHLSGPLIL
EQGLWFLADR SLALLFWGLK SLPEGWINIA ERWQWLSFSP WFLLVVWRLN AWRTLPAMCV
AVGLLMCWPL WQKPRPDEWQ VYMLDVGQGL AMVIARNGKA ILYDTGLAWP EGDSGQQLII
PWLHWHNLEP EGVILSHEHL DHRGGLDSIL HTWPMLWIRS PLNWEHHQPC VRGEAWQWQG
LRFSVHWPLQ ASNDKGNNHS CVVKVDDGTN SILLTGDIEV PAEQKMLSRY WQQVQTTLLQ
VPHHGSNTSS SLPLIQRVNG KVALASASRY NAWRLPSNKV KHRYQQQGYQ WLDTPHQGQV
TVNFSAQGWR ISSLREQILP RWYHQWFGVP VDNG