Gene EcSMS35_3688 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3688 
Symbol 
ID6144574 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3744990 
End bp3747311 
Gene Length2322 bp 
Protein Length773 aa 
Translation table11 
GC content57% 
IMG OID641618515 
Productputative transcriptional accessory protein 
Protein accessionYP_001745655 
Protein GI170683595 
COG category[K] Transcription 
COG ID[COG2183] Transcriptional accessory protein 
TIGRFAM ID[TIGR00426] competence protein ComEA helix-hairpin-helix repeat region 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones46 
Fosmid unclonability p-value0.913461 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGAATG ATTCGTTCTG CCGCATTATT GCGGGTGAAA TTCAGGCGCG CCCGGAACAG 
GTTGACGCTG CCGTTCGCCT GCTTGACGAA GGGAATACCG TGCCGTTTAT CGCACGTTAT
CGTAAGGAAA TCACCGGCGG TCTGGATGAC ACGCAGCTGC GTAATCTGGA AACGCGTCTG
AGCTATCTGC GCGAGCTGGA AGAGAGGCGT CAGGCGATCC TCAAATCCAT TTCCGAGCAA
GGCAAACTCA CCGATGATCT GGCGAAGGCC ATCAACGCCA CCTTAAGCAA AACCGAACTC
GAAGACCTCT ACCTACCCTA CAAACCTAAA CGCCGCACCC GCGGGCAAAT CGCCATTGAA
GCAGGGCTTG AGCCGTTGGC TGACCTGCTG TGGAGCGATC CGTCACACAC GCCAGAAGTC
GCGGCTGCAC AATATGTTGA TGCAGATAAA GGCGTGGCAG ATACCAAAGC CGCGCTGGAC
GGCGCGCGCT ATATCCTGAT GGAACGGTTT GCCGAAGATG CCGCGCTGCT GGCAAAAGTG
CGTGATTATC TGTGGAAGAA CGCGCATTTG GTTTCGACGG TGGTGAGCGG TAAAGAAGAG
GAAGGGGCGA AATTCCGCGA CTATTTCGAT CATCACGAAC CGCTATCCAC GGTGCCTTCT
CACCGCGCGC TGGCGATGTT CCGTGGGCGT AACGAAGGCA TACTTCAGCT TTCGCTGAAT
GCCGATCCGC AGTTCGAAGA GCCGCCGAAA GAGAGCTATT GCGAGCAAAT CATCATGGAT
CACCTTGGCC TGCGCCTGAA CAATGCCCCG GCGGATAGCT GGCGCAAAGG CGTGGTGAGC
TGGACGTGGC GCATCAAGGT GCTGATGCAT CTGGAAACCG AACTGATGGG CACCGTGCGC
GAACGCGCGG AAGATGAAGC AATCAACGTC TTTGCCCGTA ACCTGCACGA TCTGCTGATG
GCGGCCCCAG CCGGACTGCG TGCAACGATG GGCCTCGATC CGGGTCTGCG TACCGGGGTA
AAAGTAGCGG TGGTCGATGC AACTGGCAAA CTGGTGGCGA CCGATACCAT TTATCCACAT
ACCGGACAAG CCGCAAAAGC AGCGATGACC GTTGCTGCCC TGTGTGAAAA GCATAACGTT
GAACTGGTGG CGATCGGCAA CGGTACAGCT TCCCGCGAAA CCGAGCGTTT CTATCTCGAC
GTGCAGAAGC AGTTCCCGAA AGTGACCGCG CAGAAAGTGA TCGTCAGCGA AGCTGGCGCG
TCGGTTTACT CGGCTTCCGA GCTGGCAGCA CAGGAGTTCC CGGATCTCGA CGTTTCGCTG
CGTGGCGCGG TGTCTATAGC CCGCCGTTTG CAGGATCCGC TGGCGGAGCT GGTGAAAATC
GATCCGAAAT CTATCGGCGT AGGTCAGTAT CAGCATGACG TCAGCCAGAC GCAACTGGCC
CGCAAACTGG ACGCAATAGT AGAAGACTGC GTAAACGCCG TTGGCGTCGA TCTCAACACC
GCTTCTGTTC CGCTGTTAAC TCGCGTGGCG GGCCTGACGC GCATGATGGC GCAAAACATC
GTTGCCTGGC GCGATGAGAA CGGTCAGTTC CAGAACCGTC AGCAACTGCT GAAAGTGAGC
CGTCTGGGGC CGAAAGCCTT CGAGCAGTGC GCGGGCTTCT TGCGCATTAA CCATGGTGAT
AACCCGCTGG ACGCGTCTAC CGTCCACCCG GAAGCCTATC CGGTGGTGGA ACGCATTCTG
GCAGCAACAC AGCAGGCACT GAAAGATCTG ATGGGTAACA GCAGCGAGTT GCGTAACCTG
AAAGCGTCTG ACTTTACTGA TGAAAAATTC GGTGTACCGA CGGTAACTGA CATCATCAAA
GAGCTGGAAA AACCGGGTCG CGATCCGCGT CCGGAATTTA AAACCGCTCA GTTTGCCGAT
GGCGTCGAAA CGATGAACGA CCTGCAACCG GGTATGATCC TCGAAGGTGC GGTGACCAAC
GTCACCAACT TTGGCGCGTT TGTCGATATT GGCGTTCATC AGGACGGCCT GGTTCACATC
TCTTCATTGT CGAACAAGTT TGTGGAAGAT CCGCATACCG TGGTGAAAGC GGGCGATATT
GTGAAGGTGA AAGTGCTGGA AGTGGATCTT CAGCGCAAAC GTATCGCCCT GACTATGCGT
CTGGACGAGC AGCCTGGCGA AACCAACGCT CGTCGCGGCG GCGGTAATGA ACGTCCGCAG
AACAACCGCC CGGCAGCCAA ACCGCGTGGG CGTGAAGCGC AGCCTGCCGG TAATAGCGCG
ATGATGGATG CGCTGGCGGC GGCAATGGGC AAAAAACGTT AA
 
Protein sequence
MMNDSFCRII AGEIQARPEQ VDAAVRLLDE GNTVPFIARY RKEITGGLDD TQLRNLETRL 
SYLRELEERR QAILKSISEQ GKLTDDLAKA INATLSKTEL EDLYLPYKPK RRTRGQIAIE
AGLEPLADLL WSDPSHTPEV AAAQYVDADK GVADTKAALD GARYILMERF AEDAALLAKV
RDYLWKNAHL VSTVVSGKEE EGAKFRDYFD HHEPLSTVPS HRALAMFRGR NEGILQLSLN
ADPQFEEPPK ESYCEQIIMD HLGLRLNNAP ADSWRKGVVS WTWRIKVLMH LETELMGTVR
ERAEDEAINV FARNLHDLLM AAPAGLRATM GLDPGLRTGV KVAVVDATGK LVATDTIYPH
TGQAAKAAMT VAALCEKHNV ELVAIGNGTA SRETERFYLD VQKQFPKVTA QKVIVSEAGA
SVYSASELAA QEFPDLDVSL RGAVSIARRL QDPLAELVKI DPKSIGVGQY QHDVSQTQLA
RKLDAIVEDC VNAVGVDLNT ASVPLLTRVA GLTRMMAQNI VAWRDENGQF QNRQQLLKVS
RLGPKAFEQC AGFLRINHGD NPLDASTVHP EAYPVVERIL AATQQALKDL MGNSSELRNL
KASDFTDEKF GVPTVTDIIK ELEKPGRDPR PEFKTAQFAD GVETMNDLQP GMILEGAVTN
VTNFGAFVDI GVHQDGLVHI SSLSNKFVED PHTVVKAGDI VKVKVLEVDL QRKRIALTMR
LDEQPGETNA RRGGGNERPQ NNRPAAKPRG REAQPAGNSA MMDALAAAMG KKR