Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_3688 |
Symbol | |
ID | 6144574 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 3744990 |
End bp | 3747311 |
Gene Length | 2322 bp |
Protein Length | 773 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 641618515 |
Product | putative transcriptional accessory protein |
Protein accession | YP_001745655 |
Protein GI | 170683595 |
COG category | [K] Transcription |
COG ID | [COG2183] Transcriptional accessory protein |
TIGRFAM ID | [TIGR00426] competence protein ComEA helix-hairpin-helix repeat region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 0.913461 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGAATG ATTCGTTCTG CCGCATTATT GCGGGTGAAA TTCAGGCGCG CCCGGAACAG GTTGACGCTG CCGTTCGCCT GCTTGACGAA GGGAATACCG TGCCGTTTAT CGCACGTTAT CGTAAGGAAA TCACCGGCGG TCTGGATGAC ACGCAGCTGC GTAATCTGGA AACGCGTCTG AGCTATCTGC GCGAGCTGGA AGAGAGGCGT CAGGCGATCC TCAAATCCAT TTCCGAGCAA GGCAAACTCA CCGATGATCT GGCGAAGGCC ATCAACGCCA CCTTAAGCAA AACCGAACTC GAAGACCTCT ACCTACCCTA CAAACCTAAA CGCCGCACCC GCGGGCAAAT CGCCATTGAA GCAGGGCTTG AGCCGTTGGC TGACCTGCTG TGGAGCGATC CGTCACACAC GCCAGAAGTC GCGGCTGCAC AATATGTTGA TGCAGATAAA GGCGTGGCAG ATACCAAAGC CGCGCTGGAC GGCGCGCGCT ATATCCTGAT GGAACGGTTT GCCGAAGATG CCGCGCTGCT GGCAAAAGTG CGTGATTATC TGTGGAAGAA CGCGCATTTG GTTTCGACGG TGGTGAGCGG TAAAGAAGAG GAAGGGGCGA AATTCCGCGA CTATTTCGAT CATCACGAAC CGCTATCCAC GGTGCCTTCT CACCGCGCGC TGGCGATGTT CCGTGGGCGT AACGAAGGCA TACTTCAGCT TTCGCTGAAT GCCGATCCGC AGTTCGAAGA GCCGCCGAAA GAGAGCTATT GCGAGCAAAT CATCATGGAT CACCTTGGCC TGCGCCTGAA CAATGCCCCG GCGGATAGCT GGCGCAAAGG CGTGGTGAGC TGGACGTGGC GCATCAAGGT GCTGATGCAT CTGGAAACCG AACTGATGGG CACCGTGCGC GAACGCGCGG AAGATGAAGC AATCAACGTC TTTGCCCGTA ACCTGCACGA TCTGCTGATG GCGGCCCCAG CCGGACTGCG TGCAACGATG GGCCTCGATC CGGGTCTGCG TACCGGGGTA AAAGTAGCGG TGGTCGATGC AACTGGCAAA CTGGTGGCGA CCGATACCAT TTATCCACAT ACCGGACAAG CCGCAAAAGC AGCGATGACC GTTGCTGCCC TGTGTGAAAA GCATAACGTT GAACTGGTGG CGATCGGCAA CGGTACAGCT TCCCGCGAAA CCGAGCGTTT CTATCTCGAC GTGCAGAAGC AGTTCCCGAA AGTGACCGCG CAGAAAGTGA TCGTCAGCGA AGCTGGCGCG TCGGTTTACT CGGCTTCCGA GCTGGCAGCA CAGGAGTTCC CGGATCTCGA CGTTTCGCTG CGTGGCGCGG TGTCTATAGC CCGCCGTTTG CAGGATCCGC TGGCGGAGCT GGTGAAAATC GATCCGAAAT CTATCGGCGT AGGTCAGTAT CAGCATGACG TCAGCCAGAC GCAACTGGCC CGCAAACTGG ACGCAATAGT AGAAGACTGC GTAAACGCCG TTGGCGTCGA TCTCAACACC GCTTCTGTTC CGCTGTTAAC TCGCGTGGCG GGCCTGACGC GCATGATGGC GCAAAACATC GTTGCCTGGC GCGATGAGAA CGGTCAGTTC CAGAACCGTC AGCAACTGCT GAAAGTGAGC CGTCTGGGGC CGAAAGCCTT CGAGCAGTGC GCGGGCTTCT TGCGCATTAA CCATGGTGAT AACCCGCTGG ACGCGTCTAC CGTCCACCCG GAAGCCTATC CGGTGGTGGA ACGCATTCTG GCAGCAACAC AGCAGGCACT GAAAGATCTG ATGGGTAACA GCAGCGAGTT GCGTAACCTG AAAGCGTCTG ACTTTACTGA TGAAAAATTC GGTGTACCGA CGGTAACTGA CATCATCAAA GAGCTGGAAA AACCGGGTCG CGATCCGCGT CCGGAATTTA AAACCGCTCA GTTTGCCGAT GGCGTCGAAA CGATGAACGA CCTGCAACCG GGTATGATCC TCGAAGGTGC GGTGACCAAC GTCACCAACT TTGGCGCGTT TGTCGATATT GGCGTTCATC AGGACGGCCT GGTTCACATC TCTTCATTGT CGAACAAGTT TGTGGAAGAT CCGCATACCG TGGTGAAAGC GGGCGATATT GTGAAGGTGA AAGTGCTGGA AGTGGATCTT CAGCGCAAAC GTATCGCCCT GACTATGCGT CTGGACGAGC AGCCTGGCGA AACCAACGCT CGTCGCGGCG GCGGTAATGA ACGTCCGCAG AACAACCGCC CGGCAGCCAA ACCGCGTGGG CGTGAAGCGC AGCCTGCCGG TAATAGCGCG ATGATGGATG CGCTGGCGGC GGCAATGGGC AAAAAACGTT AA
|
Protein sequence | MMNDSFCRII AGEIQARPEQ VDAAVRLLDE GNTVPFIARY RKEITGGLDD TQLRNLETRL SYLRELEERR QAILKSISEQ GKLTDDLAKA INATLSKTEL EDLYLPYKPK RRTRGQIAIE AGLEPLADLL WSDPSHTPEV AAAQYVDADK GVADTKAALD GARYILMERF AEDAALLAKV RDYLWKNAHL VSTVVSGKEE EGAKFRDYFD HHEPLSTVPS HRALAMFRGR NEGILQLSLN ADPQFEEPPK ESYCEQIIMD HLGLRLNNAP ADSWRKGVVS WTWRIKVLMH LETELMGTVR ERAEDEAINV FARNLHDLLM AAPAGLRATM GLDPGLRTGV KVAVVDATGK LVATDTIYPH TGQAAKAAMT VAALCEKHNV ELVAIGNGTA SRETERFYLD VQKQFPKVTA QKVIVSEAGA SVYSASELAA QEFPDLDVSL RGAVSIARRL QDPLAELVKI DPKSIGVGQY QHDVSQTQLA RKLDAIVEDC VNAVGVDLNT ASVPLLTRVA GLTRMMAQNI VAWRDENGQF QNRQQLLKVS RLGPKAFEQC AGFLRINHGD NPLDASTVHP EAYPVVERIL AATQQALKDL MGNSSELRNL KASDFTDEKF GVPTVTDIIK ELEKPGRDPR PEFKTAQFAD GVETMNDLQP GMILEGAVTN VTNFGAFVDI GVHQDGLVHI SSLSNKFVED PHTVVKAGDI VKVKVLEVDL QRKRIALTMR LDEQPGETNA RRGGGNERPQ NNRPAAKPRG REAQPAGNSA MMDALAAAMG KKR
|
| |