Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2489 |
Symbol | |
ID | 6147095 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 2536010 |
End bp | 2537368 |
Gene Length | 1359 bp |
Protein Length | 452 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 641617361 |
Product | SMC domain-containing protein |
Protein accession | YP_001744533 |
Protein GI | 170684247 |
COG category | [R] General function prediction only |
COG ID | [COG3950] Predicted ATP-binding protein involved in virulence |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00000000889729 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 56 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGAATA TCCGCACGCT TAAGCTCACT AATCTGGGGC GGTTTGAAGA ACTTGAAGTT CATCTGGCTC CAGTGGAGGA GTTCAAGAGC AATGTGACCG TTTTTATTGG TAATAATGGT GCAGGTAAAA CATCAATATT AAAATCGTTG GCAACCAGCC TGAGTTGGTT CGTTGCCCGA GTTCGTACTG AAAAAGGTAA CGGTAGCCCT ATTCCTGAAG ACGCTATTCT GAACGGTAGG AGTTCGGCGA CAATTGAACT TCAGGTACTG AATACGCATC CAGCGACGGA GGCCGCTACG CCCTACCGTT GGTTGCTTGC CAGAACGGCC AGTGGGAAAA AATCGACCAC CGCCTCCAGC CTGCAAGAGG CCAGCCAATT GGCTGCGTTT TATCGAGATC AATACACCCA GAATAGCGGG GCATCCTTCC CGCTTATCGC CTTCTATCCC GTAGAACGTG TCGTGCTGGA TGTGCCGTTG AAAATAAAAG AACGTCATAA TTTTTTGCAA CTGGATGGCT ACGATAACGC CCTGAATCAG GGTATTGATT TCCGCCGTTT CTTTGAGTGG TTTCGCAATC GCGAAGATGC AGAAAATGAA TCGGGCTTAC CCCAAGACGT TCTGGATAAG CTCAGTACCA GGATAGATCT CGATAACACC GTCTTAAATG CATTAACGGC AATCATGGCC TCGTCCCGGG ATCGCCAGTT GACCGCCGTC AGAACGGCCA TTAGTCGCTT TATGCCAGGG TTCAGCAACT TACGCGTCAG GCGTAAACCT CGCCTGCATA TGTCGATTGA TAAAAATGGC CAGACACTGA ATGTGCTGCA ATTATCGCAG GGTGAAAAAT CACTGATGGC GTTAGTCGGC GATATTGCTC GCCGCCTGGC AATGATGAAC CCGATGTTAG AAAACCCGCT AAACGGCGAG GGAATTGTAT TAATTGATGA AGTGGACATG CACCTGCATC CAACATGGCA GCGTACAATC ATCCAGCGTC TGACGACAAC ATTCCCACAT TGCCAGTTTG TCTTAACAAC CCACTCTCCT TTAGTGATCA GTGATTACAA AGATGTGCTG GTTTATTCTC TGGATAATGG CGAATTAACG CAGCTCCCGT CTCTGTATGG GCAAGATGCG AATACTGTGC TTTTGAATGT GATGGATACG GATATTCGCA ATGCGACAGT GGCAGAAAAA CTTAACGATC TTTTGGATCT GATTCAGAAA AACGACTTTA TCAACGCTAA CGCTCTTCTG AATACGCTAA GCCTGGAACT TCCTGAAAAC CATCTTGAAC TGGTGAAAGC CAGAATGCTT CTGCGCAAAC AGGAAATTAA ACATGCGCGA AATAACTAA
|
Protein sequence | MMNIRTLKLT NLGRFEELEV HLAPVEEFKS NVTVFIGNNG AGKTSILKSL ATSLSWFVAR VRTEKGNGSP IPEDAILNGR SSATIELQVL NTHPATEAAT PYRWLLARTA SGKKSTTASS LQEASQLAAF YRDQYTQNSG ASFPLIAFYP VERVVLDVPL KIKERHNFLQ LDGYDNALNQ GIDFRRFFEW FRNREDAENE SGLPQDVLDK LSTRIDLDNT VLNALTAIMA SSRDRQLTAV RTAISRFMPG FSNLRVRRKP RLHMSIDKNG QTLNVLQLSQ GEKSLMALVG DIARRLAMMN PMLENPLNGE GIVLIDEVDM HLHPTWQRTI IQRLTTTFPH CQFVLTTHSP LVISDYKDVL VYSLDNGELT QLPSLYGQDA NTVLLNVMDT DIRNATVAEK LNDLLDLIQK NDFINANALL NTLSLELPEN HLELVKARML LRKQEIKHAR NN
|
| |