Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4744 |
Symbol | |
ID | 6145960 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 4843468 |
End bp | 4844970 |
Gene Length | 1503 bp |
Protein Length | 500 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641619559 |
Product | hypothetical protein |
Protein accession | YP_001746667 |
Protein GI | 170681088 |
COG category | [R] General function prediction only |
COG ID | [COG0433] Predicted ATPase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0318785 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 48 |
Fosmid unclonability p-value | 0.98435 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGAAC CCCTGTTAAT TGCCCGCACG CCGGACACAG AACTGTTTTT ACTGCCGGGA ATGGCTAACC GTCACGGGCT GATTACTGGC GCAACGGGGA CGGGTAAAAC TGTCACGTTG CAAAAACTGG CAGAGTCATT GTCGGAAATC GGTGTTCCGG TGTTTATGGC CGATGTGAAA GGCGATCTGA CCGGCGTCGA GCAGGCAGGA ACGGCGTCGG AAAAACTGCT CGCAAGGCTT AAAAATATCG GCGTCAATGA CTGGCAACCG CATGCCAATC CGGTGGTGGT GTGGGATATC TTTGGCGAGA AAGGCCATCC GGTGCGGGCG ACGGTTTCGG ATCTGGGGCC GCTGTTGCTG GCACGACTGT TGAATCTCAA CGATGTGCAA TCTGGCGTGC TGAATATCAT TTTCCGCATT GCTGACGATC AGGGATTGTT GCTGCTCGAC TTTAAAGATC TGCGGGCAAT TACCCAGTAC ATCGGCGATA ACGCCAAATC CTTCCAGAAT CAGTACGGAA ATATCAGTAG CGCATCGGTT GGTGCCATCC AGCGCGGATT ACTGTCGCTG GAACAGCAAG GCGCAGCACA CTTCTTTGGT GAGCCGATGC TGGATATCAA AGACTGGATG CGCACCGATG CCAACGGTAA AGGCGTTATC AATATCCTCA GCGCCGAGAA ACTTTATCAG ATGCCGAAAC TGTACGCCGC CAGCCTGCTG TGGATGCTTT CAGAATTGTA TGAACAATTG CCTGAAGCGG GCGATCTGGA GAAACCAAAA CTGGTGTTTT TCTTCGACGA GGCACATCTG CTGTTTAACG ATGCACCGCA GGTACTGCTG GATAAGATTG AGCAGGTGAT AAGGCTTATT CGCTCAAAAG GCGTGGGCGT CTGGTTCGTT TCGCAAAACC CGTCTGATAT TCCGGATAAT GTGCTCGGGC AGCTAGGTAA TCGCGTTCAG CACGCTTTGC GGGCTTTTAC GCCAAAAGAT CAGAAAGCAG TGAAGGCAGC GGCGCAAACC ATGCGGGCCA ATCCGGCATT TGATACCGAA AAGGCAATCC AGGAACTGGG GACCGGCGAG GCGTTAATCT CGTTTCTCGA TGCAAAAGGA AGTCCTTCTG TGGTGGAACG GGCGATGGTG ATCGCACCTT GTTCGCGAAT GGGGCCGGTG ACGGAAGATG AGCGTAATGG CCTGATTAAT CACTCTCCGG TGTATGGCAA ATATGAAGAT GAGGTGGACC GCGAGTCCGC CTATGAGATG CTGCAAAAAG GCTTTCAGGC CAGTACCGAG CAGCAAAATA ATCCCCCCGT GAAAGGTAAA GAGGTGGCGG TGGATGACGG TATTCTTGGT GGATTGAAGG ATATTTTGTT TGGCACTACC GGACCACGCG GCGGGAAGAA AGATGGTGTG GTGCAAACAA TGGCGAAAAG CGCCGCTCGC CAGGTGACGA ATCAGATTGT ACGCGGGATG TTGGGGAGTT TGCTGGGGGG GAGAAAAAGG TAA
|
Protein sequence | MSEPLLIART PDTELFLLPG MANRHGLITG ATGTGKTVTL QKLAESLSEI GVPVFMADVK GDLTGVEQAG TASEKLLARL KNIGVNDWQP HANPVVVWDI FGEKGHPVRA TVSDLGPLLL ARLLNLNDVQ SGVLNIIFRI ADDQGLLLLD FKDLRAITQY IGDNAKSFQN QYGNISSASV GAIQRGLLSL EQQGAAHFFG EPMLDIKDWM RTDANGKGVI NILSAEKLYQ MPKLYAASLL WMLSELYEQL PEAGDLEKPK LVFFFDEAHL LFNDAPQVLL DKIEQVIRLI RSKGVGVWFV SQNPSDIPDN VLGQLGNRVQ HALRAFTPKD QKAVKAAAQT MRANPAFDTE KAIQELGTGE ALISFLDAKG SPSVVERAMV IAPCSRMGPV TEDERNGLIN HSPVYGKYED EVDRESAYEM LQKGFQASTE QQNNPPVKGK EVAVDDGILG GLKDILFGTT GPRGGKKDGV VQTMAKSAAR QVTNQIVRGM LGSLLGGRKR
|
| |