Gene Information |
Locus tag | EcSMS35_3653 |
Symbol | |
ID | 6146260 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 3712951 |
End bp | 3714255 |
Gene Length | 1305 bp |
Protein Length | 434 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641618480 |
Product | hypothetical protein |
Protein accession | YP_001745620 |
Protein GI | 170680136 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.874459 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.0000157987 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGGATCTGT ATATTCAGAT TATCGTGGTG GCGTGCCTGA CGGGTATGAC ATCGCTTCTG GCGCATCGCT CGGCAGCCGT TTTTCATGAC GGTATCCGCC CGATCCTGCC GCAACTGATT GAAGGCTATA TGAACCGTCG CGAGGCGGGG AGTATCGCTT TTGGTCTGAG CATTGGTTTT GTGGCCTCGG TGGGGATCTC TTTTACCCTG AAAACCGGGC TGCTCAACGC ATGGTTACTC TTTCTTCCTA CCGATATCCT CGGCGTACTG GCGATAAACA GCCTGATGGC GTTTGGTCTT GGCGCTATCT GGGGCGTGTT GATCCTTACT TGCCTGTTGC CGGTAAACCA GCTGCTGACC GCGCTACCGG TGGATGTATT AGGTAGCCTC GGGGAATTAA GCTCGCCGGT GGTTTCAGCT TTTGCACTCT TCCCGCTGGT GGCGATTTTC TACCAGTTTG GCTGGAAGCA AAGTCTGATC GCCGCCGTGG TGGTACTGAT GACCCGCGTG GTCGTCGTGC GCTATTTCCC ACATCTTAAC CCTGAATCCA TCGAAATCTT TATTGGTATG GTGATGCTGC TGGGGATCGC GATAACTCAC GACCTGCGTC ATCGTGATGA AAATGACATT GATGCCAGCG GGCTTTCGGT GTTTGAAGAA CGCACGTCAC GGATTATCAA AAACTTACCG TATATCGCCA TCGTGGGAGC ATTGATTGCC GCCGTTGCCA GCATGAAGAT TTTCGCCGGT AGTGAAGTGT CGATCTTCAC TCTGGAGAAA GCGTACTCCG CAGGCGTAAC GCCGGAACAA TCGCAAACAC TAATCAACCA GGCAGCTCTG GCGGAATTTA TGCGCGGGCT GGGTTTTGTG CCGATGATTG CCACCACCGC GCTAGCCACC GGCGTGTATG CAGTTGCGGG CTTTACCTTT GTTTATGCGG TGGGCTATCT CTCGCCGAAT CCGATGGTTG CAGCGGTATT AGGCGCAGTG GTTATTTCGG CGGAAGTCCT GCTGCTTCGT TCGATCGGCA AATGGCTGGG GCGCTACCCG TCGGTGCGTA ATGCGTCGGA TAACATCCGT AACGCCATGA ATATGCTGAT GGAAGTGGCG TTGCTGGTCG GTTCGATCTT TGCAGCAATC AAAATGGCGG GCTATACCGG ATTCTCTATC GCAGTTGCCA TTTACTTCCT CAACGAATCC CTGGGCCGTC CGGTACAGAA AATGGCGGCA CCGGTCGTGG CAGTAATGAT CACCGGTATT CTGCTGAATG TTCTTTACTG GCTTGGCCTG TTCGTTCCGG CTTAA
|
Protein sequence | MDLYIQIIVV ACLTGMTSLL AHRSAAVFHD GIRPILPQLI EGYMNRREAG SIAFGLSIGF VASVGISFTL KTGLLNAWLL FLPTDILGVL AINSLMAFGL GAIWGVLILT CLLPVNQLLT ALPVDVLGSL GELSSPVVSA FALFPLVAIF YQFGWKQSLI AAVVVLMTRV VVVRYFPHLN PESIEIFIGM VMLLGIAITH DLRHRDENDI DASGLSVFEE RTSRIIKNLP YIAIVGALIA AVASMKIFAG SEVSIFTLEK AYSAGVTPEQ SQTLINQAAL AEFMRGLGFV PMIATTALAT GVYAVAGFTF VYAVGYLSPN PMVAAVLGAV VISAEVLLLR SIGKWLGRYP SVRNASDNIR NAMNMLMEVA LLVGSIFAAI KMAGYTGFSI AVAIYFLNES LGRPVQKMAA PVVAVMITGI LLNVLYWLGL FVPA
|
| |
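The coordinates and lengths in the Gene Information table can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes a single terminal stop codon (the gene sequence above ends in TAA). The function names `span_length` and `expected_protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information table for EcSMS35_3653.
START_BP = 3712951
END_BP = 3714255
GENE_LENGTH_BP = 1305
PROTEIN_LENGTH_AA = 434


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one stop codon, assuming an unspliced CDS."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Compare the derived values with the ones listed in the record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, both comparisons agree with the listed Gene Length and Protein Length.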