Gene Information |
Locus tag | EcSMS35_4696 |
Symbol | |
ID | 6147406 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 4794578 |
End bp | 4795921 |
Gene Length | 1344 bp |
Protein Length | 447 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641619512 |
Product | putative transporter |
Protein accession | YP_001746620 |
Protein GI | 170679891 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
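The coordinate and length fields above are consistent with one another, and the check can be sketched in a few lines. This is only an illustration: the function names are hypothetical, and it assumes that the start and end positions are 1-based and inclusive and that the gene length includes a single terminal stop codon (both assumptions agree with the values in this record, but the record does not state them).

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (assumption)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming its length includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp rows of this record.
length = gene_length_bp(4794578, 4795921)
print(length, protein_length_aa(length))
```

With the values from this record, the sketch gives 1344 bp and 447 aa, matching the Gene Length and Protein Length rows.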
| |
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 64 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTAAACA GTATTTTAGT CATACTCTGT TTGATCGCTG TAAGTGCGTT CTTCTCGATG TCCGAGATCT CGCTTGCCGC CTCACGCAAA ATCAAACTTA AACTGCTGGC TGATGAAGGC AATATAAATG CCCAACGCGT TCTGAATATG CAGGAAAATC CCGGCATGTT CTTTACCGTG GTCCAAATCG GTCTGAACGC AGTGGCGATT CTCGGCGGTA TCGTCGGTGA TGCGGCATTT TCTCCAGCTT TTCACAGCCT GTTCTCCCGC TATATGTCGG CAGAACTCTC TGAGCAACTG AGCTTTATTC TCTCTTTCTC GTTAGTGACT GGCATGTTTA TCCTGTTTGC GGATTTAACC CCGAAACGCA TCGGTATGAT TGCGCCAGAA GCTGTGGCTT TGCGTATCAT CAACCCGATG CGCTTCTGCC TGTACGTTTG CACCCCGCTG GTGTGGTTCT TCAACGGACT GGCGAACATA ATCTTCCGTA TTTTCAAACT GCCAATGGTA CGTAAAGATG ACATCACTTC TGATGACATC TACGCGGTAG TGGAAGCCGG TGCGCTGGCA GGCGTGTTAC GTAAACAGGA ACACGAGCTG ATTGAAAACG TCTTTGAGCT GGAATCCCGT ACCGTTCCGT CTTCAATGAC ACCGCGTGAA AACGTGATTT GGTTTGATCT CCACGAAGAT GAGCAAAGTC TGAAGAATAA GGTGGCGGAA CATCCGCACT CTAAGTTTCT CGTCTGTAAT GAAGATATTG ACCACATCAT CGGATATGTC GATTCTAAAG ACCTGCTGAA CCGCGTGCTG GCTAACCAAA GCCTGGCACT GAACAGCGGC GTACAAATTC GCAACACGCT GATTGTGCCG GATACGTTAA CCCTTTCAGA AGCGTTGGAA AGTTTTAAAA CCGCAGGTGA AGACTTCGCG GTGATCATGA ACGAGTACGC GCTGGTAGTG GGGATCATCA CCCTCAACGA CGTGATGACC ACGCTGATGG GCGATCTGGT CGGTCAGGGG CTGGAAGAGC AGATTGTCGC CCGTGATGAG AACTCATGGC TGATTGACGG CGGCACCCCG ATTGACGACG TCATGCGCGT GCTGGATATT GACGAGTTCC CGCAGTCGGG CAACTACGAA ACCATCGGCG GCTTTATGAT GTTTATGCTG CGTAAGATCC CGAAACGCAC CGATTCGGTG AAATTCGCCG GCTACAAATT TGAAGTGGTG GATATCGATA ACTACCGTAT CGACCAACTG CTGGTGACCC GGATCGACAG CAAGGCCACC GCCCTTTCGC CAAAACTGCC TGACGCTAAA GATAAAGAAG AAAGCGTCGC GTAA
Protein sequence | MLNSILVILC LIAVSAFFSM SEISLAASRK IKLKLLADEG NINAQRVLNM QENPGMFFTV VQIGLNAVAI LGGIVGDAAF SPAFHSLFSR YMSAELSEQL SFILSFSLVT GMFILFADLT PKRIGMIAPE AVALRIINPM RFCLYVCTPL VWFFNGLANI IFRIFKLPMV RKDDITSDDI YAVVEAGALA GVLRKQEHEL IENVFELESR TVPSSMTPRE NVIWFDLHED EQSLKNKVAE HPHSKFLVCN EDIDHIIGYV DSKDLLNRVL ANQSLALNSG VQIRNTLIVP DTLTLSEALE SFKTAGEDFA VIMNEYALVV GIITLNDVMT TLMGDLVGQG LEEQIVARDE NSWLIDGGTP IDDVMRVLDI DEFPQSGNYE TIGGFMMFML RKIPKRTDSV KFAGYKFEVV DIDNYRIDQL LVTRIDSKAT ALSPKLPDAK DKEESVA
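The gene sequence as displayed starts with ATG and ends with TAA, so it appears to be given in coding orientation even though the gene lies on the minus strand. Under that assumption, the sketch below shows one way to strip the 10-base grouping, compute a GC fraction, and translate the sequence. The helper names are hypothetical, and the codon table is the standard amino acid assignment, which translation table 11 is understood to share (table 11 differs in its permitted start codons, which this sketch does not model).

```python
BASES = "TCAG"
# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence in this record.
prefix = "ATGTTAAACA GTATTTTAGT CATACTCTGT"
print(translate(prefix))
```

For the first 30 bases of the gene sequence, this yields MLNSILVILC, the first ten residues of the protein sequence listed in this record. Passing the full gene sequence to the same helpers would give the complete translation and the overall GC fraction.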