Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1920 |
Symbol | |
ID | 6146894 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 1942450 |
End bp | 1943844 |
Gene Length | 1395 bp |
Protein Length | 464 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641616796 |
Product | hypothetical protein |
Protein accession | YP_001743972 |
Protein GI | 170683203 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 0.10975 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAGCCGTT TCGTTCCTCG CATTATTCCG TTTTATTTAC TCTTGCTGGC GGCAGGCGGT ACAGCTAACG CACAATCTAC CTTCGAGCAA AAAGCGGCAA ATCCCTTTGA TAATAACAAT GATGGTCTGC CGGATTTAGG CATGGCACCT GAAAATCATG ATGGGGAAAA ACACTTTGCG GAAATTGTGA AAGATTTCGG CGAAACCAGT ATGAATGATA ACGGGCTGGA TACTGGCGAG CAGGCAAAAG CTTTCGCATT AGGAAAAGTC CGCGACGCGC TTAGTCAACA GGTTAATCAG CACGTAGAGT CCTGGCTATC ACCGTGGGGA AATGCCAGTG TTGACGTCAA AGTGGATAAC GAAGGACATT TCACCGGCAG TCGCGGAAGC TGGTTTGTGC CGTTACAAGA TAATGATCGT TATCTCACCT GGAGCCAGCT TGGTCTTACT CAGCAGGATG ATGGCTTGGT GAGCAATGTG GGCGTAGGGC AACGCTGGGC GCGCGGCAAC TGGCTGGTGG GTTATAACAC TTTTTATGAC AACTTGCTGG ACGAAAATCT TCAGCGAGCG GGCTTTGGTG CCGAAGCGTG GGGCGAATAT TTGCGACTAT CGGCAAACTT TTATCAGCCG TTTGCTGCAT GGCATGAACA GACAGCCACG CAGGAACAGC GGATGGCGCG CGGGTACGAC CTGACAGCCC GGATGCGCAT GCCGTTCTAT CAACACCTCA ATACCAGTGT CAGCGTAGAA CAGTATTTTG GTGATCGTGT CGATTTGTTT AACTCTGGTA CGGGTTATCA CAATCCCGTC GCGTTGAGTC TGGGATTAAA TTACACCCCT GTGCCCTTAG TCACTGTGAC GGCCCGGCAT AAACAGGGTG AAAGTGGCGA GAATCAAAAT AACCTCGGGC TGAATCTTAA CTACCGCTTT GGTGTACCGC TCAAAAAACA ACTTTCTGCG GGCGAAGTTG CCGAAAGTCA GTCGTTACGT GGTAGTCGCT ATGACAATCC GCAGCGAAAT AATCTTCCGA CTCTTGAGTA CCGACAGCGA AAAACCTTAA CGGTGTTTCT GGCGACACCG CCGTGGGATC TAAAACCTGG CGAAACAGTG CCGCTGAAAT TACAAATCCG CAGTCGTTAC GGTATTCGGC AACTGATTTG GCAGGGCGAT ACGCAGATAT TAAGTTTGAC GCCGGGCGCA CAAGCCAACA GTGAGGAGGG CTGGACGCTG ATCATGCCTG ACTGGCAAAA CGGGGAAGGC GCAAGCAATC ACTGGCGATT GTCAGTAGTG GTGGAAGATA ACCAGGGGCA GCGTGTCTCC TCCAATGAGA TCACGCTAAC GCTTGTCGAA CCGTTCGACG CATTGTCAAA CGACGAACTG CGCTGGGAAC CGTAA
|
Protein sequence | MSRFVPRIIP FYLLLLAAGG TANAQSTFEQ KAANPFDNNN DGLPDLGMAP ENHDGEKHFA EIVKDFGETS MNDNGLDTGE QAKAFALGKV RDALSQQVNQ HVESWLSPWG NASVDVKVDN EGHFTGSRGS WFVPLQDNDR YLTWSQLGLT QQDDGLVSNV GVGQRWARGN WLVGYNTFYD NLLDENLQRA GFGAEAWGEY LRLSANFYQP FAAWHEQTAT QEQRMARGYD LTARMRMPFY QHLNTSVSVE QYFGDRVDLF NSGTGYHNPV ALSLGLNYTP VPLVTVTARH KQGESGENQN NLGLNLNYRF GVPLKKQLSA GEVAESQSLR GSRYDNPQRN NLPTLEYRQR KTLTVFLATP PWDLKPGETV PLKLQIRSRY GIRQLIWQGD TQILSLTPGA QANSEEGWTL IMPDWQNGEG ASNHWRLSVV VEDNQGQRVS SNEITLTLVE PFDALSNDEL RWEP
|
| |