Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1770 |
Symbol | |
ID | 6143987 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 1783840 |
End bp | 1786479 |
Gene Length | 2640 bp |
Protein Length | 879 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641616646 |
Product | hypothetical protein |
Protein accession | YP_001743824 |
Protein GI | 170681090 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 0.930224 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGGGTA AATATAAAGC CGTTCTCGCG CTGTTATTAC TGATTATTCT TGTGCCGTTG ACGCTGCTGA TGACGCTCGG GCTGTGGGTT CCTACGCTGG CGGGCATCTG GCTACCGCTC GGGACACGTA TTGCATTAGA TGAAAGCCCA CGCATTACGC GTAAAGGTTT AATCATTCCC GATCTCCGTT ATCTGGTGGG AGATTGTCAG CTTGCGCATA TCACCAACGC CAGCCTTTCA CATCCCAGCC GCTGGTTATT GAACGTCGGC ACGGTAGAAC TTGATTCTGC TTGTCTGGCG AAATTGCCGC AGACGGAGCA ATCGCCAGCC GCTCCAAAAA CCCTCGCGCA GTGGCAGGCC ATGTTGCCTA ACACCTGGAT CAATATCGAT AAACTGATTT TTTCTCCCTG GCAGGAATGG CAGGGAAAAC TCTCTCTCGC ATTAACCTCC GATATCCAGC AACTGCGTTA TCAGGGCGAA AAAGTTAAAT TTCAAGGCCA GCTGAAAGGG CAACAACTTA CAGTCAGCGA ACTGGATGTC GTCGCGTTTG AAAATCAGCC GCCGGTTAAA CTGGTGGGGG AATTTACTAT GCCGCTCGTG CCGGATGGAC TTCCGGTAAG TGGGCATGCT ACTGCGACGT TAAACTTGCC GCAGGAACCG TCACTGGTGG ATGCAGAGCT GGACTGGCAG GAAAATAGCG GGCAATTGAT CGTGCTGGCA CGGGATAACG GCGATCCGTT GCTCGATTTG CCGTGGCAAA TTACTCGTCA ACAATTGACC GTAAGCGATG GTCGCTGGAG CTGGCCGTAT GCAGGTTTTC CTTTGAGTGG CCGACTGGGT GTCAAAGTCG ACAACTGGCA GGCAGGGCTC GAGAACGCTC TGGTCAGCGG ACGACTGAGT GTGCTGACCC AGGGGCAAGC GGGTAAGGGC AACGCGGTGC TTAATTTTGG CCCAGGAAAA TTAAGCATGG ATAACAGCCA GCTGCCTATG CAACTGACCG GTGAAGCGAA ACAGGCGGAC CTCATTTTAT ATGCCCGTTT ACCTGCGCAG CTAAGTGGAA GTCTGTCTGA CCCAACGTTG ACCTTTGAGC CAGGCGCGTT ACTTCGTTCA AAGGGAAGAG TCATTGATTC GCTGGACATC GATGAAATCC GCTGGCCTTT AGCGGGTGTA AAAGTCACCC AACGTGGTAT TGACGGACGT TTGCAGGCCA TTTTGCAGGC GCATGAAAAT GAACTGGGCG ATTTCGTGCT GCATATGGAT GGACTGGCGA ATGATTTTCT CCCTGACGCT GGCCGCTGGC AGTGGCGCTA CTGGGGAAAA GGCAGTTTTA CACCGATGAA TGCCACCTGG GATGTCGCAG GAAAGGGTGA GTGGCATGAC AGCACGATTA CGCTGACCGA TCTCTCCACC GGTTTCGACC AGTTACAATA CGGTACGATG ACGGTAGAAA AGCCGCGACT AATTCTCGAC AAGCCCGTCG TCTGGGTACG TGACGCACAG CATCCCTCCT TTAGCGGCGC GCTGTCACTG GACGCCGGGC AAACGCTGTT CACTGGCGGC AGTGTGTTAC CGCCATCGAC CTTAAAATTT AGCGTCGATG GACGCGATCC GACGTATTTC CTCTTTAAAG GCGATTTACA TGCTGGTGAG ATTGGCCCGG TTCGGGTAAA TGGTCGCTGG GACGGTATTC GTCTGCGCGG TAACGCCTGG TGGCCTAAAC AATCACTGAC CGTATTCCAG CCGCTGGTGC CACCCGACTG GAAGATGAAC TTACGCGATG GCGAACTATA TGCCCAGGTC GCATTTTCTG CTGCGCCTGA ACAAGGATTC CGCGCGGGAG GGCACGGCGT GTTGAAAGGC GGTAGTGCCT GGATGCCAGA TAATCAGGTT AACGGTGTCG ATTTTGTCCT GCCTTTCCGT TTTGCCGATG GAGCCTGGCA TCTGGGGACT CGCGGCCCCG TTACGTTGCG AATTGCCGAA GTGATTAATC TGGTGACAGC GAAAAATATT ACGGCTGATT TGCAAGGGCG TTATCCGTGG AGTGAAGACG AACCATTGTT GTTGACCGAT GTTAGCGTCG ATGTGTTAGG CGGTAACGTG CTGATGAAAC AATTACGTAT GCCGCAACAT GACCCGGCGC TGTTGCGGCT GAATAATCTT TCCTCCAGCG AACTGGTTAG CGCCGTCAAT CCGAAACAAT TCGCCATGTC CGGGGCATTT AGTGGTGCAT TGCCGTTATG GCTAAACAAC GAAAAATGGA TAGTGAAAGA TGGTTGGCTG GCAAATAGCG GGCCAATGAC CTTGCGGCTG GATAAAGACA CCGCAGATGC GGTGGTGAAA GACAATATGA CTGCGGGTTC AGCAATTAAC TGGTTGCGCT ATATGGAAAT TAGCCGTTCA TCGACAAAAA TTAATTTAGA TAATCTCGGT TTATTAACCA TGCAGGCCAA CATTACAGGT ACCAGTCGCG TTGATGGTAA AAGCGGTACG GTAAATCTTA ATTACCATCA TGAAGAGAAT ATTTTTACGC TGTGGCGCAG TTTACGCTTT GGCGATAATC TCCAGGCATG GTTGGAGCAG AACGCACGTC TGCCGGGAAA TGACTGTCCG CAAGGAAAAG AGTGTGAGGA AAAACAATGA
|
Protein sequence | MLGKYKAVLA LLLLIILVPL TLLMTLGLWV PTLAGIWLPL GTRIALDESP RITRKGLIIP DLRYLVGDCQ LAHITNASLS HPSRWLLNVG TVELDSACLA KLPQTEQSPA APKTLAQWQA MLPNTWINID KLIFSPWQEW QGKLSLALTS DIQQLRYQGE KVKFQGQLKG QQLTVSELDV VAFENQPPVK LVGEFTMPLV PDGLPVSGHA TATLNLPQEP SLVDAELDWQ ENSGQLIVLA RDNGDPLLDL PWQITRQQLT VSDGRWSWPY AGFPLSGRLG VKVDNWQAGL ENALVSGRLS VLTQGQAGKG NAVLNFGPGK LSMDNSQLPM QLTGEAKQAD LILYARLPAQ LSGSLSDPTL TFEPGALLRS KGRVIDSLDI DEIRWPLAGV KVTQRGIDGR LQAILQAHEN ELGDFVLHMD GLANDFLPDA GRWQWRYWGK GSFTPMNATW DVAGKGEWHD STITLTDLST GFDQLQYGTM TVEKPRLILD KPVVWVRDAQ HPSFSGALSL DAGQTLFTGG SVLPPSTLKF SVDGRDPTYF LFKGDLHAGE IGPVRVNGRW DGIRLRGNAW WPKQSLTVFQ PLVPPDWKMN LRDGELYAQV AFSAAPEQGF RAGGHGVLKG GSAWMPDNQV NGVDFVLPFR FADGAWHLGT RGPVTLRIAE VINLVTAKNI TADLQGRYPW SEDEPLLLTD VSVDVLGGNV LMKQLRMPQH DPALLRLNNL SSSELVSAVN PKQFAMSGAF SGALPLWLNN EKWIVKDGWL ANSGPMTLRL DKDTADAVVK DNMTAGSAIN WLRYMEISRS STKINLDNLG LLTMQANITG TSRVDGKSGT VNLNYHHEEN IFTLWRSLRF GDNLQAWLEQ NARLPGNDCP QGKECEEKQ
|
| |