Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4928 |
Symbol | |
ID | 6143877 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 5043494 |
End bp | 5045044 |
Gene Length | 1551 bp |
Protein Length | 516 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641619731 |
Product | hypothetical protein |
Protein accession | YP_001746835 |
Protein GI | 170681370 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 50 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCACTT CTCATGAAAA TGCACTGCAA CAACGTTGCC AGCAAATTGT CACCAGCCCG GTACTTAGCC CGGAGCAGAA GCGCCATTTT CTGGCACTGG AAGCAGAAAA CAATCTGCCT TACCCCGCGT TGCCTGCCGA AGCCCGCCGC GCGCTGGATG AGGGTGTAAT CTGCGATATG TTTGAAGGTC ATGCGCCGTA CAAACCGCGC TATGTCTTAC CCGATTACGC CCGTTTTCTG GCGAACGGTT CCGAATGGCT GGAGCTGGAA GGCGCGAAAG ATCTTGATGA CGCACTCTCT CTGCTGACCA TTCTTTACCA CCACGTACCG TCGGTCACAT CGATGCCGGT CTACCTGGGG CAACTGGATG CGTTGTTGCA ACCGTATGTT AGAATTCTAA CACAAGACGA GATCGATATT CGAATAAAAC GTTTCTGGCG TTACCTCGAC AGAACCCTGC CAGACGCCTT TATGCACGCC AATATCGGCC CGTCTGATTC GCCCATCACC CGTGCAATCT TACGTGCAGA TGCAGAGCTG AAGCAGGTTT CACCAAACCT GACCTTTATC TACGATCCTG AAATCACCCC TGATGACCTG CTGCTGGAAG TGGCGAAGAA CATCTGTGAA TGTAGCAAAC CGCACATCGC CAACGGTCCG GTGCATGATA AAATTTTCAC AAAAGGGGGC TACGGGATTG TGAGCTGTTA CAACTCACTG CCGCTGGCGG GTGGTGGCAG CACGCTGGTA CGCCTAAACC TGAAAGCCAT TGCCGAGCGC AGTGAATCGC TGGAGGACTT CTTTACGCGC ACTCTACCGC ACTACTGCCA ACAGCAGATC GCCATCATCG ATGCGCGGTG TGAATTCCTC TATCAACAAT CACACTTCTT TGAGAATAGC TTCCTGGTGA AAGAAGGGCT GATTAACCCT GAACGTTTTG TGCCAATGTT TGGCATGTAC GGACTGGCGG AAGCAGTTAA CTTACTGTGT GAGAAAGAAG GAATTGTCGC ACGTTACGGT AAAGAAGCCA CCGCAAATGA AGTGGGTTAT CGCATCAGCG CGCAACTGGC GGAGTTTGTC GCCAATACCC CTGTGAAATA TGGCTGGCAA AAACGCGCCA TGTTACACGC ACAGTCGGGG ATCAGTTCCG ATATCGGCAC CACGCCGGGC GCGCGTTTAC CATATGGCGA TGAGCCAGAT CCGATTACCC ATCTGCAAAC TGTCGCGCCG CATCATGCTT ATTATTATTC CGGCATCAGC GACATTCTGA CGCTCGACGA AACCATCAAA CGTAATCCGC AGGCGCTGGT ACAGCTTTGC CTCGGTGCCT TTAAAGCCGG AATGCGTGAA TTTACCGCCA ATGTCAGCGG TAACGATCTG GTTCGCGTTA CCGGTTATAT GGTGCGTTTG TCGGATTTAG AAAAATATCG CGCCGAAGGT TCACGCACCA ACACCACCTG GCTGGGAGAA GAAGCCGCAC GCAACACTCG TATTCTGGAA CGTCAGCCGC GCGTGATAAG CCATGAACAG CAGATGCGCT TTAGTCAGTA A
|
Protein sequence | MPTSHENALQ QRCQQIVTSP VLSPEQKRHF LALEAENNLP YPALPAEARR ALDEGVICDM FEGHAPYKPR YVLPDYARFL ANGSEWLELE GAKDLDDALS LLTILYHHVP SVTSMPVYLG QLDALLQPYV RILTQDEIDI RIKRFWRYLD RTLPDAFMHA NIGPSDSPIT RAILRADAEL KQVSPNLTFI YDPEITPDDL LLEVAKNICE CSKPHIANGP VHDKIFTKGG YGIVSCYNSL PLAGGGSTLV RLNLKAIAER SESLEDFFTR TLPHYCQQQI AIIDARCEFL YQQSHFFENS FLVKEGLINP ERFVPMFGMY GLAEAVNLLC EKEGIVARYG KEATANEVGY RISAQLAEFV ANTPVKYGWQ KRAMLHAQSG ISSDIGTTPG ARLPYGDEPD PITHLQTVAP HHAYYYSGIS DILTLDETIK RNPQALVQLC LGAFKAGMRE FTANVSGNDL VRVTGYMVRL SDLEKYRAEG SRTNTTWLGE EAARNTRILE RQPRVISHEQ QMRFSQ
|
| |