Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_3513 |
Symbol | |
ID | 6144783 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 3591493 |
End bp | 3593742 |
Gene Length | 2250 bp |
Protein Length | 749 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 641618342 |
Product | hypothetical protein |
Protein accession | YP_001745489 |
Protein GI | 170681434 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 58 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTGAAC AAACCGTTCA TTTGTTTGGC ATCCGTCACC ATGGCCCTGG CTGCGCGCGC AGCCTGCGAA AGGCGCTGGA GACGCTACAG CCCGACTGCC TGCTGGTGGA AGGGCCGCCC GACGGCGAAT CCATCCTGCC TTTTATGCAG CATGAACAGA TGCAACCGCC GGTCGCGCTA CTGATTTATG CCCCGGACGA CTCACATCAC GCCGCGTTTT ACCCCTTTGC GGCGTTCTCC CCGGAATGGC AGGCGCTGCG CTATGGTTTT GAGCAAAATA TCGCCGTGCG CTTTATGGAC TTGCCCATCA GTCATCAGTT CGCGCTTGAT GACGAAAAAG AGAGCGATGA CGATGATGCC GTTGAAGCCA GTCCCGACGG CGATCCGCTG GACTGGCTGG GCCGCGCCGC CGGTTACACC GATGGGGAAA GCTGGTGGAG CCACAGAGTA GAGGAACGCG AAGATGATTT GTCTCTGTTC GAGGCGATTC GCGAGGCGAT GATTGCCCTA CGCCAGGCAA CTCCCGAAGC CCGAAACTCA GCCAGAGATC AACTGCGTGA AGCGTATATG CGCAAGACGC TACGTCAGGC GAAAAAAGAG GGCTTCTCGC GCATCGCGGT AGTCTGCGGT GCGTGGCACG TTCCGGCGCT GGAAAACCTG CCGCCCGCAA AGAATGATAA CGAACTGCTG AAAAATTTGC CGAAGTGCAA GGTCGCGGCA GCCTGGACGC CCTGGAGCTA CGAGGCGTTA AGCCGCGCCA GCGGTTATGG CGCAGGGGTG GTGTCACCGG AATGGTACGA TCATCTGTGG CGTTATCAGG GCGCTTCTCA CCGCGATATC GGCTGGCTGT CACGCGCGGC AAGGCTGTTT CGCGAAGCCG ATCTCGACTG TTCCAGCGCC CACATTATCG AAGCGGCGCG GCTGGCGCAG ACGCTGGCGA TTATGCGCCA TCACCCGCAG CCGGGGCTTG ACGAACTTTG CGAAGCGCTG CAAACCGTAG TGTGTATGGG CGAAAGCGCG CCAATGCAGA TGATTCGCCA GAAGTTAATT GTCGGCGACG CGCTGGGAAG CGTACCGGAT GATACGCCCG TCGTGCCGCT CCAACGCGAC ATTACGCAAC AGCAAAAAAC GCTGCGCCTG AAGGCAGAAG CCAGCGAAAA AGTGCTGGAT CTTGATCTGC GCAAACCCGG CGATCTGGCG CGTAGCCATT TACTGCATCG CCTGACCTTA CTCGACATTT CCTGGGGCCG TCTGGCTGGA CAAGGAAATA ACAGCAAAGG GACGTTCCAC GAAGTCTGGT CGCTGCGCTG GGAACCGGCA CTGGCAATCA ACATCATTAC CGCCAGCCGT TGGGGTAACA GCATTGAGCA GGCCAGCTCC CGCTATGCCA TTGTGAGGGC GCAATACGCC AGCACGCTGC CGGAACTGGC AAAGCTTATC CAACAGGTGC TACTGGCCGA TTTACAGACC GCGATTGCCC CCATCGCCAA TACACTGGAA TCACTGGCTG CCACTCAGGG CAATATCGAA CAGTTGCTGG AAGCCCTGGC ACCACTGGTG GCGATTGTCC GCTACGGCAA CGTGCGCCAG ACCGACTCTG GTATGGTAAT GCAGGTACTG ATGAGCCTCG CCCCGCGCGC CGCCATCGCC CTACCCGGAG CCTGTTCGGC GCTCAACGAC GACAGCGCCG CCAGTATGAG AGAAAAAGTT ATCGACGCCC ACGCCGCGCT GCGGTTGCTG GATAACGAAG ATCTATTGGC GGGCTGGTTG CAGGCGTTGA TGGTGCTGGC AGAGGGCAGT ACCGCACACG CGCTGCTTCG CGGAACGGCG ACGCGCCTGT TGTTTGATCT GCAAACGTTA ACGACAGAAC GGATCAGCAC GCTGATGAAC CTGGCGCTAT CGCCAGCCAA TCCCCCGGCG GAAAGTGCCG CATGGGCGGA AGGTTTCCTC AACGACAGCG CGATGGTGCT ACTGCACAAT AGCGAATTAT GGCAGTTGAT CGATGCCTGG CTAAGCGGCC TGAACGACAA TCACTTCACC CGGATCCTGC CGATGCTGCG GCGGACATTC GGCCGTTTTT CCAGCCCGGA ACGCCGCCAG TTAGGCGAAC GCGCCGCGCA GGGCGAGCGT GTCGCGCAGC AAGAAGAGAC CAGCGGTATA TGGGATGAAC AGCGTGCCGC GTTAATGCTG CCGCTACTGC GCCGCATTCT GGCTTTGCCA CCACACGAAA GGGAAAATCA TGTCGAGTAA
|
Protein sequence | MTEQTVHLFG IRHHGPGCAR SLRKALETLQ PDCLLVEGPP DGESILPFMQ HEQMQPPVAL LIYAPDDSHH AAFYPFAAFS PEWQALRYGF EQNIAVRFMD LPISHQFALD DEKESDDDDA VEASPDGDPL DWLGRAAGYT DGESWWSHRV EEREDDLSLF EAIREAMIAL RQATPEARNS ARDQLREAYM RKTLRQAKKE GFSRIAVVCG AWHVPALENL PPAKNDNELL KNLPKCKVAA AWTPWSYEAL SRASGYGAGV VSPEWYDHLW RYQGASHRDI GWLSRAARLF READLDCSSA HIIEAARLAQ TLAIMRHHPQ PGLDELCEAL QTVVCMGESA PMQMIRQKLI VGDALGSVPD DTPVVPLQRD ITQQQKTLRL KAEASEKVLD LDLRKPGDLA RSHLLHRLTL LDISWGRLAG QGNNSKGTFH EVWSLRWEPA LAINIITASR WGNSIEQASS RYAIVRAQYA STLPELAKLI QQVLLADLQT AIAPIANTLE SLAATQGNIE QLLEALAPLV AIVRYGNVRQ TDSGMVMQVL MSLAPRAAIA LPGACSALND DSAASMREKV IDAHAALRLL DNEDLLAGWL QALMVLAEGS TAHALLRGTA TRLLFDLQTL TTERISTLMN LALSPANPPA ESAAWAEGFL NDSAMVLLHN SELWQLIDAW LSGLNDNHFT RILPMLRRTF GRFSSPERRQ LGERAAQGER VAQQEETSGI WDEQRAALML PLLRRILALP PHERENHVE
|
| |