Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2935 |
Symbol | |
ID | 6143346 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 3009559 |
End bp | 3010923 |
Gene Length | 1365 bp |
Protein Length | 454 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641617804 |
Product | hypothetical protein |
Protein accession | YP_001744959 |
Protein GI | 170680380 |
COG category | [R] General function prediction only |
COG ID | [COG1611] Predicted Rossmann fold nucleotide-binding protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.153502 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 49 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGATTACAC ATATTAGCCC GCTTGGCTCC ATGGATATGT TGTCGCAGCT GGAAGTGGAT ATGCTTAAAC GCACCGCCAG CAGCGACCTC TATCAACTGT TTCGCAACTG TTCACTTGCC GTACTGAACT CCGGTAGTTT GACCGATAAC AGCAAAGAAT TGCTGTCTCG TTTTGAAAAT TTCGATATTA ACGTCCTGCG CCGTGAACGC GGCGTAAAGC TGGAACTGAT TAATCCCCCG GAAGAGGCTT TTGTCGATGG GCGAATTATT CGCGCTTTGC AGGCCAACTT GTTCGCGGTT CTGCGCGACA TTCTCTTCGT TTACGGGCAA ATCCATAACA CCGTTCGTTT TCCCAACCTG AATCTCGACA ACTCCGTCCA CATCACTAAC CTGGTCTTTT CCATCTTGCG TAACGCTCGC GCGCTGCATG TTGGTGAAGC GCCAAATATG GTGGTCTGCT GGGGCGGGCA CTCAATTAAC GAAAACGAGT ATTTGTATGC CCGTCGCGTC GGGAATCAGC TGGGCCTGCG TGAGCTGAAT ATCTGCACCG GCTGTGGTCC GGGAGCGATG GAAGCGCCAA TGAAAGGTGC TGCGGTCGGA CACGCACAGC AGCGTTACAA AGACAGTCGT TTTATTGGTA TGACAGAGCC ATCGATTATC GCCGCTGAAC CGCCTAACCC GCTGGTCAAC GAATTGATCA TCATGCCGGA TATCGAAAAA CGTCTGGAAG CGTTTGTCCG TATCGCTCAC GGTATCATTA TCTTCCCTGG CGGTGTGGGT ACGGCAGAAG AGTTGCTGTA TTTGCTGGGA ATTTTAATGA ACCCGGCCAA CAAAGATCAG GTTTTACCAT TGCTCCTCAC CGGCCCGAAA GAGAGCGCCG ACTACTTCCG CGTACTGGAC GAGTTTGTCG TGCATACGTT GGGTGAAAAC GCGCGCCGCC ATTACCGCAT CATCATTGAT GACGCCGCTG AAGTCGCTCG TCAGATGAAA AAATCGATGC CGCTGGTGAA AGAAAATCGC CGCGACACAG GCGATGCCTA CAGCTTTAAC TGGTCAATGC GCATTGCGCC AGATTTGCAA ATGCCGTTTG AGCCGTCTCA CGAGAATATG GCTAATCTGA AGCTTTACCC GGATCAACCC GTTGAAGTGC TGGCTGCCGA CCTGCGCCGT GCGTTCTCCG GTATTGTGGC GGGTAACGTA AAAGAAGTCG GTATTCGCGC CATTGAAGAG TTTGGTCCTT ATAAAATCAA CGGCGATAAA GAGATTATGC GTCGTATGGA TGACCTGCTA CAGGGTTTTG TTGCCCAGCA TCGTATGAAG TTGCCAGGCT CAGCCTACAT CCCTTGCTAC GAAATCTGCA CGTAA
|
Protein sequence | MITHISPLGS MDMLSQLEVD MLKRTASSDL YQLFRNCSLA VLNSGSLTDN SKELLSRFEN FDINVLRRER GVKLELINPP EEAFVDGRII RALQANLFAV LRDILFVYGQ IHNTVRFPNL NLDNSVHITN LVFSILRNAR ALHVGEAPNM VVCWGGHSIN ENEYLYARRV GNQLGLRELN ICTGCGPGAM EAPMKGAAVG HAQQRYKDSR FIGMTEPSII AAEPPNPLVN ELIIMPDIEK RLEAFVRIAH GIIIFPGGVG TAEELLYLLG ILMNPANKDQ VLPLLLTGPK ESADYFRVLD EFVVHTLGEN ARRHYRIIID DAAEVARQMK KSMPLVKENR RDTGDAYSFN WSMRIAPDLQ MPFEPSHENM ANLKLYPDQP VEVLAADLRR AFSGIVAGNV KEVGIRAIEE FGPYKINGDK EIMRRMDDLL QGFVAQHRMK LPGSAYIPCY EICT
|
| |