Gene Information |
Locus tag | EcSMS35_2397 |
Symbol | |
ID | 6144112 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 2446224 |
End bp | 2447183 |
Gene Length | 960 bp |
Protein Length | 319 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641617270 |
Product | hypothetical protein |
Protein accession | YP_001744442 |
Protein GI | 170680731 |
COG category | [S] Function unknown |
COG ID | [COG5464] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR01784] conserved hypothetical protein (putative transposase or invertase) |
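
The coordinate and length fields above agree with one another, and the arithmetic can be sketched as follows. This assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical and do not come from any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length_bp // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


length_bp = gene_length(2446224, 2447183)
print(length_bp)                  # 960
print(protein_length(length_bp))  # 319
```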
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 59 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGAAT CAACAACCTC CTCCCCGCAT GATGCGGTAT TTAAAACCTT TATGTTCTCG CCCGATACGG CGCGAGATTT TCTGGAAATA CATCTACCAG AGCCGCTGCG CAAGCTTTGC AACCTGCAAA CCTTACGCCT GGAACCCACC AGCTTTATTG AAAAAAGTTT ACGCGCTTAC TACTCGGATG TTTTGTGGTC TGTGGAAACC AGCGAGGGTG ACGGTTATAT CTACTGCGTG ATTGAACATC AAAGCTCTGC AGAAAAGAAT ATGGCTTTTC GGCTAATGCG CTATGCCACT GCCGCCATGC AGCGTCACCT GGACAAAGGC TATGACAGAG TCCCGCTGGT GGTACCATTG CTGTTTTATC ATGGCGAAAC CTCGCCCTAC CCGTACTCAC TCAACTGGCT GGATGAGTTT GACGATCCGC AACTTGCCCG GCAGTTGTAC ACCGAAGCTT TTCCGTTGGT GGATATCACC ATCGTACCTG ACGATGAGAT CATGCAACAT CGGCGTATAG CTCTGCTGGA ACTGATTCAA AAGCATATTC GCGACCGCGA TTTAATCGGT ATGGTCGACA GGATCACCAC GCTTTTGGTT AGAGGCTTCA CTAATGACAG CCAGCTACAA ACGCTGTTTA ATTATCTGCT GCAATGCGGC GATACCTCCC GTTTCACCCG TTTTATTGAG GAGATTGCCG AACGTTCACC ACTACAAAAG GAGAGATTAA TGACTATTGC TGAACGGCTG CGGCAGGAAG GGCATCAAAT TGGCTGGCAG AAGGGTAAGA TTGAAGGTTG GCAGGAGGGG AAGTTGGAGG GTTTGCAGAA AGGCAAAGTA GAAGGTATGC ATGAACAAGC CATTAAAATT GCCTTGCGCA TGCTGGAACA GGGCTTTGAA CGTGAGATTG TGCTGGCGAC AACCCAACTC ACTGATGCTG ATATTCCGAA CTGTTATTAA
|
Protein sequence | MTESTTSSPH DAVFKTFMFS PDTARDFLEI HLPEPLRKLC NLQTLRLEPT SFIEKSLRAY YSDVLWSVET SEGDGYIYCV IEHQSSAEKN MAFRLMRYAT AAMQRHLDKG YDRVPLVVPL LFYHGETSPY PYSLNWLDEF DDPQLARQLY TEAFPLVDIT IVPDDEIMQH RRIALLELIQ KHIRDRDLIG MVDRITTLLV RGFTNDSQLQ TLFNYLLQCG DTSRFTRFIE EIAERSPLQK ERLMTIAERL RQEGHQIGWQ KGKIEGWQEG KLEGLQKGKV EGMHEQAIKI ALRMLEQGFE REIVLATTQL TDADIPNCY
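
As a rough cross-check of the two sequences above, the gene sequence can be translated and its GC fraction computed with a few lines of Python. This is a minimal sketch: it uses the standard codon assignments, which translation table 11 shares with table 1 (the two tables differ in which start codons are allowed), and it ignores ambiguous bases. The helper names are illustrative, not taken from any library.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-letter blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence listed above.
print(translate("ATGACCGAAT CAACAACCTC CTCCCCGCAT"))  # MTESTTSSPH
```

Passing the full gene sequence to `translate` and `gc_percent` gives values to compare against the Protein sequence and GC content fields.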