Gene Information |
Locus tag | EcSMS35_4500 |
Symbol | |
ID | 6143773 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 4598294 |
End bp | 4599832 |
Gene Length | 1539 bp |
Protein Length | 512 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 641619316 |
Product | hypothetical protein |
Protein accession | YP_001746428 |
Protein GI | 170680249 |
COG category | [S] Function unknown |
COG ID | [COG1357] Uncharacterized low-complexity proteins |
TIGRFAM ID | |
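
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single untranslated stop codon; both are assumptions, though the listed Gene Length and Protein Length are consistent with them. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 4598294
end_bp = 4599832

# Assumption: coordinates are 1-based and inclusive on the replicon.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1539 512
```

Both results match the Gene Length (1539 bp) and Protein Length (512 aa) fields.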
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 58 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACATG ATGTGGTGCA AGGAAATAAT AAATTAGATC TTGATTTACT ACGTAATTTT AATGGGGTGC CAGGTTTAAA TAGAGATAAC TTTATTTATA TCAGTGATAT ATTTTTAAAT ATAAAACAAC GGAACGAAAA AAATCATGCA ATAAATATGT TTCGTGAAGT CTCAATCAGT AATGATATTA TAAGCGTAAA ATTTTATAGA AATGAAGAAA TAGAATGTGC TTGTGATTTT CTGATGGATA AAGATGCGCA GGGGTATACT GACCTGTCTG ATTTGGATTT AACAAGTTGT CATTTTAAAG GTGACGTTAT TTCGAAGGTG TCTTTCATAT CATCAAATCT ACAACATGTA ACATTCGAAT GTAAAGAAAT AGGGGATTGC AATTTTACTA CTGCAACAGT TGATAATGTC ATATTTAAAT GTCGACGTTT ACACAATGTG ATTTTTATCA AAGCGACTGG CGAATATGTC GATTTTAGCC AAAGTATTCT TGATACAGTT GACTTCTCGC GGAGCCAGCT TACTCATAGT AATTTTCGTG AATGTCAGAT TAGAAATTCA AAGTTCAATA ATTGTTATCT TTATGCTTCG CACTTCACCA GAGCAGAATT TCTTTCTACC AAAGAAATAT CATTTATTAA ATCGAATCTG ACAGCTGTTA TGTTTGATCA TGTGCGAATG TCGACAGGGA ATTTTAAAGA TTGCATTACA GAACGATTGG AATTAACTAT TGATTATTCA GATATATTTG GGAATGAAGA ACTTGATGGT TATATCAATA ACATTATAAA AATGATTGAT ACATTGCCAG ATAATGCAAT GATATTGAAA TCCGTTCTGG CAGTAAAACT GGTGATGCAA TTAAAAATTC TTAATATTGT TAATAAAAAC TTTATTGAGA ATATGAAGAA AATATTTAGC CATTGTCCTT ATATAAAAGA TCCAATTATA CGTAGTTATA TCCATCCTGA TGAAGATAAC AAGTTCGATA ATTTTATGCG TCAAAATCGA TTCAGTAAGA TGCATTTCGA TACCCAACAG ATGATCGATT TTATTAACAG ATTTAATATG AATAAATGGC TGATTGATCA AAATAACAAT TTTTTTATCC AACTTATCGA TCAGGCTCTA CGATCAACGG ATGATACGAT CAAAGCAAAT GCCTGGCATC TTTATAAAGA GTGGATTCGT AGTGATGATG TTTCACCTTT ATTTATAGAA ATTGAAGATA ATTTAAGAAC CTTTAACACG AATGAATTAA CACGAAATGA TAATATCTTT ATCCTGTTCT CCTCTGTCGA TGATGGGCCA GTTATGGTGG TAAGCTCCCA GCGCTTACAT GATATGTTGA ATCCTACAAA AGATACCAAT TGGAATTCCA CGTATATCTA CAAATCCAGA CATGAGATGT TGCCTGTTAA TCTTACTCCG GAAACACTTT TCGGCTCCAA ATCTCAGGAT AAACATGCGC TTTTCCCCAT TTTTACTGCG AGTTGGCGAG CTAATCGTAT AAAGAATAAA GGTATTTAA
|
Protein sequence | MKHDVVQGNN KLDLDLLRNF NGVPGLNRDN FIYISDIFLN IKQRNEKNHA INMFREVSIS NDIISVKFYR NEEIECACDF LMDKDAQGYT DLSDLDLTSC HFKGDVISKV SFISSNLQHV TFECKEIGDC NFTTATVDNV IFKCRRLHNV IFIKATGEYV DFSQSILDTV DFSRSQLTHS NFRECQIRNS KFNNCYLYAS HFTRAEFLST KEISFIKSNL TAVMFDHVRM STGNFKDCIT ERLELTIDYS DIFGNEELDG YINNIIKMID TLPDNAMILK SVLAVKLVMQ LKILNIVNKN FIENMKKIFS HCPYIKDPII RSYIHPDEDN KFDNFMRQNR FSKMHFDTQQ MIDFINRFNM NKWLIDQNNN FFIQLIDQAL RSTDDTIKAN AWHLYKEWIR SDDVSPLFIE IEDNLRTFNT NELTRNDNIF ILFSSVDDGP VMVVSSQRLH DMLNPTKDTN WNSTYIYKSR HEMLPVNLTP ETLFGSKSQD KHALFPIFTA SWRANRIKNK GI
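
A value such as the GC content field can be recomputed from the gene sequence. The sketch below assumes GC content means the fraction of G and C bases over the total sequence length, and that the sequence is pasted in the space-separated 10-base blocks shown above. The helper name `gc_fraction` is hypothetical and is not part of the database.

```python
def gc_fraction(sequence: str) -> float:
    """Return the fraction of G/C bases in a nucleotide sequence.

    Whitespace (e.g. the 10-base block separators) is ignored.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = bases.count("G") + bases.count("C")
    return gc_count / len(bases)


# Usage on the first three blocks of the gene sequence above;
# paste the full sequence to check the whole gene.
prefix = "ATGAAACATG ATGTGGTGCA AGGAAATAAT"
print(f"{gc_fraction(prefix):.1%}")
```

Ignoring whitespace inside the function means the sequence can be copied from the record without reformatting.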
|
| |