Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2075 |
Symbol | |
ID | 6144621 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2089438 |
End bp | 2090490 |
Gene Length | 1053 bp |
Protein Length | 350 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641616951 |
Product | hypothetical protein |
Protein accession | YP_001744127 |
Protein GI | 170680794 |
COG category | [R] General function prediction only |
COG ID | [COG1054] Predicted sulfurtransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.602347 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 0.562441 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCAGTGT TACACAACCG CATTTCCAAC GACGCGTTAA AAGCCAAAAT GTTGGCTGAG AGCGAACCGC GAACCACCAT TTCGTTTTAT AAGTATTTCC ACATCGCCGA TCCTAAGGCG ACCCGTGACG CTTTATATCA GCTGTTTACC GCGCTGAATG TTTTTGGGCG AGTGTATCTG GCGCATGAGG GCATTAACGC GCAAATCAGC GTACCTGCGA GCAATGTTGA AACATTTCGC GCGCAGCTTT ATGCCTTCGA CCCGGCTTTA GAGGGCTTAC GCCTGAATAT CGCGTTGGAA GATGACGGGA AATCCTTCTG GGTACTGCGC ATGAAGGTCC GCGATCGTAT CGTTGCCGAC GGTATTGACG ATCCTCACTT TGATGCCAGC AATGTGGGTG AGTATCTGCA AGCGGCGGAA GTGAACGCCA TGCTTGACGA TCCCGATGCA TTGTTTATCG ACATGCGTAA CCACTATGAG TATGAAGTGG GGCACTTTGA AAACGCGCTG GAAATTCCGG CAGATACCTT CCGTGAGCAG CTGCCAAAAG CAGTTGAGAT GATGCAGGCA CATAAAGATA AAAAAATCGT CATGTACTGC ACCGGCGGCA TTCGTTGTGA AAAAGCCAGT GCCTGGATGA AACATAACGG ATTCAATAAA GTCTGGCATA TCGAGGGTGG AATTATTGAA TACGCCCGTA AGGCGCGCGA GCAGGGCTTG CCGGTGCGTT TTATTGGCAA AAATTTTGTT TTTGACGAGC GGATGGGCGA ACGTATATCT GATGAGATTA TCGCGCATTG CCACCAGTGC GGTGCGCCGT GCGACAGCCA TACCAACTGT AAAAATGATG GCTGCCATCT GCTTTTTATT CAGTGTCCAG TATGTGCGGA AAAATACAAA GGTTGTTGTA GTGAGATTTG CTGCGAAGAA AGCGCGTTAC CGCCAGAGGA ACAGCGACGC CGTCGGGCAG GACGTGAAAA TGGCAATAAG ATCTTTAATA AGTCTCGTGG ACGTCTGAAT ACAACACTGG GCATTCCTGA TCCAACAGAG TAA
|
Protein sequence | MPVLHNRISN DALKAKMLAE SEPRTTISFY KYFHIADPKA TRDALYQLFT ALNVFGRVYL AHEGINAQIS VPASNVETFR AQLYAFDPAL EGLRLNIALE DDGKSFWVLR MKVRDRIVAD GIDDPHFDAS NVGEYLQAAE VNAMLDDPDA LFIDMRNHYE YEVGHFENAL EIPADTFREQ LPKAVEMMQA HKDKKIVMYC TGGIRCEKAS AWMKHNGFNK VWHIEGGIIE YARKAREQGL PVRFIGKNFV FDERMGERIS DEIIAHCHQC GAPCDSHTNC KNDGCHLLFI QCPVCAEKYK GCCSEICCEE SALPPEEQRR RRAGRENGNK IFNKSRGRLN TTLGIPDPTE
|
| |