Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4765 |
Symbol | |
ID | 6145916 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 4865478 |
End bp | 4866512 |
Gene Length | 1035 bp |
Protein Length | 344 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 641619578 |
Product | hypothetical protein |
Protein accession | YP_001746685 |
Protein GI | 170680748 |
COG category | [R] General function prediction only |
COG ID | [COG3943] Virulence protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 56 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGATA ACATCCCACA AGCTCCTCAA GGTGAGTTTG TTCTTTTCAC CAGTGCTGAT GGTCAAACCC GCGTTGAGTG TCGTTTTGAG TCTGACACCT TGTGGCTTTC TCAGGCTGCA ATGGCAGAGC TTTATCAAGT TAGCTCTCAA GCAATCACCC AACACGTAAA AGCTGTCTAT TCTGAAGAAG AGCTTGAGCA AATTTCAACT TGTAAGGATT ACTTACAAGT TCAACTGGAA GGTGGCAGAG AAGTAAAGCG AAGCATTCGT CACTACAGTC TTCCGGTTAT TCTTGCAGTA GGATACCGAG TTCGCTCAAC ACGTGGCACC CAGTTCCGCC AATGGGCAAC CAGAATGTTG CAAGAATACC TAATCAAAGG ATTCGTCATG GATGACGAGC GCCTGAAAAA TCCGCCCATT GGCCATTCTG CTGTGCCAGA TTACTTTGAT GAAATGCTGG AACGCATTCG TGATATCCGA GCTAGTGAGC GTCGCGTTTA TCTGCGAGTC AAAGAAATCT TTACCATGGC TGCTGATTAC GAGCCATCTA ACCAAGAGAC CAACCGCTTC TTCCAAACCA TTCAGAATAA GCTGCATTAT GCTTGTACGC ATATGACGGC TGCTGAGCTT ATCGCCAGCC GTGTGGATGC CAGTAAGCCG GATATGGGCT TAACCAGCTA TAAAGGCGAT GAAGTGCGCA AGACAGACGT TACTGTTGCC AAGAACTATT TGCGTGAAGA CGAAATCAAA GAGCTTAATC GCATCGTCAA TATGTGGCTC GACTTTGCTG AAGACCAAGC ACTGCGTCGC AAGCAGGTAT TTTTACAGGA CTGGGCCGAT AAGCTTGACC AGTTCTTGAG TTTTAACGAT AGAGATGTAT TAAACGGTGC AGGAAAAATC TCCAAGAAAG ACGCTGATGA CAAAGCGAAA TTGGAGTTTG ACCGCTTTGC CCAGCAGCGC CGTCGTTTAA AAGAAGCCGA AGGCGCACGG GCCAATATCG CAGCCCTCAA GGCCATACTA AAAAAAGATA AATAG
|
Protein sequence | MADNIPQAPQ GEFVLFTSAD GQTRVECRFE SDTLWLSQAA MAELYQVSSQ AITQHVKAVY SEEELEQIST CKDYLQVQLE GGREVKRSIR HYSLPVILAV GYRVRSTRGT QFRQWATRML QEYLIKGFVM DDERLKNPPI GHSAVPDYFD EMLERIRDIR ASERRVYLRV KEIFTMAADY EPSNQETNRF FQTIQNKLHY ACTHMTAAEL IASRVDASKP DMGLTSYKGD EVRKTDVTVA KNYLREDEIK ELNRIVNMWL DFAEDQALRR KQVFLQDWAD KLDQFLSFND RDVLNGAGKI SKKDADDKAK LEFDRFAQQR RRLKEAEGAR ANIAALKAIL KKDK
|
| |