Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcE24377A_3330 |
Symbol | |
ID | 5586991 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | - |
Start bp | 3345110 |
End bp | 3346396 |
Gene Length | 1287 bp |
Protein Length | 428 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 640926964 |
Product | hypothetical protein |
Protein accession | YP_001464335 |
Protein GI | 157155806 |
COG category | [V] Defense mechanisms |
COG ID | [COG4268] McrBC 5-methylcytosine restriction system component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGAAG TTATCTCAAT TTTTGAATAT GACCTGTTGG GGAGTGGAAA AGCGGCTTCA GTTGGCGCAA AGTTGGTGCC TCAACAGGTC TTTGATTATT TAGAAACACT TAGCCTAACA AGTGAGCAAG GAAGCCAGTT TCTCAAACTC ACCTCTCGTT CAGGTTTCAA ATTGCTGCAG GTACAAAACT ACGCGGGGAT GCTTTCTACA CCTCACGGTT TGCAGCTTGA GATACTGCCC AAGATTGGTA AAAACCTCAC GCTTGCGAGT GCGCGAGAAA CATTCATTAC GATGCTAAGT CACCTACCTA GGTTTCGACA CATTCAAACC CAGCAGGCCA CTCTTCAGGC GCAACGCATG CCTTTGCTCG AGATTTTTAT CAGCCAATTT TTGCAAAGCG TCAGTCAATT GCTTAAACAA GGTTTACGCT CCGACTATGT GAGTGAGAAA GGTAATCTGG CTTTTATGAA GGGCAAACTG ATGCTCTCTG CACAACTGCG ACATAACGCG GTGAATCGCC ATAAGTTTTG TGTCGATTAT GATGAGTATA TGCCTGATTG TGCTGCCAAT CGGCTCTTAC ACTCCACACT GGATAAGTTA CTTAGTCTGA AGCTGTCATC AGAGAATCAA CGCTGGCTTT ACGAACTGTG CTTTGCTTTT GACGGTATTC CACTTAGTAG GGATATAGAG AGCGATCTGA ACAGCTTACG CATTGAGCGT GGTATGACTC ATTACAGCGA GCCAATAGCT TGGGCGCAGT TGATCCTGAG AGGAATGAGC CCAAGTGCAT TGCAAGGAAA CACCAAAGCG ATATCACTTT TGTTTCCTAT GGAAGCGGTA TTTGAATCCT TTGTGGCACA GACCCTACCC TACGAATTAC CTTCTCACCT AAAAGTTTTT TCTCAAGCAG CAACGTATTC TTTGGTAAAG CATGGACTCA AAGATTGCTT TAAGCTTCGC CCAGACTTGC TGATTCAATC TCGGCAACCG ATTCAAACCA AAATGGTGAT GGATACAAAA TGGAAGCAGG TGAATAGCAG CCAGCAAAAA AAATCACTTT ATGGGCTAGC GCAATCCGAC TTCTATCAAA TGTTTGCCTA CGGCCAAAAA TACCTTGGCG GAACAGGCGA AATGTACCTG ATTTACCCTG CGCATGATGA CTTTAGCCAA CCGATACCGC AGCACTTTGC TTTTTCAGAG ACTTTAAAAT TATGGGTTGT GCCATACCGG ATAATGGCAA AACGTGGTGA GAGGATGATG TGGGCAAGTG ATGTGTTAGC TACATAG
|
Protein sequence | MSEVISIFEY DLLGSGKAAS VGAKLVPQQV FDYLETLSLT SEQGSQFLKL TSRSGFKLLQ VQNYAGMLST PHGLQLEILP KIGKNLTLAS ARETFITMLS HLPRFRHIQT QQATLQAQRM PLLEIFISQF LQSVSQLLKQ GLRSDYVSEK GNLAFMKGKL MLSAQLRHNA VNRHKFCVDY DEYMPDCAAN RLLHSTLDKL LSLKLSSENQ RWLYELCFAF DGIPLSRDIE SDLNSLRIER GMTHYSEPIA WAQLILRGMS PSALQGNTKA ISLLFPMEAV FESFVAQTLP YELPSHLKVF SQAATYSLVK HGLKDCFKLR PDLLIQSRQP IQTKMVMDTK WKQVNSSQQK KSLYGLAQSD FYQMFAYGQK YLGGTGEMYL IYPAHDDFSQ PIPQHFAFSE TLKLWVVPYR IMAKRGERMM WASDVLAT
|
| |