Gene Information |
Locus tag | EcE24377A_1284 |
Symbol | |
ID | 5588402 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | + |
Start bp | 1284108 |
End bp | 1287521 |
Gene Length | 3414 bp |
Protein Length | 1137 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640924981 |
Product | phage tail domain-containing protein |
Protein accession | YP_001462390 |
Protein GI | 157159373 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3064] Membrane protein involved in colicin uptake |
TIGRFAM ID | |
|
|
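The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes the start and end positions are 1-based and inclusive, and that the CDS length includes the terminal stop codon. The function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers introduced for this example.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming the final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which is not translated.
    return cds_length_bp // 3 - 1


# Values copied from the record for locus tag EcE24377A_1284.
START_BP = 1284108
END_BP = 1287521

length = gene_length_bp(START_BP, END_BP)
print(length, protein_length_aa(length))
```

Under these assumptions, the computed values agree with the Gene Length (3414 bp) and Protein Length (1137 aa) fields in the record.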
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAGTAA AGATTTCAGG TGTACTGAAA GACGGCACAG GAAAACCGGT ACAGAACTGC ACAATCCAGC TGAAAGCAAA ACGTAACAGC ACCACGGTGG TGGTGAACAC ACTGGCCTCA GAAAATCCGG ATGAAGCCGG GCGTTACAGC ATGGACGTTG AGTACGGTCA GTACAGCGTT ATTCTGTTGG TGGAAGGCTT CCCGCCGTCA CATGCCGGGA CCATCACCGT GTATGAAGAT TCCCGACCCG GTACGCTGAA TGATTTTCTC GGTGCCATGA CGGAGGATGA TGCCCGTCCT GAGGCACTGC GCCGTTTTGA ACTGATGGTG GAAGAGGTGG CGCGTAACGC GTCCGCGGTG GCACAGAACA CGGCAGCCGC GAAGAAGTCA GCCAGCGATG CCGGCACATC AGCCCGTGAG GCGGCAACCC ATGCGACTGA TGCTGCAGGC TCAGCACGCG CAGCCAGCAC GTCAGCCGGA CAGGCCGCGT CGTCGGCTCA GTCAGCGTCT TCCAGCGCAG GAACGGCATC AACAAAGGCT ACTGAAGCAT CAAAAAGTGC TGCTGCTGCA GAGTCCTCAA AAAGCGCGGC AGCTACCAGT GCCAGTGCCG CGAAAACGTC AGAAACGAAT GCGGCAGCGT CACAAAAATC TGCAGCCACT TCTGCATCCG CCGCGACCAC AAAGGCGTCA GAAGCTGCCA CCTCAGCCCG GGATGCGGCG GCCTCAAAAG AGGCAGCGAA ATCATCAGAA ACGAACGCAT CATCAAGCGC CAGTAGTGCA GCTTCCTCGG CAACGGCGGC AGGAAATTCC GCGAAGGCGG CAAAGACGTC CGAGACGAAC GCCAGGTCTT CTGAAACGGC AGCGGGACAG AGTGCCTCAG CTGCGGCAGG CTCAAAAACA GCGGCTGCGT CGTCTGCCAG CGCCGCGTCA ACAAGTGCCG GGCAGGCCTC AGCCAGTGCC ACCGCCGCCG GAAAATCGGC AGAAAGCGCC GCATCATCCG CTTCAACAGC CACAACGAAG GCTGGCGAAG CCACTGAGCA GGCCAGCGCA GCAGCGAGGT CTGCTTCTGC AGCAAAAACC TCTGAAACAA ATGCAAAGAC TTCAGCAGAC AATGCTGCTT CCTCTAAGGC GGCAGCCGCA TCGTCCGCTG GTTCAGCGGC GTCATCGGCA TCATCTGCGT CTGCTTCAAA AGATGAGGCG ACCAGACAAG CGTCAGCAGC GAAAGGTAGT GCCACGACAG CAACAACGAA AGCATTAGAG GCGGCAGGCA GTGCGACGGC TGCATCTCAG AGCAAAGTTG CTGCTGAATC CGCGGCAACG CGCGCCGAGA CAGCGGCAAA ACGGGCAGAG GATATTGCAT CCGCCGTGGC GCTTGAGGAT GCGAGCACGA CGAAAAAGGG GATAGTCCAG CTCAGCAGTG CGACCAACAG TACGTCTGAA ACGCTGGCGG CAACGCCAAA GGCAGTAAAA TCAGCCTATG ACAATGCAGA GAAACGTCTG CAGAAAGACC AGAACGGCGC TGATATACCC GATAAGGGAC GCTTCCTGAA CAACATTAAC GCGGTCAGTA AAACAGACTT TGCTGATAAG CGTGGTATGC GTTATGTGCG GGTTAACGCT CCTGCAGGTG CAACATCTGG AAAATATTAC CCTGTTGTTG TTATGCGTTC TGCTGGCTCA GTAAGCGAAC TGGCATCAAG GGTCATTATC ACCACGGCAA CGCGAACCGC AGGCGATCCG ATGAATAACT GCGAGTTTAA CGGATTTGTT ATGCCTGGTG GCTGGACTGA CAGGGGGCGT 
TATGCTTATG GAATGTTCTG GCAATATCAA AACAATGAAC GAGCCATCCA CTCAATAATG ATGAGTAATA AGGGCGATGA TTTGCGCTCT GTGTTCTATG TTGATGGCGC TGCTTTCCCT GTTTTTGCGT TTATCGAAGA TGGCCTGTCA ATATCCGCAC CTGGTGCTGA TCTCGTTGTT AATGATACGA CCTATAAGTT TGGTGCAACA AATCCAGCGA CTGAATGTAT CGCGGCGGAC GTTATCCTTG ATTTTAAGAG TGGGCGTGGT TTTTATGAGT CCCATTCGTT AATCGTTAAC GATAACTTGT CGTGCAAAAA ACTTTTTGCC ACAGACGAAA TTGTAGCGCG TGGTGGTAAT CAGATTCGAA TGATAGGTGG GGAGTATGGT GCATTATGGC GTAATGATGG CGCTAAAACT TACCTGCTGC TTACCAATCA AGGTGATGTT TATGGTGGCT GGAATACATT AAGACCGTTT GCTATTAATA ACGCAACCGG CGAACTGGTT ATTGGAACCA AACTGTCCGC AAGTCTGAAC GGTAATGCAT TAACAGCAAC AAAGCTGCAA ACGCCAAGAC TGGTTTCTGG TGTTGAGTTT GATGGTTCCA GAGATATTAC TTTAACCGCT GCGCATGTGG CTGCTTTTGC CAGAAGGGCA ACGGATACAT ATGCCGATGC GGATGGTGGC GTTCCCTGGA ATGCCGAATC AGGCGCTTAC AATGTCACCC GCTCTGGCGA CACCTATATT CTGGTTAACT TCTATACCGG AGTCGGAAGT TGCCGGACCC TGCAGATGAA GGCGCATTAC AGAAATGGTG GTCTGTTCTA CCGTTCCTCA AGAGATGGCT ATGGTTTTGA GGAAGACTGG GCAGAAGTTT ATACCTCGAA AAATCTTCCA CCAGAAAGCT ACCCAGTCGG CGCACCAATC CCGTGGCCAT CAGATACCGT TCCGTCTGGT TATGCCCTGA TGCAGGGGCA GACTTTTGAC AAATCTGCTT ACCCGAAACT TGCAGCCGCT TATCCGTCAG GTGTGATCCC TGATATGCGT GGCTGGACGA TTAAGGGCAA ACCTGCCAGT GGTCGGGCCG TATTGTCTCA GGAACAGGAC GGCATTAAAT CGCACACCCA CAGCGCCAGC GCATCCAGTA CGGATTTGGG GACGAAAACC ACATCGTCGT TTGATTACGG CACTAAATCC ACGAATAACA CCGGGGCGCA TACGCACAGT CTGAGTGGCT CTACGGGGTC TGCCGGTGTT CATACTCATG GTAATGGTAT TCGTTGGCCA GGAGGCGGCG GTTCTGCGTT ATCATTTTAT GATGGCGGTG GGTTCACTTA TGTCCAGAAT TCACAGTATC AAGTAAGCCC GGAGACTTCT TCCTATAGAT CGTATTATCA ACGTATTCAG ACACAGTCAG CAGGTGCTCA TACCCACTCG CTGTCTGGTA CTGCAGCAAG TTCTGGCGCA CATGCACATA CTGTAGGTAT TGGTGCGCAT ACGCACTCCG TTGCGATTGG TTCACATGGA CACACCATCA CCGTTAACGC TGCGGGTAAC GCGGAAAACA CCGTCAAAAA CATCGCATTT AACTATATTG TGAGGCTTGC ATAA
|
Protein sequence | MAVKISGVLK DGTGKPVQNC TIQLKAKRNS TTVVVNTLAS ENPDEAGRYS MDVEYGQYSV ILLVEGFPPS HAGTITVYED SRPGTLNDFL GAMTEDDARP EALRRFELMV EEVARNASAV AQNTAAAKKS ASDAGTSARE AATHATDAAG SARAASTSAG QAASSAQSAS SSAGTASTKA TEASKSAAAA ESSKSAAATS ASAAKTSETN AAASQKSAAT SASAATTKAS EAATSARDAA ASKEAAKSSE TNASSSASSA ASSATAAGNS AKAAKTSETN ARSSETAAGQ SASAAAGSKT AAASSASAAS TSAGQASASA TAAGKSAESA ASSASTATTK AGEATEQASA AARSASAAKT SETNAKTSAD NAASSKAAAA SSAGSAASSA SSASASKDEA TRQASAAKGS ATTATTKALE AAGSATAASQ SKVAAESAAT RAETAAKRAE DIASAVALED ASTTKKGIVQ LSSATNSTSE TLAATPKAVK SAYDNAEKRL QKDQNGADIP DKGRFLNNIN AVSKTDFADK RGMRYVRVNA PAGATSGKYY PVVVMRSAGS VSELASRVII TTATRTAGDP MNNCEFNGFV MPGGWTDRGR YAYGMFWQYQ NNERAIHSIM MSNKGDDLRS VFYVDGAAFP VFAFIEDGLS ISAPGADLVV NDTTYKFGAT NPATECIAAD VILDFKSGRG FYESHSLIVN DNLSCKKLFA TDEIVARGGN QIRMIGGEYG ALWRNDGAKT YLLLTNQGDV YGGWNTLRPF AINNATGELV IGTKLSASLN GNALTATKLQ TPRLVSGVEF DGSRDITLTA AHVAAFARRA TDTYADADGG VPWNAESGAY NVTRSGDTYI LVNFYTGVGS CRTLQMKAHY RNGGLFYRSS RDGYGFEEDW AEVYTSKNLP PESYPVGAPI PWPSDTVPSG YALMQGQTFD KSAYPKLAAA YPSGVIPDMR GWTIKGKPAS GRAVLSQEQD GIKSHTHSAS ASSTDLGTKT TSSFDYGTKS TNNTGAHTHS LSGSTGSAGV HTHGNGIRWP GGGGSALSFY DGGGFTYVQN SQYQVSPETS SYRSYYQRIQ TQSAGAHTHS LSGTAASSGA HAHTVGIGAH THSVAIGSHG HTITVNAAGN AENTVKNIAF NYIVRLA
|
| |