Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcE24377A_1042 |
Symbol | |
ID | 5586406 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | - |
Start bp | 1065925 |
End bp | 1067334 |
Gene Length | 1410 bp |
Protein Length | 469 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 640924747 |
Product | putative phage tail fiber protein |
Protein accession | YP_001462161 |
Protein GI | 157156132 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTTTATA TTGATAACGA CAGCGGCGTA ACCATCATGC CGCCCGTATC CGCCCAGCGT AGTGCTATCG TTCGCTGGTT TTCAGAAGGT GACGGGAATA ATGTTATCAC ATGGCCCGGC ATGGACTGGT TTAATATTGT GCAGGCGGAG TTATTAAACA CGCTGGAAGA AGCCGGTATT CAACCGGATA AAACAAAATT AAACCAGCTT GCACTGTCCA TTAAAGCCAT TATGAGCAAT AACGCGCTGC TGATAAAAAA TAACCTCAGC GAAATTAAAA CTGCCGGGGC ATCAGCACAG CGTACAGCAC GTGAAAATCT GGATATCTAT GATGCCAGCC TGAACAAAAA AGGACTCGTT CAGCTAACCA GTGCCACTGA CAGCCCCAGT GAAACGCTGG CAGCCACCGC AAAAGCGGTG AAAATTGCGA TGGATAATGC CAATGCCCGT CTGGCAAAAG ACCGGAACGG AGCAGATATT CCCAATAAGC CGCTGTTTAT CCAAAACCTC GGTTTACAGG AAACGGTAAA CAAGGCTGGT AACGCCGTTC AGCGTTCCGG CGATAAAATG ACCGGAGAAC TGAAAATTGG AACGATGAAT GCGCTGCGAA TTTTTAATGA TGCCTTCGGT CTTATTTTCC GCCGTTCAGA AGAGTCCCTT CATTTCATCC CTACGGCTGA AGGACAAGGC GAAAACGGTG ATATCGGCCC ATTAAGGCCA TTCGCTATAA ATCTAAGAAC AGGTGCTATA TATGTCAGCC ACGGGGCCAA AATTGAAGGA GGTCTAGCTA TTGGTGCTAC TGATAACGCA CTGGGTGAAA ACTCCATTGT TCTGGGAGAT AACGACACCG GATTTAGGCA AGATGGAGAT GGTATTATTA GCTTCTATTC AAATGGTTCG CGCATCGGAC ATATTGATGG GTTAGGATTA CATCTTTATA AAGATATTGA ATCTAATTGC AGCGCTTTTA GATTAAAAAG TAATTACCGC CACCACATTA CATTCACCAA CGAAGACGGA AGTATTCGTA TGTTTTTGTG GAAAGATAAC GGTGGTGATG GTGTTCATAT TAATAACGGT TCAGATGGTG GTGGTGATTT CATTTTTAAA ACAGATGGGG GATTTGCGCT GGGAAGTGGT GCGCAAGTTG CGTCTAGTGG CGATATTTAT GGTTCAGTGT GGGGAAACAA CTGGTTAAGC ACATGGCTGC ATAATCATGT CGTTCGGGAT ATTCGTCTTG GCAGCATTGA ATATAAAAAC GTATGGCGCG ACTACGGCTT TGGCGATGCG TCAGGTTATG TTTTAACAGC CGCAATTAAC GGCAATGCGG ATGATCTTGT CGACACTGTT GCCAGAAGGC CAATTCAGAA ATTGATTGGG GGAATATGGT ACAACGTGGG GAGTGTTTAA
|
Protein sequence | MFYIDNDSGV TIMPPVSAQR SAIVRWFSEG DGNNVITWPG MDWFNIVQAE LLNTLEEAGI QPDKTKLNQL ALSIKAIMSN NALLIKNNLS EIKTAGASAQ RTARENLDIY DASLNKKGLV QLTSATDSPS ETLAATAKAV KIAMDNANAR LAKDRNGADI PNKPLFIQNL GLQETVNKAG NAVQRSGDKM TGELKIGTMN ALRIFNDAFG LIFRRSEESL HFIPTAEGQG ENGDIGPLRP FAINLRTGAI YVSHGAKIEG GLAIGATDNA LGENSIVLGD NDTGFRQDGD GIISFYSNGS RIGHIDGLGL HLYKDIESNC SAFRLKSNYR HHITFTNEDG SIRMFLWKDN GGDGVHINNG SDGGGDFIFK TDGGFALGSG AQVASSGDIY GSVWGNNWLS TWLHNHVVRD IRLGSIEYKN VWRDYGFGDA SGYVLTAAIN GNADDLVDTV ARRPIQKLIG GIWYNVGSV
|
| |