Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A1639 |
Symbol | |
ID | 5593271 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 1662058 |
End bp | 1664703 |
Gene Length | 2646 bp |
Protein Length | 881 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640920787 |
Product | fibronectin type III domain-containing protein |
Protein accession | YP_001458343 |
Protein GI | 157161025 |
COG category | [S] Function unknown |
COG ID | [COG4733] Phage-related protein, tail component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 0.994447 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTGACCC ATCCGCGCTA CGGCATGGGG AAACGTCTTG GTGCGGCAGA TGTGGATAAA TGGGCGCTGT ATGTCATCGG CCAGTACTGC GACCAGTCGG TGCCGGACGG TTTTGGCGGC ACGGAGCCGC GCATCACCTG TAATGCCTAC CTGACCACGC AGCGTAAGGC GTGGGATGTG CTCAGTGATT TCTGCTCGGC GATGCGCTGT ATGCCGGTAT GGAACGGGCA GACGCTGACG TTCGTGCAGG ACCGACCATC AGATAAGGTG TGGACCTATA ACCGCAGTAA TGTGGTGATG CCGGATGATG GCGCGCCGTT CCGCTACAGC TTCAGCGCCC TGAAGGACCG CCATAATGCC GTTGAGGTGA ACTGGATTGA CCCGAACAAC GGCTGGGAGA CGGCGACAGA GCTTGTTGAA GATACGCAGG CCATTGCCCG TTACGGTCGT AATGTCACGA AGATGGATGC CTTTGGCTGT ACCTGCCGGG GGCAGGCACA CCGCGCCGGG CTGTGGCTGA TTAAAACGGA ACTGCTGGAG ACGCAGACCG TGGATTTCAG CGTGGGTGCT GAAGGGCTTC GCCATGTACC GGGCGATGTC ATTGAAATCT GCGATGATGA CTATGCCGGT ATCAGCACCG GCGGGCGCGT GCTGGCGGTA AACAGCCAGA CCCGGACGCT GACGCTCGAC CGTGAAATCA CGCTGCCATC TTCCGGCACC ACGCTGATAA GCCTGGTTGA CGGGCAGGGG AGTCCGGTCA GCGTGGAGGT TCAGTCCGTC ACCGACGGCG TGAAGGTAAA AGTGAGCCGT GTTCCTGACG GTGTTGCTGA ATACAGCGTA TGGGGGCTGA AGCTGCCGAC GCTGCGCCAG CGACTGTTCC GCTGCGTGAG TATCCGTGAG AACGACGACG GCACGTATGC CATCACCGCC GTGCAGCATG TGCCGGAAAA AGAGGCCATC GTGGATAACG GGGCGCACTT TGACGGCGAC CAGAGCGGCA CGGTGAATGG TGTCACGCCG CCAGCGGTGC AGCACCTGAC TGCCGAAGTC ACCGCAGACA GCGGGGAGTA TCAGGTACTG GCCCGCTGGG ACACGCCGAA GGTGGTGAAG GGGGTGAGCT TTATGCTTCG CCTGACCGTG GCAGCGGACG ACGGCAGTGA GCGGCTGGTC AGCACGGCCC GGACGACAGA AACCACATAC CGCTTCACGC AACTGGCGCT GGGACGGTAC ACGCTGACAG TCCGGGCGGT AAATGCGTGG GGACAGCAGG GCGATCCGGC GTCGGTATCG TTCCGGATTG CCGCACCGGT AGCACCGTCG CGGATTGAGC TGACGCCGGG CTATTTTCAG ATAACTGCCA CGCCGCATCT TGCGGTTTAT GATCCGACGG TACAGTTTGA GTTCTGGTTC TCGGAAACGC GGATTACCGA TATCAGGCAG GTTGAAACCA CAGCCCGCTA CCTTGGCGCG GGGCTGTACT GGATAGCCGC CAGTATCAAT ATCAAACCGG GCCATAATTA TTATTTTTAC GTTCGCAGTG TGAACACCGT TGGCAAATCG GCATTCGTGG AGGCTGTTGG TCAGCCGAGT GATGACGCAT CCGGCTATCT GGATTTTTTC AAAGGCGAGA TAGGGAAAAC CCATCTGGCT CAGGAGCTGT GGACGCAGAT TGATAACGGT CAGCTTGCGC CTGACCTGGC TGAAATCAGG ACATCCATTA CGGATGTCAG CAATGAAATC ACACAGACCG TCAATAAGAA ACTGGAAGAC CAGAGTGCAG CGATCCAGCA GATACAGAAG GTTCAGGTTG ATACAAATAA TAACCTGAAC AGCATGTGGG CCGTGAAACT GCAGCAGATG CAGGACGGAC GCCTTTATAT TGCGGGTATC GGTGCCGGTA TTGAGAATAC GCCAGCAGGA ATGCAGAGTC AGGTGCTGCT GGCGGCAGAC AGGATTGCGA TGATTAATCC TGCGAATGGC AACACAAAGC CGATGTTTGT TGGTCAGGGT GATCAGATAT TCATGAACGA CGTGTTCCTG AAACGCCTGA CGGCTCCCAC CATTACCAGC GGCGGTAATC CTCCTGCATT TTCCCTTACA CCGGACGGGC GACTGACGGC GAAAAATGCG GATATCAGTG GCAGTGTGAA TGCGAACGCC GGGACGCTCA ACAATGTCAC GATAAATGAA AACTGTCGGG TTCTGGGAAA ACTGTCTGCG AACCAGATTG AAGGCGATCT CGTTAAAACA GTGGGCAAAG CTTTCCCCCG GGACTCCCGT GCACCGGAGC GGTGGCCATC AGGGACCATT ACCGTCAGGG TTTATGACGA TCAGCCGTTT GACCGGCAGA TTGTTATTCC CACGGTGGCG TTTCGTGGCG CTAAACATGA GCGGGAGAAT AACGATATTT ATTCGTCATG CCGCCTGATA GTGAAGAAAA ACGGTGCTGA AATTTATAAC CGTACCGCGC TGGATAATAC GCTGGTTTAT ACAGGTGTTA TTGATATGCC TGCTGGTCGC GGTCACATGA CGCTGGAGTT TTCGGTATCA GCGTGGCTGG TAAATGACTG GTATCCCACA GCCAGTATCA GTGATTTGCT GGTTGTGGTG ATGAAGAAAT CCACAGCAGG TATCAGTATC AGCTGA
|
Protein sequence | MLTHPRYGMG KRLGAADVDK WALYVIGQYC DQSVPDGFGG TEPRITCNAY LTTQRKAWDV LSDFCSAMRC MPVWNGQTLT FVQDRPSDKV WTYNRSNVVM PDDGAPFRYS FSALKDRHNA VEVNWIDPNN GWETATELVE DTQAIARYGR NVTKMDAFGC TCRGQAHRAG LWLIKTELLE TQTVDFSVGA EGLRHVPGDV IEICDDDYAG ISTGGRVLAV NSQTRTLTLD REITLPSSGT TLISLVDGQG SPVSVEVQSV TDGVKVKVSR VPDGVAEYSV WGLKLPTLRQ RLFRCVSIRE NDDGTYAITA VQHVPEKEAI VDNGAHFDGD QSGTVNGVTP PAVQHLTAEV TADSGEYQVL ARWDTPKVVK GVSFMLRLTV AADDGSERLV STARTTETTY RFTQLALGRY TLTVRAVNAW GQQGDPASVS FRIAAPVAPS RIELTPGYFQ ITATPHLAVY DPTVQFEFWF SETRITDIRQ VETTARYLGA GLYWIAASIN IKPGHNYYFY VRSVNTVGKS AFVEAVGQPS DDASGYLDFF KGEIGKTHLA QELWTQIDNG QLAPDLAEIR TSITDVSNEI TQTVNKKLED QSAAIQQIQK VQVDTNNNLN SMWAVKLQQM QDGRLYIAGI GAGIENTPAG MQSQVLLAAD RIAMINPANG NTKPMFVGQG DQIFMNDVFL KRLTAPTITS GGNPPAFSLT PDGRLTAKNA DISGSVNANA GTLNNVTINE NCRVLGKLSA NQIEGDLVKT VGKAFPRDSR APERWPSGTI TVRVYDDQPF DRQIVIPTVA FRGAKHEREN NDIYSSCRLI VKKNGAEIYN RTALDNTLVY TGVIDMPAGR GHMTLEFSVS AWLVNDWYPT ASISDLLVVV MKKSTAGISI S
|
| |