Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A0940 |
Symbol | |
ID | 5593844 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 938781 |
End bp | 940532 |
Gene Length | 1752 bp |
Protein Length | 583 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640920110 |
Product | tail fiber domain-containing protein |
Protein accession | YP_001457677 |
Protein GI | 157160359 |
COG category | [R] General function prediction only |
COG ID | [COG5301] Phage-related tail fibre protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 63 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCACAA AATTTTATAC CCTGCTGACG GATATTGGCG CGGCGAAACT TGCCAGCGCC GCCGCGCTCG GTGTGCCGTT AAAAATTACC CATATGGCGG TCGGCGATGG CGGCGGAACA TTACCAACGC CGGACGCAAA GCAGACAGCA CTGGTAAATG AGAAACGCCG GGCTGCGCTG AATATGCTCT ATATCGACCC GCAGAACAGC AGCCAGATTA TTGCTGAACA GGTGATCCCT GAAAACGAGG GCGGTTGGTG GATACGTGAA GTGGGCCTGT TTGATGAGTC CGGGGCATTG ATTGCCGTGG GCAACTGCCC GGAAAGCTAT AAGCCGCAAC TGGCTGAAGG CAGCGGGCGT ACCCAGACCG TGCGCATGGT GCTGATTACC AGCAGTACGG ACAATATCAC CCTGAAAATC GACCCTGCTG TAGTGCTGGC AACCCGCAAG TATGTGGATG ACAAAATATC AGAGCACGAA CAGTCACGAC GTCACCCGGA CGCCTCGCTG ACCGCAAAGG GTTTTACTCA GTTAAGCAGT GCGACCAACA GTGAATCCGA AATACTGGCC GCAACACCGA AGGCTGTGAA GGCGGCATAT GATCTTGCAG CAGGTAAAGC ATCCGCCAGT CACACACACC CGTGGAATCA GATAACGGAT GTGCCTGCAG CTTCACTGAC GGTAAAAGGC ACCGTGCAAC TCAGCAGCGC CACTAACAGC ACGTCAGAAA CGCAGGCTGC CACACCAAAG GCAGTGAAGG CGGCATATGA CCTTGCAGCA GGTAAGGCAC CTGTCAGTCA CACGCACCCG TGGAGCCAGA TAACAGATGT GCCTGCAGCT TCACTGACGG TAAAAGGCAC CGTGCAACTC AGCAGCGCTA CTAACAGCAC GTCAGAAACG CAGGCTGCCA CACCAAAAGC CGTGAAGGCT GTATATGACC TTGCCAATGG AAAACAACCT GCCGACGCCA CACTGACCGC ACTGGCAGGC CTTGCCACTG CGGCAGACAA ACTTCCGTAT TTTACGGGGA ATGATACAGC CAGCCTGACA ACCCTGACTA ACGTTGGACG GAATATTCTG GATAAAGCAA GCACACAGGC GGTTATTCAA TATCTTGGTC TGAGCGATGC AAGTGGATAC GTTGGACGCT GGCTGAATAC CCAGGTTTTC ACCTCATCAG GTACGTACAC CCCGACGCCA GGAACAAAAC GGATTAGGGT CACAATAACG GGCGGCGGTG GCGGAGGGGG CGGCTGCAAG GCTATATCCA ATAATGAAAC GTTTTTCGGT GCTGGCGGCG GGGCAGGTGG GACAGTAATC ACCACGCTGA TCCTGACGAA GGATAGTTAT CCTGTCACTA TCGGCGCAGG TGGGGCCGGC GGCGTTAGTG CGACGAACGG CCTCAAGGGC GGTGATAGCT CGTTCGGATC GGTAATAGCC CCTGGTGGTG AAGGTGGTGG AAAATCAGGA GTCACAAACA CGAACGGTGG TAACGGCGGT GTGCCAAGTA CTGGCGGTAT CAACATCATT GGTGGAAATG GAGGCGACGG TCAGTCCGGA AATATCGGCG TCAGCGGTGA AGGCGGAACA TCGCACTGGG GTGGCGGTGG ACGCGCAGGC GCTGGCGGTG GTGTTAGTGG TAAGGCATAT GGTTCAGGTG GCGGTGGCGC ATACGATGCC GGTTATAGCG GAACCAGTAT GACAGGCGGG AAAGGTGCCG CTGGGATTTG TATTATCGAG GAGTTTGCAT AA
|
Protein sequence | MSTKFYTLLT DIGAAKLASA AALGVPLKIT HMAVGDGGGT LPTPDAKQTA LVNEKRRAAL NMLYIDPQNS SQIIAEQVIP ENEGGWWIRE VGLFDESGAL IAVGNCPESY KPQLAEGSGR TQTVRMVLIT SSTDNITLKI DPAVVLATRK YVDDKISEHE QSRRHPDASL TAKGFTQLSS ATNSESEILA ATPKAVKAAY DLAAGKASAS HTHPWNQITD VPAASLTVKG TVQLSSATNS TSETQAATPK AVKAAYDLAA GKAPVSHTHP WSQITDVPAA SLTVKGTVQL SSATNSTSET QAATPKAVKA VYDLANGKQP ADATLTALAG LATAADKLPY FTGNDTASLT TLTNVGRNIL DKASTQAVIQ YLGLSDASGY VGRWLNTQVF TSSGTYTPTP GTKRIRVTIT GGGGGGGGCK AISNNETFFG AGGGAGGTVI TTLILTKDSY PVTIGAGGAG GVSATNGLKG GDSSFGSVIA PGGEGGGKSG VTNTNGGNGG VPSTGGINII GGNGGDGQSG NIGVSGEGGT SHWGGGGRAG AGGGVSGKAY GSGGGGAYDA GYSGTSMTGG KGAAGICIIE EFA
|
| |