Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A0234 |
Symbol | |
ID | 5592001 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 253577 |
End bp | 254659 |
Gene Length | 1083 bp |
Protein Length | 360 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640919421 |
Product | hypothetical protein |
Protein accession | YP_001457008 |
Protein GI | 157159690 |
COG category | [S] Function unknown |
COG ID | [COG3520] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR03347] type VI secretion protein, VC_A0111 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 72 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACGGAA AGAATCGGGC AGCATCCTCT TATCTGAGTC CGGGAAATCC CCCGGCAGAT AAAGAGCAGA ATGATCCGCT GGCACAGGTT TTCCATAATG CCTGCTCTTA CAATTTTTTT GCGATGGCGG AGCTGTTGCA CCGCCTGGCA AAGGGTGAAA AGGGGACGCC AGAATTATCC CTGCGTGACG ATCCGGCACA GGAAACCCTG CGTTTCAGCG CCGATGCCAG CCTTGCCTTC CCGTGCAGTG ATATCAGTGC GCTGAAACGG GATACATCAG GGGCATTCAG GATGACCACC ACCTTTATGG GGTTACAGGG GAGCCAGTCG CCTCTCCCCG GTTATTACCT CGATCACCTG GCCTGGAAAG CGGTACATGA ACAAAGTCCG GTAGGTGATT TTCTGGATAT GTTCAGCCAC CGCCTGACCC AGTTTGTCTG GCATATCTGG CGTAAGTACC GCTATCACAT CAGTTTTCGT AACGGCGGTG TGGATGCGTT CTCGCAGCGC ATGTACTCCC TTGTCGGGCT GGGCCACCGC CAGCTTCGCG ATAAGCTGGC GATTAATCAC AGCAAAATGC TGGCGTATTC CGGGATCCTG GCAAATCCGG GACGATCCCC GGAAATAATC TGTGGCCTGG TGTCCCACTG CTTTGATTTG TCAGAAGTGA CGCTGCAAAA CTGGCAACGT CGCAAGGTGG ATATTGAACC GGATCAGCAA AATAGTCTGG GGAGCTATTC TCTTAAAAAT GGCGAAAAAC TGGCGGGAAG ATCGGTACTG GGCAACTTTG TTCTGGGTAC ACGAGTGCCC GATCTCAGCG GAAAATTCCA GCTCAGTATT ACCAGCCTGA CCCGAAAACA GTTCCTCTCT TTTCTGCCGT CCGGCGAAAA CTTCCTGCCA CTGACGATGT TCGTGTCCTT CATCCTGCGC GATCAGCTTG CCTGGGACTT ACATCTGGGG CTGGCGCCGG AGCAGGTGGG CGCAATGCGC CTGGGCGATA ACAAAAGTGC GCTGCTGGGC TGGACCAGCT TCCTCGGCAC ACCGGAAGAA CGACCCTCAG TCACCATCAG GGTACGGTCG TAA
|
Protein sequence | MDGKNRAASS YLSPGNPPAD KEQNDPLAQV FHNACSYNFF AMAELLHRLA KGEKGTPELS LRDDPAQETL RFSADASLAF PCSDISALKR DTSGAFRMTT TFMGLQGSQS PLPGYYLDHL AWKAVHEQSP VGDFLDMFSH RLTQFVWHIW RKYRYHISFR NGGVDAFSQR MYSLVGLGHR QLRDKLAINH SKMLAYSGIL ANPGRSPEII CGLVSHCFDL SEVTLQNWQR RKVDIEPDQQ NSLGSYSLKN GEKLAGRSVL GNFVLGTRVP DLSGKFQLSI TSLTRKQFLS FLPSGENFLP LTMFVSFILR DQLAWDLHLG LAPEQVGAMR LGDNKSALLG WTSFLGTPEE RPSVTIRVRS
|
| |