Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A0231 |
Symbol | |
ID | 5591121 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 250417 |
End bp | 251748 |
Gene Length | 1332 bp |
Protein Length | 443 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640919418 |
Product | hypothetical protein |
Protein accession | YP_001457005 |
Protein GI | 157159687 |
COG category | [S] Function unknown |
COG ID | [COG3522] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR03353] type VI secretion protein, VC_A0114 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 70 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAACCA CCCGCAACAA GGTGATGTGG CAGGAAGGGA TGCTGATGCG TCCACACCAT TTCCAGCAGC AGCAGCGTTA CAACGACTAC CTGGATAACC AGCGTTTCCG GGCCATGAAT AATTTATCCT GGGGATTTAC CGAACTCACT CTCAACAATG AACTGCTGGC ACAGGGTAAG ATCATGATTG ACAGCGCGTC AGGCACACTG CCCGACGGCA CCGTCTTTTC TATCCCCGAC CAGGACGCAC TGCCCGATCC GCTGCACCCG CAACATTTTC CGGACGAGAG AAGCCGCAAT ATCTACCTCG CTCTGCCGGT CGCCAGTGAT GTGAGAAATG AAATCAGCGA CGGGCGGCGA ATCGGGCGTT ACCGGCTGAA TTATGCCGAT GTCCGGGATT TGCATTCAGA AGAAGGTGAC ACGCGAACAC TGACGCTGGG ACAACTGACG CCGCGCATTA TGAGCGGTGC AGAAGATATG AGCGCCTATA TTACGCTCCC GCTTTGCCGT ATCAGTGATC GCCATGCCGA CGGCTCCCTG ACGCTGGATG ACGATTTTAT CCCCTCCTGC CAGAATATTC AGGTCAGTAA GAAACTGCGT GTTTATCTCA AAGAGGTACA GGGGGCCATT GGCGGACGGG CAAGCGATCT GGCAAACCGC ATTGGCTCTC CGGCGCAGAG CGGCATCGCG GATGTGGCGG AATTTATGAT GTTGCAGTTA CTTAACCGTA ACCAGACCCG GTTTACCCAT CGCGCTCGTC GATCCCAGCT CCACCCGGAA GATTTCTACC TTGATCTTGC CGGGTTGCTG GGTGAACTGA TGACCTTTAC CGAGCCGTCG CGCCTGCCCT GCCCGCTTGA TGTGTATGAT CATCATGACC TGACCAAAAC ATTTAAAACG CTGTTACCGG AAGTCAAACG GGCGCTGCAT ACCGTACTGT CGCCAAGAGC CGTTAATCTG CCGCTGCATC TGCGTGACGG TATCTGGCAG GCCGATATCC ATGACACAGA ACTGCTGCAA TCTGCCACCT TTGTGCTGGC TGTGGCGGCA AACATGCCGG TCGATCAGAT CCAGCGCCAG TTTATCCAGC AGTCGAAAAT TTCCTCGCCG GAAAAAATCC GCAATATGGT CAGTGTGCAG ATACCCGGCA TTCCATTGCG TGCCCTGATG GTGGCTCCCC GCCAGCTTCC TTACCATTCC GGGTTCAGCT ATTTCGAACT CGACAAGAGC GGACAGGCCT GGACAGAAAT GGCTGCCGCC GGGGCCGTTG CACTGCATGT ATCCGGCAGT TTCCCGGATC TGAACATGCA ACTGTGGGCG ATAAGAGGGT AA
|
Protein sequence | MATTRNKVMW QEGMLMRPHH FQQQQRYNDY LDNQRFRAMN NLSWGFTELT LNNELLAQGK IMIDSASGTL PDGTVFSIPD QDALPDPLHP QHFPDERSRN IYLALPVASD VRNEISDGRR IGRYRLNYAD VRDLHSEEGD TRTLTLGQLT PRIMSGAEDM SAYITLPLCR ISDRHADGSL TLDDDFIPSC QNIQVSKKLR VYLKEVQGAI GGRASDLANR IGSPAQSGIA DVAEFMMLQL LNRNQTRFTH RARRSQLHPE DFYLDLAGLL GELMTFTEPS RLPCPLDVYD HHDLTKTFKT LLPEVKRALH TVLSPRAVNL PLHLRDGIWQ ADIHDTELLQ SATFVLAVAA NMPVDQIQRQ FIQQSKISSP EKIRNMVSVQ IPGIPLRALM VAPRQLPYHS GFSYFELDKS GQAWTEMAAA GAVALHVSGS FPDLNMQLWA IRG
|
| |