Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_0649 |
Symbol | |
ID | 6968369 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 676762 |
End bp | 678663 |
Gene Length | 1902 bp |
Protein Length | 633 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 643384686 |
Product | type VI secretion system Vgr family protein |
Protein accession | YP_002269199 |
Protein GI | 209397950 |
COG category | [S] Function unknown |
COG ID | [COG3501] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR01646] Rhs element Vgr protein [TIGR03361] type VI secretion system Vgr family protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.823903 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 56 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCAACCG GATTACGTTT CACACTGGAA GTGGACGGCC TGCCACCGGA CGCTTTTGCG GTGGTTTCCT TTCATCTCAC CCAGTCACTC TCTTCGCTCT TTTCCCTCGA TCTCTCCTTG GTCAGCCAGC AGTTTCTCTC CCTTGAATTT GCGCAGGTGC TGGACAAAAT GGCCTACCTG ACAATATGGC AGGGTGATGA TGTACAGCGC CGGGTGAAAG GTGTGGTGAC CTGGTTTGAA CTGGGGGAGA ACGACAAAAA CCAGATGCTG TACAGCATGA AGGTGCACCC GCCGCTGTGG CGTGCCGGTC TGCGCCAGAA CTTCCGTATC TTCCAGAACG AGGACATCAA AAGCATCCTC GGCACGATAT TGCAGGAAAA CGGGGTGACC GAGTGGAGTC CGCTGTTCAG TGAACCGCAT CCTTCCCGTG AGTTTTGTGT CCAGTACGGT GAGACTGATT ACGATTTCCT GTGCCGGATG GCGGCGGAGG AAGGCATCTT CTTTTATGAG GAGCATGCTT ACAAAAGTAC CGACCAGAGC CTGGTGCTGT GCGACACGGT CCGCCATCTG CCCGAATCTT TTGAAATCCC CTGGAACCCG AACACCCGTA CCGAGGTGAG CACCCTCTGC ATCAGCCAGT TTCGCTACAG CGCACAAATC CGCCCTTCTT CCGTGGTGAC CAAAGACTAC ACCTTTAAAC GCCCCGGCTG GGCAGGACGT TTTGATCAGG AAGGCCAGCA CCAGGATTAC CAGCGCACGC AGTATGAGGT GTATGACTAC CCCGGACGTT TCAAGGGCGC CCACGGGCAG AACTTTGCCC GCTGGCAGAT GGAGGGCTGG CGAAACAATG CAGAAACCGC GCGGGGAATG AGTCGCTCGC CGGAGATATG GCCGGGACGG CGAATTGTGC TGACGGGGCA TCCGCAGGCG AACCTGAACC GGGAATGGCA GGTGGTGGCA AGTGAACTGC ACGGCGAGCA GCCGCAGGCG GTGCCGGGAA GGCGGGGAGC AGGGACGGCG CTGGAGAACC ATTTTGCGGT GATCCCGGCA GACCGGACAT GGCGACCCCA GCCGCGGCTA AAACCGCTGG TGGACGGTCC GCAGAGCGCC GTCGTGACGG GGCCGGAGGG TGAGGAAATC TTCTGTGATG AACATGGCCG AGTGCGGGTG AAATTCAACT GGGACCGTTA TAACCCGGCA GACCAGGACA GCTCGTGCTG GATCCGTGTG GCACAGGCGT GGGCAGGCAC CGGTTTTGGC AACCTGGCGA TACCGCGTGT GGGTCAGGAG GTGATAGTGG ACTTCCTCAA CGGCGATCCG GACCAGCCGA TCATTATGGG GCGCACCTAC CACCAGGAAA ACCGCACCCC CGGCAGCCTG CCGGGGACGA AGACGCAGAT GACCATCCGT TCCAAAACGT ATATGGGCAG CGGGTTTAAT GAGCTGAAGT TTGATGATGC GACAGGGAGA GAACAGGTCT ACATCCACGC GCAGAAGAAC ATGGATACCG AAGTGCTCAA CGACCGTACC ACCACCGTAA AACACGATCA TCGCGAAACC GTAAAAAATG ACCAGACGGT CACGATCCAG GAAGGTAACC GCCTTCTTAC GGTGGAAAAA GGCCACAAGA TCACCGGAGT ACTGAAAGGG TCTTTATCTG AGGATGTCTT TCAGGACAGA GGCACGATTG CCGGTTCGGT GCATGTTGAC GCTGTAAACA ATGGTGGCGA AGGCAACGGT ATACAGGCTT ATACGGCAAT TAAGGAAATT ATGCTGGCCG TGGAGGAAAG CAAAATTGCG CTGACGCCGG ATGGCATTCA GCTACAGGTC GGGGAATCAA CGGTAATCAG GCTGTCGAAG GATGGCATCA CCATCGTGGG CGGTTCTGTT TTCATCAACT GA
|
Protein sequence | MSTGLRFTLE VDGLPPDAFA VVSFHLTQSL SSLFSLDLSL VSQQFLSLEF AQVLDKMAYL TIWQGDDVQR RVKGVVTWFE LGENDKNQML YSMKVHPPLW RAGLRQNFRI FQNEDIKSIL GTILQENGVT EWSPLFSEPH PSREFCVQYG ETDYDFLCRM AAEEGIFFYE EHAYKSTDQS LVLCDTVRHL PESFEIPWNP NTRTEVSTLC ISQFRYSAQI RPSSVVTKDY TFKRPGWAGR FDQEGQHQDY QRTQYEVYDY PGRFKGAHGQ NFARWQMEGW RNNAETARGM SRSPEIWPGR RIVLTGHPQA NLNREWQVVA SELHGEQPQA VPGRRGAGTA LENHFAVIPA DRTWRPQPRL KPLVDGPQSA VVTGPEGEEI FCDEHGRVRV KFNWDRYNPA DQDSSCWIRV AQAWAGTGFG NLAIPRVGQE VIVDFLNGDP DQPIIMGRTY HQENRTPGSL PGTKTQMTIR SKTYMGSGFN ELKFDDATGR EQVYIHAQKN MDTEVLNDRT TTVKHDHRET VKNDQTVTIQ EGNRLLTVEK GHKITGVLKG SLSEDVFQDR GTIAGSVHVD AVNNGGEGNG IQAYTAIKEI MLAVEESKIA LTPDGIQLQV GESTVIRLSK DGITIVGGSV FIN
|
| |