Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_2064 |
Symbol | |
ID | 6970938 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 1958329 |
End bp | 1960437 |
Gene Length | 2109 bp |
Protein Length | 702 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 643385976 |
Product | type VI secretion system Vgr family protein |
Protein accession | YP_002270465 |
Protein GI | 209400664 |
COG category | [S] Function unknown |
COG ID | [COG3501] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR01646] Rhs element Vgr protein [TIGR03361] type VI secretion system Vgr family protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.201838 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 0.0713545 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCAACCG GATTACGTTT CACGCTGGAA GTGGACGGCC TGCCACCGGA CGCGTTTGCG GTGGTCTCCT TTCATCTGAA CCAGTCACTC TCTTCACTTT TTTCCCTCGA CCTTTCTCTG GTCAGTCAGC AGTTTCTTTC CCTTGAATTT GCGCAGGTGC TGGACAAAAT GGCCTACCTG ACGGTATGGC AGGGCGATGA CGTACAGCGC CGGGTGAAAG GTGTGGTGAC CTGGTTTGAA CTGGGGGAGA ACGACAAAAA CCAGATGCTG TACAGCATGA AGGTGTGCCC GCCGCTGTGG CGCACAGGGC TGCGCCAGAA CTTCCGTATC TTCCAGAATG AGGACATCGA AAGCATACTC GCTACGATCC TGAAAGAAAA CGGTGTGACC GAGTGGAGTC CGCTGTTCAG CGAGCCGCAT CCTTCCCGTG AGTTTTGTGT CCAGTACGGC GAAACTGATT ACGATTTCCT GTGCCGGATG GCGGCGGAGG AAGGCATCTT CTTTTATGAG GAGCACGCGC AAAAAAGTAT CGACCAGAGC CTGGTCCTGT GCGACACCGT GCGTTATCTG CCGGAGTCCT TTGAGATCCC CTGGAACCCG AACACCCGTA CCGAGGTGAG CACCCTATGC ATCAGCCAGT TCCGCTACAG CGCACAAATC CGCCCTTCTT CCGTGGTGAC CAAGGACTAC ACCTTTAAAC GACCCGGCTG GGCAGGGCGT TTTGATCAGG AAGGCCAGTA CCAGGATTAC CAGCGCACAC AGTATGAAGT GTATGACTAC CCCGGACGTT TCAAGGGTGC CCACGGGCAG AACTTTGCCC GCTGGCAGAT GGATGGCTGG CGCAACAACG CAGAAGTGGC GCGCGGAACA AGCCGTTCTC CGGAAATATG GCCGGGACGG CGAATTGTGC TGACGGGGCA TCCGCAGGCG AACCTGAACC GGGAATGGCA GGTGGTGGCA AGTGAACTGC ACGGCGAGCA GCCACAGGCG GTGCCGGGAC GCAGGGGTTC AGGTACCACG CTGAATAACC ACTTTGCGGT AATACCGGCA GACCGGACAT GGCGACCACA GCCGTTGCTG AAACCGCTGG TGGATGGCCC GCAGAGCGCC GTCGTGACGG GACCGGCAGG CGAGGAAATC TTCTGCGACG AACATGGTCG CGTGCGGGTG AAATTTAACT GGGACCGCTA TAACCCGTCA AACCAGGACA GTTCATGCTG GATCCGTGTG GCACAGGCGT GGGCAGGCAC CGGCTTTGGC AACCTGGCGA TACCGCGTGT GGGTCAGGAG GTGATTGTGG ACTTCCTCAA CGGCGATCCG GACCAGCCGA TTATTATGGG GCGTACCTAC CACCAGGAAA ACCGCACACC CGGCAGCCTG CCGGGGACGA AGACGCAGAT GACCATTCGT TCGAAAAACT ATAAGGGCAG CGGGTTTAAT GAACTGAAGT TTGACGATGC GACAGGGAAA GAACAGGTCT ACATCCACGC GCAGAAGAAC ATGAACACCG AGGTGCTGAA TAACCGCACC ACGGATGTGA TAAACAACCA TGCTGAGACC ATTGGCAACA ATCAGATGAT TGCGGTTACC AACAATCAGA TACAGACGGT GGGCGTTAAC CAGATAGAGA CGGTGGGCAG TAACCAGATC ATTAACGTGG GTTCTGTTCA GGTGGAAACG ATTGGACTTG TTCGTGCGCT GACCGTGGGC GTGGCGTACC AGACGACGGT AGGTGGCATT ATGAACACTT CGGTGGCTCT GATGCAGTCC TCGCAGATCG GTTTGCATAA ATCGTTGAGG GTCGGGCTGG GTTATGACGT CAAAGTCGGA AATAACGTTA CCTTCACCGT TGGTAAAACG AAAAAGGATG ATACCGGGCA GACCGCGATT TACTCCGCCG GTGAGCATCT GGAGCTCTGC TGTGGTAAGG CAAGGCTGGT GCTGACGAAG GACGGACAAA TTTTTCTCAA CGGCACAAAA ATTCATTTGC AGGGTAAGGA GCAGGTTAAT GGTGACTCAC TGTTGATTAA CTGGAACTGT GCAGCCTCGA AATCCCCACC GAAGACCCCT GATGAAAAGC AGGATACGCC GGATATGAGA GAGTACTGA
|
Protein sequence | MSTGLRFTLE VDGLPPDAFA VVSFHLNQSL SSLFSLDLSL VSQQFLSLEF AQVLDKMAYL TVWQGDDVQR RVKGVVTWFE LGENDKNQML YSMKVCPPLW RTGLRQNFRI FQNEDIESIL ATILKENGVT EWSPLFSEPH PSREFCVQYG ETDYDFLCRM AAEEGIFFYE EHAQKSIDQS LVLCDTVRYL PESFEIPWNP NTRTEVSTLC ISQFRYSAQI RPSSVVTKDY TFKRPGWAGR FDQEGQYQDY QRTQYEVYDY PGRFKGAHGQ NFARWQMDGW RNNAEVARGT SRSPEIWPGR RIVLTGHPQA NLNREWQVVA SELHGEQPQA VPGRRGSGTT LNNHFAVIPA DRTWRPQPLL KPLVDGPQSA VVTGPAGEEI FCDEHGRVRV KFNWDRYNPS NQDSSCWIRV AQAWAGTGFG NLAIPRVGQE VIVDFLNGDP DQPIIMGRTY HQENRTPGSL PGTKTQMTIR SKNYKGSGFN ELKFDDATGK EQVYIHAQKN MNTEVLNNRT TDVINNHAET IGNNQMIAVT NNQIQTVGVN QIETVGSNQI INVGSVQVET IGLVRALTVG VAYQTTVGGI MNTSVALMQS SQIGLHKSLR VGLGYDVKVG NNVTFTVGKT KKDDTGQTAI YSAGEHLELC CGKARLVLTK DGQIFLNGTK IHLQGKEQVN GDSLLINWNC AASKSPPKTP DEKQDTPDMR EY
|
| |