Gene ECH74115_2064 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_2064 
Symbol 
ID6970938 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp1958329 
End bp1960437 
Gene Length2109 bp 
Protein Length702 aa 
Translation table11 
GC content55% 
IMG OID643385976 
Producttype VI secretion system Vgr family protein 
Protein accessionYP_002270465 
Protein GI209400664 
COG category[S] Function unknown 
COG ID[COG3501] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR01646] Rhs element Vgr protein
[TIGR03361] type VI secretion system Vgr family protein 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.201838 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value0.0713545 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAACCG GATTACGTTT CACGCTGGAA GTGGACGGCC TGCCACCGGA CGCGTTTGCG 
GTGGTCTCCT TTCATCTGAA CCAGTCACTC TCTTCACTTT TTTCCCTCGA CCTTTCTCTG
GTCAGTCAGC AGTTTCTTTC CCTTGAATTT GCGCAGGTGC TGGACAAAAT GGCCTACCTG
ACGGTATGGC AGGGCGATGA CGTACAGCGC CGGGTGAAAG GTGTGGTGAC CTGGTTTGAA
CTGGGGGAGA ACGACAAAAA CCAGATGCTG TACAGCATGA AGGTGTGCCC GCCGCTGTGG
CGCACAGGGC TGCGCCAGAA CTTCCGTATC TTCCAGAATG AGGACATCGA AAGCATACTC
GCTACGATCC TGAAAGAAAA CGGTGTGACC GAGTGGAGTC CGCTGTTCAG CGAGCCGCAT
CCTTCCCGTG AGTTTTGTGT CCAGTACGGC GAAACTGATT ACGATTTCCT GTGCCGGATG
GCGGCGGAGG AAGGCATCTT CTTTTATGAG GAGCACGCGC AAAAAAGTAT CGACCAGAGC
CTGGTCCTGT GCGACACCGT GCGTTATCTG CCGGAGTCCT TTGAGATCCC CTGGAACCCG
AACACCCGTA CCGAGGTGAG CACCCTATGC ATCAGCCAGT TCCGCTACAG CGCACAAATC
CGCCCTTCTT CCGTGGTGAC CAAGGACTAC ACCTTTAAAC GACCCGGCTG GGCAGGGCGT
TTTGATCAGG AAGGCCAGTA CCAGGATTAC CAGCGCACAC AGTATGAAGT GTATGACTAC
CCCGGACGTT TCAAGGGTGC CCACGGGCAG AACTTTGCCC GCTGGCAGAT GGATGGCTGG
CGCAACAACG CAGAAGTGGC GCGCGGAACA AGCCGTTCTC CGGAAATATG GCCGGGACGG
CGAATTGTGC TGACGGGGCA TCCGCAGGCG AACCTGAACC GGGAATGGCA GGTGGTGGCA
AGTGAACTGC ACGGCGAGCA GCCACAGGCG GTGCCGGGAC GCAGGGGTTC AGGTACCACG
CTGAATAACC ACTTTGCGGT AATACCGGCA GACCGGACAT GGCGACCACA GCCGTTGCTG
AAACCGCTGG TGGATGGCCC GCAGAGCGCC GTCGTGACGG GACCGGCAGG CGAGGAAATC
TTCTGCGACG AACATGGTCG CGTGCGGGTG AAATTTAACT GGGACCGCTA TAACCCGTCA
AACCAGGACA GTTCATGCTG GATCCGTGTG GCACAGGCGT GGGCAGGCAC CGGCTTTGGC
AACCTGGCGA TACCGCGTGT GGGTCAGGAG GTGATTGTGG ACTTCCTCAA CGGCGATCCG
GACCAGCCGA TTATTATGGG GCGTACCTAC CACCAGGAAA ACCGCACACC CGGCAGCCTG
CCGGGGACGA AGACGCAGAT GACCATTCGT TCGAAAAACT ATAAGGGCAG CGGGTTTAAT
GAACTGAAGT TTGACGATGC GACAGGGAAA GAACAGGTCT ACATCCACGC GCAGAAGAAC
ATGAACACCG AGGTGCTGAA TAACCGCACC ACGGATGTGA TAAACAACCA TGCTGAGACC
ATTGGCAACA ATCAGATGAT TGCGGTTACC AACAATCAGA TACAGACGGT GGGCGTTAAC
CAGATAGAGA CGGTGGGCAG TAACCAGATC ATTAACGTGG GTTCTGTTCA GGTGGAAACG
ATTGGACTTG TTCGTGCGCT GACCGTGGGC GTGGCGTACC AGACGACGGT AGGTGGCATT
ATGAACACTT CGGTGGCTCT GATGCAGTCC TCGCAGATCG GTTTGCATAA ATCGTTGAGG
GTCGGGCTGG GTTATGACGT CAAAGTCGGA AATAACGTTA CCTTCACCGT TGGTAAAACG
AAAAAGGATG ATACCGGGCA GACCGCGATT TACTCCGCCG GTGAGCATCT GGAGCTCTGC
TGTGGTAAGG CAAGGCTGGT GCTGACGAAG GACGGACAAA TTTTTCTCAA CGGCACAAAA
ATTCATTTGC AGGGTAAGGA GCAGGTTAAT GGTGACTCAC TGTTGATTAA CTGGAACTGT
GCAGCCTCGA AATCCCCACC GAAGACCCCT GATGAAAAGC AGGATACGCC GGATATGAGA
GAGTACTGA
 
Protein sequence
MSTGLRFTLE VDGLPPDAFA VVSFHLNQSL SSLFSLDLSL VSQQFLSLEF AQVLDKMAYL 
TVWQGDDVQR RVKGVVTWFE LGENDKNQML YSMKVCPPLW RTGLRQNFRI FQNEDIESIL
ATILKENGVT EWSPLFSEPH PSREFCVQYG ETDYDFLCRM AAEEGIFFYE EHAQKSIDQS
LVLCDTVRYL PESFEIPWNP NTRTEVSTLC ISQFRYSAQI RPSSVVTKDY TFKRPGWAGR
FDQEGQYQDY QRTQYEVYDY PGRFKGAHGQ NFARWQMDGW RNNAEVARGT SRSPEIWPGR
RIVLTGHPQA NLNREWQVVA SELHGEQPQA VPGRRGSGTT LNNHFAVIPA DRTWRPQPLL
KPLVDGPQSA VVTGPAGEEI FCDEHGRVRV KFNWDRYNPS NQDSSCWIRV AQAWAGTGFG
NLAIPRVGQE VIVDFLNGDP DQPIIMGRTY HQENRTPGSL PGTKTQMTIR SKNYKGSGFN
ELKFDDATGK EQVYIHAQKN MNTEVLNNRT TDVINNHAET IGNNQMIAVT NNQIQTVGVN
QIETVGSNQI INVGSVQVET IGLVRALTVG VAYQTTVGGI MNTSVALMQS SQIGLHKSLR
VGLGYDVKVG NNVTFTVGKT KKDDTGQTAI YSAGEHLELC CGKARLVLTK DGQIFLNGTK
IHLQGKEQVN GDSLLINWNC AASKSPPKTP DEKQDTPDMR EY