Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A0616 |
Symbol | |
ID | 5594987 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 626778 |
End bp | 628679 |
Gene Length | 1902 bp |
Protein Length | 633 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 640919797 |
Product | type VI secretion system Vgr family protein |
Protein accession | YP_001457379 |
Protein GI | 157160061 |
COG category | [S] Function unknown |
COG ID | [COG3501] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR01646] Rhs element Vgr protein [TIGR03361] type VI secretion system Vgr family protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 0.0499415 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCAACCG GATTACGTTT CACACTGGAA GTGGACGGCC TGCCGCCGGA TGCTTTTGCG GTAGTTTCCT TTCATCTGAA CCAGTCACTC TCTTCGCTCT TTTCCCTCGA TCTCTCCCTG GTCAGCCAGC AGTTTCTCTC CCTTGAATTT GCGCAGGTGC TGGACAAAAT GGCCTACCTG ACGATATGGC AGGGCGATGA AGTACAGCGC CGGGTGAAAG GCGTGGTGAC CTGGTTTGAA CTCGGGGAGA ACGACAAAAA CCAGATGCTG TACAGCATGA AGGTGCACCC GCCGCTGTGG CGTGCCGGTC TGCGCCAGAA CTTCCGTATC TTCCAGAACG AGGACATCAA AAGCATCCTC GGCACGATGT TGCAGGAAAA CGGGGTGACC GAATGGAGTC CGCTGTTCAG CGAGCCGCAT CCTTCCCGTG AGTTTTGTGT CCAGTACGGT GAGACTGATT ACGATTTTCT GTGCCGGATG GCGGCGGAGG AAGGCATCTT CTTTTATGAG GAGCATGCTT ACAAAAGTAC CGACCAGAGC CTGGTGCTGT GCGACACAGT CCGCCATCTG CCCGAATCTT TTGAAATCCC ATGGAACCCG AACACCCGTA CCGAGGTGAG CACCCTCTGC ATCAGCCAGT TCCGCTACAG CGCACAAATC CGCCCTTCTT CCGTGGTGAC CAAAGACTAC ACCTTTAAAC GCCCCGGCTG GGCCGGACGT TTTGAACAGG AAGGCCAGCA CCAGGACTAC CAGCGCACGC AGTATGAAGT GTATGACTAC CCCGGACGTT TCAAGAGCGC CCACGGGCAG AACTTTGCCC GCTGGCAGAT GGACGGCTGG CGAAACAACG CAGAAACCGC GCGGGGAATG AGCCGCTCGC CGGAGATATG GCCGGGACGG CGAATTGTGC TGACGGGGCA TCCGCAGGCG AACCTGAACC GGGAATGGCA GGTGGTGGCA AGTGAACTGC ACGGCGAACA GCCACAGGCG GTGCCAGGAC GGCAGGGAGC GGGGACGGCG CTGGAGAACC ATTTTGCGGT GATCCCGGCA GACAGAACAT GGCGACCACA GCCGTTGCTG AAACCGCTGG TCGACGGCCC GCAGAGCGCT GTCGTGACAG GACCGGCAGG CGAGGAAATC TTCTGCGACG AACATGGTCG CGTGCGGGTG AAGTTCAACT GGGACCGTTA TAACCCGGCA GACCAAGACA GTTCGTGCTG GATCCGTGTG GCACAGGCGT GGGCAGGCAC CGGTTTTGGC CACCTGGCGA TACCGCGTGT GGGTCAGGAG GTGATTGTGG ACTTCCTCAA CGGCGATCCG GACCAGCCGA TCATTATGGG GCGCACCTAC CACCAGGAAA ACCGCACCCC CGGCAGCCTG CCGGGAACAA AGACGCAGAT GACCATCCGC TCCAAAACGT ACATGGGCAG CGGATTTAAT GAGCTGAAGT TTGATGATGC GACAGGGAGA GAACAGGTCT ACATCCACGC GCAGAAGAAC ATGGATACCG AAGTGCTCAA CGACCGTACC ACCACCGTAA AACACGATCA CCGCGAAACC GTAAAAAATG ACCAGACGGT CACGATCCAG GAAGGTAACC GCCTTCTTAC GGTGGAAAAA GGCCACAAGA TCACCGGAGT ACTGAAAGGG TCTTTATCTG AGGATGTCTT TCAGGACAGA GGCACGATTG CCGGTTCGGT GCATGTTGAC GCTGTAAACA ATGGTGGCGA AGGCGACGGT ATACAGGCTT ATACGGCGAT TAAGGAAATT TTGCTGGCTG TGGAGGAAAG CAAAATTGCG CTGACGCCGG ATGGCATTCA GCTACAGGTC GGGGAATCGA CGGTAATCAG GCTGTCGAAG GATGGCATCA CCATCGTGGG CGGTTCTGTT TTCATCAACT GA
|
Protein sequence | MSTGLRFTLE VDGLPPDAFA VVSFHLNQSL SSLFSLDLSL VSQQFLSLEF AQVLDKMAYL TIWQGDEVQR RVKGVVTWFE LGENDKNQML YSMKVHPPLW RAGLRQNFRI FQNEDIKSIL GTMLQENGVT EWSPLFSEPH PSREFCVQYG ETDYDFLCRM AAEEGIFFYE EHAYKSTDQS LVLCDTVRHL PESFEIPWNP NTRTEVSTLC ISQFRYSAQI RPSSVVTKDY TFKRPGWAGR FEQEGQHQDY QRTQYEVYDY PGRFKSAHGQ NFARWQMDGW RNNAETARGM SRSPEIWPGR RIVLTGHPQA NLNREWQVVA SELHGEQPQA VPGRQGAGTA LENHFAVIPA DRTWRPQPLL KPLVDGPQSA VVTGPAGEEI FCDEHGRVRV KFNWDRYNPA DQDSSCWIRV AQAWAGTGFG HLAIPRVGQE VIVDFLNGDP DQPIIMGRTY HQENRTPGSL PGTKTQMTIR SKTYMGSGFN ELKFDDATGR EQVYIHAQKN MDTEVLNDRT TTVKHDHRET VKNDQTVTIQ EGNRLLTVEK GHKITGVLKG SLSEDVFQDR GTIAGSVHVD AVNNGGEGDG IQAYTAIKEI LLAVEESKIA LTPDGIQLQV GESTVIRLSK DGITIVGGSV FIN
|
| |