Gene Information |
Locus tag | VC0395_A2823 |
Symbol | mshE |
ID | 5137190 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Vibrio cholerae O395 |
Kingdom | Bacteria |
Replicon accession | NC_009457 |
Strand | + |
Start bp | 2973460 |
End bp | 2975187 |
Gene Length | 1728 bp |
Protein Length | 575 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640534267 |
Product | MSHA biogenesis protein MshE |
Protein accession | YP_001218673 |
Protein GI | 147673996 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB |
TIGRFAM ID | |
|
|
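The coordinate, gene length, and protein length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; the function names are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    # The stop codon occupies one codon but adds no residue.
    return codons - 1 if has_stop_codon else codons


length = gene_length(2973460, 2975187)
print(length)                  # 1728
print(protein_length(length))  # 575
```

Both values agree with the Gene Length (1728 bp) and Protein Length (575 aa) fields in this record.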
Plasmid Coverage information |
Num covering plasmid clones | 41 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCCAATTA ATAAACTGCG TAAACGGCTT GGTGACTTGC TGGTTGAAGA GGGGATTGTG AGCGAAGCGC AGCTTGAACA AGCGCTCAAT GCCCAGAAAA ATACCGGACG CCGCTTAGGG GATACCTTAA TTTCGCTTGG CTTTTTAAGC GAAACCCAGT TGCTGAACTT TTTGGCGCAG CAATTGAGTT TGCCAGTGAT CGATCTGAGC CGTGCCCATG TCGATATTGA TGCTGTTCCT TTACTGCCTG AAGTTCATGC CCGCCGTTTG CGTGCGCTGG TGATAGGCCG CAGTGGTGAT ACGCTGCGCA TAGCCATGAG TGATCCTGCG GATTTGTTTG CTCAAGAAGC CTTGCTCAAC CAGTTGCCTG ATTATGGTTT TGAGTTTGTC ATCGCCCCTG AGAAGCAGTT GGTCGATGGC TTTGATCGTT ACTATCGCCG CACCAAAGAA ATTGTCTCGT TTGCCGAGCA ATTGCATGCC GAACACAAAA CCAATGATAG TTTTGATTTT GAGATCACCG ATTCAGATAG CGATGAAGTG ACGGTTGTTA AACTGATCAA CTCGCTGTTT GAAGATGCGA TTCAAGTGGG AGCCTCGGAT ATACATATTG AGCCGGATGC CAACGTACTG CGTTTGCGCC AACGGATCGA TGGGGTATTG CATGAAACCT TGCTCCATGA AGTGAATATC GCCTCAGCCT TAGTGCTGCG CTTAAAATTA ATGGCGAATC TGGATATTTC AGAAAAACGC CTTCCGCAAG ATGGCCGCTT TAACATCCGT GTAAAAGGAC AATCCGTCGA TATCCGGATG TCAACCATGC CTATCCAGCA CGGTGAATCG GTGGTGATGC GTTTGCTCAA CCAATCTGCA GGGGTTAGAA AGCTCGAAGA GTCAGGTATT CCGCCTCATT TATTGCTGCG TTTGCGCCAT CAGTTGAAAC GCCCTCATGG CATGATTCTG GTGACTGGGC CTACCGGCTC GGGGAAAACC ACCACCTTGT ATGGCGCACT CAACGAGCTC AATACGTCCG GCAAAAAAAT CATTACTGCG GAAGATCCGG TGGAATACCG GATTTCACGG ATCAATCAGG TGCAAGTGAA CCCGAAAATC AATCTCGATT TCTCGACGTT ACTGCGCACT TTCTTGCGCC AAGACCCAGA TATTATTTTG GTCGGTGAGA TGCGTGACCA AGAGACGGTC GAAATTGGCC TGCGCGCTGC ATTAACCGGT CACTTAGTCT TAAGTACTCT GCACACCAAT GATGCCGTCG ATAGCGCGCT GCGTTTGATT GATATGGGGG CGCCCGGTTA TCTGGTTGCC AGTGCGGTAC GTGCCGTGGT TGCGCAGCGT TTAGTGCGAA AAGTCTGTCC AGATTGTAGT GGACATGATG AGGTGGATGA AGCGCGTCGC CAATGGCTTG TGACGCGTTT CCCTAATCAA GCTGCCGCCA AATTCACTCG TGGACGCGGC TGCCAGAACT GTAACTTAAC CGGTTATCGC GGCCGGATCG GTGTATTTGA AATGCTTGAG CTGGATCAGC CGATGATGGA TTGCTTGCGG GCAAATGATG CTGTGGCCTT CTCTAAAGCC GCACGCAGTA ACACAGATTA TAAACCGCTA CTGGCTTCGG CGATGGAGTT AGCGCTACAA GGCATCGTCA GTCTTGATGA AGTGATGTCG CTGGGTGAAG GTGATTCCTC TGGTTTGGTT GAACCGATTT ATCTGTAG
|
Protein sequence | MPINKLRKRL GDLLVEEGIV SEAQLEQALN AQKNTGRRLG DTLISLGFLS ETQLLNFLAQ QLSLPVIDLS RAHVDIDAVP LLPEVHARRL RALVIGRSGD TLRIAMSDPA DLFAQEALLN QLPDYGFEFV IAPEKQLVDG FDRYYRRTKE IVSFAEQLHA EHKTNDSFDF EITDSDSDEV TVVKLINSLF EDAIQVGASD IHIEPDANVL RLRQRIDGVL HETLLHEVNI ASALVLRLKL MANLDISEKR LPQDGRFNIR VKGQSVDIRM STMPIQHGES VVMRLLNQSA GVRKLEESGI PPHLLLRLRH QLKRPHGMIL VTGPTGSGKT TTLYGALNEL NTSGKKIITA EDPVEYRISR INQVQVNPKI NLDFSTLLRT FLRQDPDIIL VGEMRDQETV EIGLRAALTG HLVLSTLHTN DAVDSALRLI DMGAPGYLVA SAVRAVVAQR LVRKVCPDCS GHDEVDEARR QWLVTRFPNQ AAAKFTRGRG CQNCNLTGYR GRIGVFEMLE LDQPMMDCLR ANDAVAFSKA ARSNTDYKPL LASAMELALQ GIVSLDEVMS LGEGDSSGLV EPIYL
|
| |
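The gene and protein sequences above are related through translation table 11, where an alternative start codon such as GTG is read as methionine at the first position. The following is a minimal sketch of that relationship, not the database's own pipeline. It assumes the amino acid assignments of the standard code (which table 11 shares) and, as a simplification, treats only ATG, GTG, and TTG as start codons. All names are hypothetical.

```python
BASES = "TCAG"
# Amino acid assignments in TCAG codon order (shared by tables 1 and 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Assumption: only the most common bacterial start codons are handled.
COMMON_STARTS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[pos:pos + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if pos == 0 and codon in COMMON_STARTS:
            aa = "M"  # an initiating codon is read as methionine
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence in this record.
print(translate_cds("GTGCCAATTA ATAAACTGCG TAAACGGCTT"))  # MPINKLRKRL
```

The printed peptide matches the first ten residues of the protein sequence field. Passing the full gene sequence to `translate_cds` and `gc_fraction` would allow the same comparison against the complete protein sequence and the GC content field.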