Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | VEA_001507 |
Symbol | |
ID | 8559819 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Vibrio sp. Ex25 |
Kingdom | Bacteria |
Replicon accession | NC_013457 |
Strand | - |
Start bp | 1699867 |
End bp | 1702893 |
Gene Length | 3027 bp |
Protein Length | 1008 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 646409176 |
Product | MSHA biogenesis protein MshQ |
Protein accession | YP_003288655 |
Protein GI | 262396802 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.101461 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGCGTC TTTGCAGTAT TTTCGCTGCG GTTCTTGTTT TACCGCTAGC CTCACACGCC ACCACTTATG ACGTCAAAGA TCTATCTGAG TTAGGTCAAC TTTGCTCACA CAAGAGTTTT AGTATCGGAA GCGATGGCAC GACTTATCGC TGTACAGGGG AAATAGTCCT ACCCGCAGGT GACAAAATTA TCTCCTCATT ACCGGAAAAA GAGATAACGT TAAAAGCAGA TTATGGCATT GTGTTAGAGG GGAACAATAC GATTGGTGAC CCCAATAAGC GTATCTCTCT ACACGCACTC GCTACCGGAT TAACGGTTAA CAATGAGAAA GCTTCAAGCA TTGACGATTA CCAAAACAAG CACACTGTTA TCTATGGTGA TCTTCTATCG GCATATCCAA CAAAGCTGCA TAACATTCTC ATAGACGGGA ATATTAAGCT AACAGGTTCG ACACTTCACA TTGATGGAGC GCACAACACC ATTATTGGTG ATATTTCGAC ACACAGCACG ACCGATATAT TCAATGCTAA CGTTTGTGGC ACCATTGAAT CAAAAGGCCA TCAATTGCAT CTCAAGACGA AGAACCCTCT CCAACGACAT TTCGTCGTCG GTGATATTGT GGCGCACAGT TCTCTGCTTA TTGATGACAT GGAGATCTAC GGCACTACTG AGACTAAAGG AGCATCTGCC GCATTGAACG GTTCGTTCTA TGACTCTACT ACAGCAATTA AATACTTCCA AACATTGAAC TTCAACAACG CTCCAAATCA GCCAAGTCAA GTTTGTGGTG AAATCCAACA AACGCCAGGA AGTGCACAAA CGAAAGCACA AGACAACAAA TACACGCAAT ACTGTGGATT GGGCGACACT GATTGTCACT ACTCTTCTCA GCTCTGTCCG ATCACCCCTC AAGATATTCC GAATTGCAGC ATGCAACCTC CATCAGAGAA CGACTTCGAC CTAAACGTCA CACCATCGGA CGACATGGCA CTGATGTGCG GCGACGATCT TCCGCAATTT ACAGCAACAA CAACCAATAA TGGAGAAGTT GTAAGCGCAG AAGTCAGTGC TGCCCTATCA CACCCTGACT TGTTCACTTT AGAGGTGGTC AAAGGTCAGC AGACATTAAA AGCAAACCAA TTTATGTCTG ACGACAATGG TAAGCTCGTT GTTCGAGTGG TCCCGAACGA TATTGATACC ATCGCGTTAG ATACGAACTA CACGCTGACT TTCACTATGG CAGAAGACAC CAGCAAGCAT CAAACGGTTA GCTTTATGTT TACGCCATAT ATGTTTGAGG CCTACTCAAA GACGAAAAGC GCAACTTTAA ATGAAATACG CGTTATTGCA GGTAAACCAG AAAACGTGCA TACGCGATTG CTGGCTTGTG CTTCCACCGG CGAACCTGTA GTCGCAAGCA ACTACAATGG TACTCCTAAG ATTTTTCATC CATTGGTTCA ACCATTAGGA GGAAGTGAAG GAAACTTTAG CTATTCAGCG GAATTTAAAG ACGGCTTGTC TGAGCATGGT CTTATCACGA ACGAGTCCGG CATATTTGAA GTGACATTGT CCGATCATTT TGAATGTGAG GGATTTGAAG AGTGCCCAGA AGAAGGCAAA GTGGAAGTGA AAGGAACATT TAATGTGTAC TCGCGACCAT GGACATTGGC TATCTGTGAC AACCAACGTG CTTTGCCTTC TGGCACCTCT GAACAAGGAG ATGGGTTTAT TGCTGCCGGA GAACTTTTCT CTCTAACAGT GAAACCCATC ATTTGGCAAC CTGACGGCTC TGTAAGTGGC TCTGTGGAAT CAGCACGTTA TTGTGATGCC TCTATCACCC GTAATTTTAT GCTTGATGAC GCTCCAGCAG CATCGGTAGT CTTGTCGAGT GAGCAACATA CGCCATCAGA GACAGCGAAC CAAACCTCGA AATTGCTAAA AAGTAGTGAC TCACTGGCTC AGGCGCATAA TTCAGCCAAG AATGATCAAT TCGTCTTTAA TGGGTTATAT TGGGAAGAAG TCGGGAGCTT GAGAGTAAAA GCAAACCTAG AAAGTAAATA TCTAGGTATG ACTGTTAACG AAGGCTATCG TTACATCGGT CGTTTTTACC CAAAATATTT TCAAGTGCAA GACCAAGAAT GGCACTACCC ACATGGCCAA ACTTTTGCTT ACATGAATCA ACCTTTTGAG AAAGTCACTT ACGATGTTGT CGCACTCAAT GCGAATAAGG AAAACGTGAA GAACTACGTC CACTTTTCAT CTAACCTTCA ACAACATTTC GACTTAGGCG AACTAAGCTC ATACTCCGAA CGGTTTTTGC CGCCTAAGCC TAAGAAGGTA GACTGGAAAC TACTGGGTAA CGCGAGTGTT GGAACGTTTG TGATCGAGAA GCCCTCGACC AATGCGACTT GTAACAATAG CCCATGTTGG AAAAAGAACA CAACCGATAA ACAATACCCT GATGGTCCAT TTAATAATGG GACTAACAGT GACAGCAGCA AAATAGGTAT AGTCACGACT AAAGCCGTTG ATGAGGTAAA TTTCTTTGAT AACAGCGAAG TGCTCACTCA ACAACCGGAC ATTCGATTCG GTCGTTTGAA CTTTCAAGAT GTGGGTGGCA ACCAAGGCAT GGTGATCAAA GTGCCACTTG ATGTGGAAAT TTGGCAAAAC GGTCGATTTA CGACTAACTT CGATGACAGC TCAACAACGG CGAATGGTGA ATATTACTTC AGCACGCCAA TTTGGTCGCA TGCGCCAGCA AACAACGCCC TGCTTTCTGG CGTTGGAACG ATGAGTATTG GCAGGATGAC CGATATCGTT GCAAGCCAAA TTGACTCAGC GCGCGAGCAA ATACAATTCC ACCTAGATTT AGACAGCTCA GGCAATCGCA TCCCTTGGCT GAAATACGAC TGGGATAGCT CTACAAGTGA AGAGGAAAAC CCACCCGTCA TTGTTACGTT TGGTATTCAC CGAGGAAACG ATCGCATTAT TTATCGTGGT GAACCCAATA TGCTTGGCCT AAATTAA
|
Protein sequence | MMRLCSIFAA VLVLPLASHA TTYDVKDLSE LGQLCSHKSF SIGSDGTTYR CTGEIVLPAG DKIISSLPEK EITLKADYGI VLEGNNTIGD PNKRISLHAL ATGLTVNNEK ASSIDDYQNK HTVIYGDLLS AYPTKLHNIL IDGNIKLTGS TLHIDGAHNT IIGDISTHST TDIFNANVCG TIESKGHQLH LKTKNPLQRH FVVGDIVAHS SLLIDDMEIY GTTETKGASA ALNGSFYDST TAIKYFQTLN FNNAPNQPSQ VCGEIQQTPG SAQTKAQDNK YTQYCGLGDT DCHYSSQLCP ITPQDIPNCS MQPPSENDFD LNVTPSDDMA LMCGDDLPQF TATTTNNGEV VSAEVSAALS HPDLFTLEVV KGQQTLKANQ FMSDDNGKLV VRVVPNDIDT IALDTNYTLT FTMAEDTSKH QTVSFMFTPY MFEAYSKTKS ATLNEIRVIA GKPENVHTRL LACASTGEPV VASNYNGTPK IFHPLVQPLG GSEGNFSYSA EFKDGLSEHG LITNESGIFE VTLSDHFECE GFEECPEEGK VEVKGTFNVY SRPWTLAICD NQRALPSGTS EQGDGFIAAG ELFSLTVKPI IWQPDGSVSG SVESARYCDA SITRNFMLDD APAASVVLSS EQHTPSETAN QTSKLLKSSD SLAQAHNSAK NDQFVFNGLY WEEVGSLRVK ANLESKYLGM TVNEGYRYIG RFYPKYFQVQ DQEWHYPHGQ TFAYMNQPFE KVTYDVVALN ANKENVKNYV HFSSNLQQHF DLGELSSYSE RFLPPKPKKV DWKLLGNASV GTFVIEKPST NATCNNSPCW KKNTTDKQYP DGPFNNGTNS DSSKIGIVTT KAVDEVNFFD NSEVLTQQPD IRFGRLNFQD VGGNQGMVIK VPLDVEIWQN GRFTTNFDDS STTANGEYYF STPIWSHAPA NNALLSGVGT MSIGRMTDIV ASQIDSAREQ IQFHLDLDSS GNRIPWLKYD WDSSTSEEEN PPVIVTFGIH RGNDRIIYRG EPNMLGLN
|
| |