Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | VC0395_A0787 |
Symbol | |
ID | 5135969 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Vibrio cholerae O395 |
Kingdom | Bacteria |
Replicon accession | NC_009457 |
Strand | - |
Start bp | 796456 |
End bp | 798459 |
Gene Length | 2004 bp |
Protein Length | 667 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640532245 |
Product | transposase |
Protein accession | YP_001216737 |
Protein GI | 147675169 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0000128606 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTGGGTTA CAGCAAAAGA ATGCGCAGGC GTTGGTGGAT TTCCAACAGA TAAAAACAAC GCAAGAAATA GATTGGTGAA GCTGGCTTCT GGTCAGCCAG ACATGCAAAG AAAAAGAGAA GGTACTAAAG CCTTTGAATT TCACATCAGC ATCTTACCTC CTCAAACACA AGCCGCCCTC CTAAAGAAAA CGGGCAAGGT CAAAGTTGGT GAGCAAGTCA TCAACCTGCC AAAAGCAAAG GCAGAGAAAA CCTATTGTCG CGAATCGCTG TGGGCATCGT GGGGTAAAAC CAACGACAAA ACCAAGCAGA AAGCACATCA GGCCTTGCGC TGCGTTCAAG CAGTGAAAGC GTTGGAACAA AACGGCATCA ACCGCATGCA CGCATACCAA ACCGTGTGTG ATGAATACGG AATCCCACTT TCAACGCTGC GCCGTCACGT TGCCAAAGTG AAAGACATTG ACGAATGCGA CTGGCTACCT GCGCTGCTCA CCAAGCACTT TGAAACCGCG CAACTGCGCA AGGTGGAGCA CTTTGCCCAC ATCACGCCAG AGGCTTGGGA GTTCTTCAAA GGTGATTACC TTCGCCTAGA GCGCCCAACC ATGAGCGTGT GCTACGAGCG CCTAAAGAAA GTGGCAGTGC AAAACGGCTG GGCAATCCCA AGCCTGAAAA GCCTCAGCCG CCGACTAGAA GCCGAAGTGC CTATCCAAGC GCGCGTCATG CTGCGCGAAG GTGAGCACGC CTTGCACCAA ATGTTTCCGC CGCAAGAGCG CAGCGTGCTT GAGCTGCATG CCCTCGAATG GATTAACGGA GACGGCTACC AACACAACGT GTTTGTGCGT TGGTTCAATG GCGAAATCGT GCGCCCTAAA ACATGGTTCT GGGCTGATGT TTACAGCCGC AAGATTCTGG GTTGGCGCTG TGATCTCAGC GAAAACACAG ACAGCATCCG CCTTAGCTTT ATGGATGTGA TCGAGCGCTA CGGCATACCC AAGCACATCA CCATTGATAA CACCCGTGCG GCGGCAAACA AGTGGATGAC GGGCGGCGTG CCGAATCGTT ACCGCTTCAA AGTAAAGCCA GACGACCCTA AAGGCCTGAT GACCATGTTA GTCGGTGAGC GAAACATCCA CTGGACAAGC GTGATCCTCG GTAAAGGTCA TGGTCAGGCA AAGCCCATCG AGCGTGCGTT TGGTGTGGGT GGCCTTGAAG AGTACATCGA CAAACAACCG ATTAACGAAG GCGCTTATAC AGGCCCGAAC CCAATGGCGA AGCCTGAGAA CTACGGCGAC AAAGCCATTG ATGCGGATGC GTTCCTAAAG TCCATCGCTC TCGGTGTGGA AATGTTCAAC CAAAAGAGCA ACCGCAATAA CGAAGTGTGT CGCGGCTTTA TGAGCTACGA AGAAGCGTTT AACGCCAGCT ACCAAAGTGC GCCGATTAAA AAGGCCACCA AAGAGCAGTT GCAAATGCTG ATGCTATCGG CGGAAGCCTG TCGCGTATCG CGCCACGGCA CCATCACGCT CGATGCGGGT GGCACGTTGG CAGGTCGCAA AAACCGCTAC TTCAACGAAG TAATGATGAA CTACATCGAC CAAAAACTGG TGGCACGTTT TGACCCTATC AAGCTGCACG AGTCGGTAGA GATCTACACC CTAAACGGTG TTTACCTCTG CACTGCTGAG TGCGTTGAAA AGGTTGGCTT CGGCGATACC CAAGCCGCGA GAGAGCACAA ACGCAAGCGC ACCCAGTTTA CCAAAGCGAA CAAAGCCGCC GCCCAAGCGC AGCGTGAAAT GAGCGCATTG GAAGTGGCCG CGATGATGCC AGAGCCAGAA GAAGAAGTGA TCCCAGAAGC AAAAGTGGTC GAGGTGTATC GCCCTGTCGC TATCGGAAAC ACCGCCGCCG CGATTCGCCA GCAAGAGCAA ATCGAAACGG AAGAAGATTT GGAAGCGAAC TACCAAGCCA GCGTTGCCAG CCTGATGGCT CAACGCCTGA AAAACCGACT TTAA
|
Protein sequence | MWVTAKECAG VGGFPTDKNN ARNRLVKLAS GQPDMQRKRE GTKAFEFHIS ILPPQTQAAL LKKTGKVKVG EQVINLPKAK AEKTYCRESL WASWGKTNDK TKQKAHQALR CVQAVKALEQ NGINRMHAYQ TVCDEYGIPL STLRRHVAKV KDIDECDWLP ALLTKHFETA QLRKVEHFAH ITPEAWEFFK GDYLRLERPT MSVCYERLKK VAVQNGWAIP SLKSLSRRLE AEVPIQARVM LREGEHALHQ MFPPQERSVL ELHALEWING DGYQHNVFVR WFNGEIVRPK TWFWADVYSR KILGWRCDLS ENTDSIRLSF MDVIERYGIP KHITIDNTRA AANKWMTGGV PNRYRFKVKP DDPKGLMTML VGERNIHWTS VILGKGHGQA KPIERAFGVG GLEEYIDKQP INEGAYTGPN PMAKPENYGD KAIDADAFLK SIALGVEMFN QKSNRNNEVC RGFMSYEEAF NASYQSAPIK KATKEQLQML MLSAEACRVS RHGTITLDAG GTLAGRKNRY FNEVMMNYID QKLVARFDPI KLHESVEIYT LNGVYLCTAE CVEKVGFGDT QAAREHKRKR TQFTKANKAA AQAQREMSAL EVAAMMPEPE EEVIPEAKVV EVYRPVAIGN TAAAIRQQEQ IETEEDLEAN YQASVASLMA QRLKNRL
|
| |