Gene Information |
Locus tag | VC0395_0947 |
Symbol | |
ID | 5134834 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Vibrio cholerae O395 |
Kingdom | Bacteria |
Replicon accession | NC_009456 |
Strand | - |
Start bp | 923456 |
End bp | 924436 |
Gene Length | 981 bp |
Protein Length | 326 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640531269 |
Product | IS5 transposase |
Protein accession | YP_001215783 |
Protein GI | 147671737 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3039] Transposase and inactivated derivatives, IS5 family |
TIGRFAM ID | |
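The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (which is consistent with the listed 981 bp) and that the CDS ends in a single stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 923456
end_bp = 924436
protein_length_aa = 326

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not
# part of the translated protein.
codons = gene_length_bp // 3
expected_protein_length = codons - 1

print(gene_length_bp, codons, expected_protein_length)  # 981 327 326
```

Under these assumptions the listed gene length (981 bp) and protein length (326 aa) agree with each other.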
|
|
Plasmid Coverage information |
Num covering plasmid clones | 39 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTCATC AACTGACTTT CGCAGACGGT GAGTTTTCCA ATAAACGTCG ACAAACCCGC AAAGAGCTCT TCCTCGCCAG AATGGAGAAG CTCCTACCAT GGTCTCAGTT GCTCGCGGTG ATCGAACCCT TTTATCCCAA GGCGGGCAAT GGCCGCCGCC CTTATCCTCT CGAAACCATG TTCCGCATCC ACTGTATGCA GCAGTGGTAC AGCTTAAGTG ACGAAGCGAT GGAAGATGCA CTCTATGAGA TCGCGTCCAT GCGGTTATTT GCCCATCTTT CGCTGGACAG AGCCATTCCC GACCGCACTA CCATCATGAA CTTCCGCCAC TTGTTAGAGC AGCATCAGCT GGGACGCAGT GTGTTCGAGC CGATCAATCA ATGGCTCAGC GAGCGCGGCG TGCTGATGAA GCAAGGCACG TTGGTCGATG CGACGATTAT CGAAGCGCCC AGCTCGACCA AGAACAAAAC CAACCAACGT GATCCCGGAA TGCACCAGAC CAAGAAAGGC AATGAGTGGC ACTTCGGTAT GAAGGCACAT ATTGGTGTGG ATGCCAAAAG TGGCCTCACT CATACACTGG TGACTACTGC CGCTAACGAG CATGATCTGA ATCAATTGAG CAACCTGCTA CACGGTGATG AAGAATTCGT CTCCGGTGAT GCAGGCTACC AAGGTGCACA CAAGCGCGAC GAGCTGAAGG GGGCAGACGT TGATTGGCTG ATAGCCGAAC GTCCCGGTAA AGTTCGCGCC CTGAAAAAGC ACCCTCGCAA AAACAAAGTG GCCATCCATA TCGAATACTT GAAAGCCAGC ATTCGGGCCA AAGTCGAGCA CCCGTTTCGC ATCATTAAAT GCCAGTTTGG TTTTATCAAA GCCCGCTACA AAGGCCTGAT GAAAAACGAC AACCAATTAG CCATGCTATT CACCTTGGCG AATCTGGTCA AAGTCGACCA ACTGATACGA CGACAGGCGA GATCTGCCTG A
|
Protein sequence | MSHQLTFADG EFSNKRRQTR KELFLARMEK LLPWSQLLAV IEPFYPKAGN GRRPYPLETM FRIHCMQQWY SLSDEAMEDA LYEIASMRLF AHLSLDRAIP DRTTIMNFRH LLEQHQLGRS VFEPINQWLS ERGVLMKQGT LVDATIIEAP SSTKNKTNQR DPGMHQTKKG NEWHFGMKAH IGVDAKSGLT HTLVTTAANE HDLNQLSNLL HGDEEFVSGD AGYQGAHKRD ELKGADVDWL IAERPGKVRA LKKHPRKNKV AIHIEYLKAS IRAKVEHPFR IIKCQFGFIK ARYKGLMKND NQLAMLFTLA NLVKVDQLIR RQARSA
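As a rough illustration of how the two sequence fields relate, the sketch below translates a DNA string with the amino-acid assignments of NCBI translation table 11 and computes a GC fraction. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical and not part of the database. The sketch only reads codons in frame from the first base and stops at the first stop codon; it does not handle alternative start codons, which is not an issue here because the gene begins with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for NCBI translation table 11, with codons ordered
# TTT, TTC, TTA, TTG, TCT, ... (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence above.
first_blocks = "ATGAGTCATC AACTGACTTT CGCAGACGGT"
print(translate(first_blocks))  # MSHQLTFADG
```

The output matches the first ten residues of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein and the listed GC content.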