Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | VC0395_A2062 |
Symbol | |
ID | 5137367 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Vibrio cholerae O395 |
Kingdom | Bacteria |
Replicon accession | NC_009457 |
Strand | - |
Start bp | 2220585 |
End bp | 2221673 |
Gene Length | 1089 bp |
Protein Length | 362 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 640533519 |
Product | hypothetical protein |
Protein accession | YP_001217986 |
Protein GI | 147673059 |
COG category | [C] Energy production and conversion [G] Carbohydrate transport and metabolism |
COG ID | [COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase |
TIGRFAM ID | [TIGR00661] conserved hypothetical protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 43 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAAAATAC TGTATGGCGT TCAAGGAACC GGGAATGGGC ATATTGCGCG TTCAAGAGCC ATGTGTGCGG CACTTAAGCA GCAGCAAGTT GATGTGGATT ATCTGTTCTC AGGACGTCCT GTTGAGAACT ATTTTTCAAT GGAATGCTTT GGTGATTTTG CGACTCGACG AGGGCTGACC TTTGCCACCG AAAATGGCCA CGTCAATTAC GTTAAAACGT TACGTAAGAA TAATCTGTTG CAGTTTTGGA ATGAGGTGAA GCGCTTAGAT CTTTCTGGTT ATGACCTGAT TTTAAACGAT TTTGAGCCAG TGACGGCTTG GGCTGCCAAA TTACAAAATA TTCCCTGTAT TGGAATTAGT CATCAGAATG CCTTCTTGTA TCCAGTGCCA TTAAAAGGTG CTTCTTGGCT GGATAAAGCG ATCTTGCGGC ATTTTGCGCC AGCCCAGTAC CATTTAGGTT TACATTGGTA CCATTTTGAA CAGCCTATTT TGCCTCCCAT TATTTATGCG CCGGAGCAGC CGCTGAGTCA GCAGAACTTT ATATTAGTCT ACCTTCCATT TGAAAATGTG AATGAAATCT GTGAGTTACT GTATAGATTT ACCAATATTC ACTTCATTTG TTATCACCCC GAGGTGCCAG AAAACGAGTT GACGGAAAAT GTCGAACTGC GTCGTTTGCA TCATGGTGAT TTTCAGCATC ATTTACACCA ATGCAGCGGA GTGATCACCA GTGGTGGTTT TGAGTTGCCT TCGGAAGCTT TAGCGTTAGG CAAAAAATTA TTAATGAAGC CATTGGCTGG CCAATTTGAA CAGGTGAGTA ATGCGGCAAC CCTTGAAACA CTTGGGCTGG CAAGTGTGAT GGAGTTTCTT GACCCTGCCT GCTTACGTCT GTGGCTTGAT GAGAAACAGG CTGAGCGAGT GATCTATCCC GATGTGGCTC ACTTTTTAGT TGAATGGATT CTCAAGGGGA AATGGGAAAA TAGTGAAGTA CTTTGTCAGC AACTTTGGCA AAAAGTCGAT TTCCCCAGTT ACGCAATTTT GGGTAATGAA CTGACTAACT CAATGAGTCC TTCGTTAAAA CATTTTTAA
|
Protein sequence | MKILYGVQGT GNGHIARSRA MCAALKQQQV DVDYLFSGRP VENYFSMECF GDFATRRGLT FATENGHVNY VKTLRKNNLL QFWNEVKRLD LSGYDLILND FEPVTAWAAK LQNIPCIGIS HQNAFLYPVP LKGASWLDKA ILRHFAPAQY HLGLHWYHFE QPILPPIIYA PEQPLSQQNF ILVYLPFENV NEICELLYRF TNIHFICYHP EVPENELTEN VELRRLHHGD FQHHLHQCSG VITSGGFELP SEALALGKKL LMKPLAGQFE QVSNAATLET LGLASVMEFL DPACLRLWLD EKQAERVIYP DVAHFLVEWI LKGKWENSEV LCQQLWQKVD FPSYAILGNE LTNSMSPSLK HF
|
| |