Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | VC0395_A0311 |
Symbol | |
ID | 5137332 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Vibrio cholerae O395 |
Kingdom | Bacteria |
Replicon accession | NC_009457 |
Strand | + |
Start bp | 329090 |
End bp | 331072 |
Gene Length | 1983 bp |
Protein Length | 660 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640531769 |
Product | hypothetical protein |
Protein accession | YP_001216267 |
Protein GI | 147673392 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2015] Alkyl sulfatase and related hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 55 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAACAG TCTTTAAACC CACTTTGTTG GCGGTATTAG TGGCCGCCTC TGCGCCCGTA TTAGCTCATT CATTGATTGC GGATACCAAA GCCCCGAGCC CTACCACAAA AGCGTTTGCG GCTGAGCAAC GCAATACGCT GCCGTTTGCG GATCAGGAAG ATTTCAAATT GGTCGAGAAA GGTCTGATCG CGCAGCAAAA AGATCTCGAA ATCAAAGATG CGAATGGCAA GGTGGTTTGG GAATTGGGTA ATTACCGCTT TTTATTGGAT GGACTCGACT ATGACAGCAT CCATCCTAGT TTACAGCGTC AAGCGCAACT GAATATGCAT CACGGCCTGT ATAAAGTGAC GGATCGCATT TATCAGGTAC GTGGCTACGA TCTGGCCAAC ATCACCTTTG TAAAGGGCGA TACGGGATGG ATTGTGTTTG ATCCATTGAC CGTTCCTGCT ACGGCTAAAG CGGCGCTGGA TTTTGTTAAC CAAGAGCTAG GTGAGCGTCC CGTTAAAGCT GTGGTGTACA GTCATGCGCA TGCTGACCAT TTTGGTGGTG TCAAAGGCAT TGTGAGCCAA GAGCAGGTCG ACCGAGGTGA AGTGCCGATC ATCGCACCGA AAGGCTTTTT AAATCACGCG GTGGCTGAGA ATGTTTTGGC AGGAAATGCC ATGTCGCGTC GTACGACTTA TCAATATGGC AACGTTTTAC CGAAAGGGGC GACCGGGCAG GTGGATGCCG CGATTGGTAA GAATGTCGCG CAAGGTGAAG TGAGCTTGAT TGCGCCAACC AAAGTCATTT CTGAGCAAAC GGAAACTGTG GTGATCGATG GCGTAACCAT GGAGTTCCAG AATACACCGG GAACCGAATC TCCAGCGGAA ATGAACACCT ACTTCCCACA GTTTAAAGCC TTGTGGATGG CAGAAAACAC CGTTGGTGGT TTGCACAACG TCTACACCTT ACGTGGTGCA GAAGTGCGAG ATGCCAAGGC GTGGAGCAAA TACATCAATG AGTCGATTCA TATGTACGCG AAAGAGGCGG ATGTGATGTT TGCTTCACAC ACATGGCCGC GTTGGGGGAA CGACAATATC AACCATTTCC TGCGTAAACA GCGTGATATG TACGGCTACA TTCATGATCA GGCTTTACGT TTGGCCAACC ACGGTGTCAC CATCAATGAA ATTCAAGACG AATTCCATGT ACCGGATGTG TTAGCACACG AGTGGTACAA CCGAGGTTAT CACGGTAGTT ACCACCGCAA TGCTAAAGCG GTGATCAACA AATATCTGGG GTATTTTGAT ATGAATCCCG CCACGTTACG CCCATTGGCT CCAACCGATG CGGCTCCTAA ATATGTCGCG GCGATGGGCG GCATGGATAA CGTGATTAAG CTTGGGAAAG AAGCGTTTGA TAAAGGCGAG TTTCGTTGGT GTGCAGAGAT TGTCGATAAA GCCGTGTTTG CAGAGCCGAG CAATAAACAA GCTCGTTACT TACAGGCGGA TTGCTTGGAG CAACTGGGCT ACCAGAGTGA ATCGGCAGGG GAGCGTAACA CTTATTTGAT GGGCGCGTAT GAGTTGCGTA ATGGGGTACC GAAAGTGTCA GCAACCAAAA CGGCGGGTGC TGATACGGTT GTGGCGATGG ATACTGAGCT GTTCCTCGAC TATTTGGGCG TTCGCCTCAA TGGCGACAAA GCCGCTGGTA TTGATTACAC CATTAATTTC GTCTTGCCAG ATGTGAATGA GAAATTCTTG GTTGAACTGG AAAATGCCCA CTTGAATAAC TTAAAAGGCA TTCAATCGGA GAATCCGGAT ATGACGCTGA CTCTCAATCG TGCTCAGCTC AATCAAGTAT TGATGGGTAA AACCACCATT CAACAGTTGG CCAAAGAAGG CAAAGCGAAG ATTGAAGGCA ACGCGCAAGC GCTGACCGAT ATCGCAGGTA TGCTGGATAA CTTCGAATTC TGGTTCAATA TTATCGAACC GAAAACGAAG TAA
|
Protein sequence | MKTVFKPTLL AVLVAASAPV LAHSLIADTK APSPTTKAFA AEQRNTLPFA DQEDFKLVEK GLIAQQKDLE IKDANGKVVW ELGNYRFLLD GLDYDSIHPS LQRQAQLNMH HGLYKVTDRI YQVRGYDLAN ITFVKGDTGW IVFDPLTVPA TAKAALDFVN QELGERPVKA VVYSHAHADH FGGVKGIVSQ EQVDRGEVPI IAPKGFLNHA VAENVLAGNA MSRRTTYQYG NVLPKGATGQ VDAAIGKNVA QGEVSLIAPT KVISEQTETV VIDGVTMEFQ NTPGTESPAE MNTYFPQFKA LWMAENTVGG LHNVYTLRGA EVRDAKAWSK YINESIHMYA KEADVMFASH TWPRWGNDNI NHFLRKQRDM YGYIHDQALR LANHGVTINE IQDEFHVPDV LAHEWYNRGY HGSYHRNAKA VINKYLGYFD MNPATLRPLA PTDAAPKYVA AMGGMDNVIK LGKEAFDKGE FRWCAEIVDK AVFAEPSNKQ ARYLQADCLE QLGYQSESAG ERNTYLMGAY ELRNGVPKVS ATKTAGADTV VAMDTELFLD YLGVRLNGDK AAGIDYTINF VLPDVNEKFL VELENAHLNN LKGIQSENPD MTLTLNRAQL NQVLMGKTTI QQLAKEGKAK IEGNAQALTD IAGMLDNFEF WFNIIEPKTK
|
| |