Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | VC0395_A0144 |
Symbol | |
ID | 5137259 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Vibrio cholerae O395 |
Kingdom | Bacteria |
Replicon accession | NC_009457 |
Strand | - |
Start bp | 140450 |
End bp | 142174 |
Gene Length | 1725 bp |
Protein Length | 574 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640531604 |
Product | endoglucanase-related protein |
Protein accession | YP_001216109 |
Protein GI | 147673295 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 59 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTGTTAC TGACCAACCA CATTGGCTAT GAAACCCAAG GCCCTAAACA GGCGGTGTTG CTGTGTGGAC AAACACAACT CATGGACGAT TGTGTGCTTT TGGTTTGCGC TCGTAGTCAC CAAACCGTTG CCAAGCTCGC TATTGAGTGG CACGGCAAGG TCGACAACTG GCATCAGGGA CAATTTCATC GTATCGATTT TTCCGATTTC ACGACGCCAG GCGACTATTA TCTGCGCCTG GAACACACGC ATTCTGCCAC TTTTACTATT GCGCGAGGCG TATTAATGCA ACGCACATTT TCTGATGTGC TGCACTACTT TAAATCGCAA CGCTGCTCTG GGCAGTTTGA TCAACAAGAC AAGCAAGTAC CGCTGCTGAG CACATCAACC ACTGCCGATG TGCACGGTGG CTGGTACGAC GCTTCAGGTG ACGTTAGCAA GTATCTCAGT CACCTTTCTT ACGCTAACTA TTTGAACCCA CAACAAACAC CTTTGGTGGT CTGGAACATG CTCAAAGGGT TAGCGGTTTT ACAACATCAC TCGGGTTTTG CATCGTTTTC TCGCACTCGC CTCAAGGATG AAGCGTTGTT TGGTGCTGAT TTTCTACGCC GTATGCAAAA CTCCGAGGGA TTTTTTTATA TGACCGTCTT TGACAAATGG AGCAAAGACA CCAAACAACG GGAGATTTGT GCCTACGCGA CCCAACAAGG CCATAAATCC GATGATTATC AAGCGGGTTT TCGCCAAGGT GGGGGCATGG CGATTGCGGC ACTTGCCGCA GCCGCGCGTT TGGATACGCA CGGCGAGTTC ACACAAGCTG ACTATTTACA AGCGGCAGAA AATGGCTACT GGCACCTCAA AGAGCATAAC CTCGCCTACC TCAATGATGG GGTTGAAAAC ATCATTGATG AGTACTGCGC ACTCTTGGCG TGTTGCGAAC TTTACCGCAC GACAGAGAAT GACCAATATC TGGCTCAAGC TCGTGAGTGG GCACAGCGTT TAGCCAAGCG CCAATGCAGC GATGAACAAA TTGCGCACTA CTGGTCTGCC ACCAGCAACG GTGAGCGCCC ATACTTCCAC GCCAGTGATG CTGGTCTGCC TGTCATTGCA CTTTGCGAGT ATCTGAATAT TGAAACGGAT ACGGCTAACT ACGCTCAACT CCAAAGAGTG GTCGAGCAAG CCTGTCAATT CGAGTTAGCG ATAACTCAAC AAGTCTCCAA CCCGTTTGGG TACCCGCGTC AGTATGTCAA AGGGGTGGAA AGCGCCAAAC GCACCAGTTT CTTTATCGCT CAAGACAACG AAAGTGGTTA CTGGTGGCAA GGTGAAAATG CCCGTCTCGC CTCGTTGGCG AGCATGGCCT ATCTTGCTCA GCCTCATTTG AGTACCGCTA TCGCTAAACC GCTTGAACAG TGGTCACAAA ATGCCCTGAA CTGGATTGTC GGGCTCAATC CTTACAACAT GTGCATGCTC GATGGACATG GGCACAATAA TCCCGATTAC TTACCTCATT TAGGCTTTTT CAATGCCAAA GGCGGTGTGT GTAACGGCAT AACCGCGGGC TTTGATGACC CAAGAGATAT CGCGTTTAAC CCAGCAGGGC AAAAAGATGA CATGCTGCAA AACTGGCGTT GGGGAGAACA ATGGATCCCG CATGGCGCTT GGTATCTGCT TGCCATCATC AGCCAATTTG CTCACTTTAC CGCTCACGGG GAGGAGAACC AATGA
|
Protein sequence | MLLLTNHIGY ETQGPKQAVL LCGQTQLMDD CVLLVCARSH QTVAKLAIEW HGKVDNWHQG QFHRIDFSDF TTPGDYYLRL EHTHSATFTI ARGVLMQRTF SDVLHYFKSQ RCSGQFDQQD KQVPLLSTST TADVHGGWYD ASGDVSKYLS HLSYANYLNP QQTPLVVWNM LKGLAVLQHH SGFASFSRTR LKDEALFGAD FLRRMQNSEG FFYMTVFDKW SKDTKQREIC AYATQQGHKS DDYQAGFRQG GGMAIAALAA AARLDTHGEF TQADYLQAAE NGYWHLKEHN LAYLNDGVEN IIDEYCALLA CCELYRTTEN DQYLAQAREW AQRLAKRQCS DEQIAHYWSA TSNGERPYFH ASDAGLPVIA LCEYLNIETD TANYAQLQRV VEQACQFELA ITQQVSNPFG YPRQYVKGVE SAKRTSFFIA QDNESGYWWQ GENARLASLA SMAYLAQPHL STAIAKPLEQ WSQNALNWIV GLNPYNMCML DGHGHNNPDY LPHLGFFNAK GGVCNGITAG FDDPRDIAFN PAGQKDDMLQ NWRWGEQWIP HGAWYLLAII SQFAHFTAHG EENQ
|
| |