Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | VC0395_A2472 |
Symbol | |
ID | 5135090 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Vibrio cholerae O395 |
Kingdom | Bacteria |
Replicon accession | NC_009457 |
Strand | - |
Start bp | 2623253 |
End bp | 2624428 |
Gene Length | 1176 bp |
Protein Length | 391 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640533923 |
Product | hypothetical protein |
Protein accession | YP_001218365 |
Protein GI | 147674460 |
COG category | [S] Function unknown |
COG ID | [COG1652] Uncharacterized protein containing LysM domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 41 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGATTTCTC CGAAACCCAG TCATTTTTTC GACCAAGGAA GCAAGGTCAT GGCACGATTA ACCCGTTATA TTACCTACGC TTTATTGCCT TTCACTCTGC TCAGTGGACA GGTGACCGCC GATGAGCAAA CGCCGCTGGC ACTCAAACCA AATGCACCGA CCACCTATAC CGTGGTGAAA GGTGACACCT TGTGGGATAT TTCCGCTTTG TATCTCGACA GTCCATGGCT GTGGCCAAGA CTTTGGCAAG TGAACCCAGA AATTGATAAC CCACATTTGA TCTACCCCGG TGATAAATTG ACGCTGTTTT GGCGTGATGG GCAACCCGTA CTCAGTCTAA AACCGATGCG CAAACTGAGC CCGCAAGTGC GTGTGTTGGA GAAACAAGCT GTTCCGACTG TGCAAGAAGG CTTGGTGCTG CCTTATTTAC AATCTGACCG TTTGATGGCG AAAACCGCCT TGCAAGGTAG CGTGCGTGTG ATTGGCTCCA GTGAGGGGCG TCAATATCTC ACCAAGCAAG ATCAGCTTTA CATCTCCGGT GTACACAGCG AGAAAAAGTG GGGGATTTAT CGCGAGGTAG CACAGTATCA ACGTGATGAT GAAGTCATGG TGGCGCTGCG TTTAGTGGCG GTGGGTGAAT TGGCGATGAC TGGGGGCAAC TTTAGTGGAC TAAGCTTGCT AGAGCAAAAT CAAGAGATTT TAGCCAATGA TATTGCTTTG CCAGAAGTCG ATCTTGAGGA GCGCCAACTG TCGACAACCT TCTATCCGCA GCCAGCGCCT GCAGGGAGTG AAGCTCGCAT TCTGGGTTCG CTTGAAGGGA GTCAATACGC CGGACAGAAT CAAGTGGTGG TGATTGACCA AGGCCGCAGT GATGGCGTCG CGCAAGGCAG TATGTTTGAA CTGTATCAAG CCGCAGTCCA AGTCAAAGCT AAGCAAGATT CGTCTACTTT CCTGTCTGAG CGTTCGAACA CGGACGTACA GTTACCAAGT GTAAAAGTGG GCGCTTTGAT GGTGATTCGC CCTTATGAGC GTTTTAGCTT GGCACTGATT ACTAACAGTT CGGCACCGAT CAGTGCTGAA GTGCACGCAC TGTCTCCGCA GGCAGAGCCT CAAACCGAGC AGCCAGATGT AGTGCAATTG TCTGAACAAA TCGATTTGGT TGATGATGCC AGTTAA
|
Protein sequence | MISPKPSHFF DQGSKVMARL TRYITYALLP FTLLSGQVTA DEQTPLALKP NAPTTYTVVK GDTLWDISAL YLDSPWLWPR LWQVNPEIDN PHLIYPGDKL TLFWRDGQPV LSLKPMRKLS PQVRVLEKQA VPTVQEGLVL PYLQSDRLMA KTALQGSVRV IGSSEGRQYL TKQDQLYISG VHSEKKWGIY REVAQYQRDD EVMVALRLVA VGELAMTGGN FSGLSLLEQN QEILANDIAL PEVDLEERQL STTFYPQPAP AGSEARILGS LEGSQYAGQN QVVVIDQGRS DGVAQGSMFE LYQAAVQVKA KQDSSTFLSE RSNTDVQLPS VKVGALMVIR PYERFSLALI TNSSAPISAE VHALSPQAEP QTEQPDVVQL SEQIDLVDDA S
|
| |