Gene Information |
Locus tag | VC0395_0718 |
Symbol | |
ID | 5134021 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Vibrio cholerae O395 |
Kingdom | Bacteria |
Replicon accession | NC_009456 |
Strand | + |
Start bp | 778058 |
End bp | 778987 |
Gene Length | 930 bp |
Protein Length | 309 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640531040 |
Product | hypothetical protein |
Protein accession | YP_001215554 |
Protein GI | 147671540 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1984] Allophanate hydrolase subunit 2 |
TIGRFAM ID | [TIGR00724] biotin-dependent carboxylase uncharacterized domain |
|
|
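The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 778058
end_bp = 778987

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 930 309
```

Both results match the reported Gene Length (930 bp) and Protein Length (309 aa).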
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.00184612 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTAGGTA AGCTACACGT GATCAAAGCG GGACCACTCA CGCTACTTCA CGACTTGGGT CGCTATGGGT TCAGTCATTT CGGGATCACC CCTTCCGGCC CTCTCGATGA ATACGCCTAC AGCTGGGCTA ACCATTTACT CGCCAATACC GTCAATTGCG CCACCTTAGA AATCACGCTC GGTCCAGCGG AATTTCTTTT ACGCAGCGAC GCGCAATTGG CGATTGCCGG TGGCGATCTC AATGCCACGC TAGACGGTCG CCCCATCGCC AATTGGAGCC GATTTTATGC CAAGCAAGGA CAAACGCTGC GCTTTGGTTT ACCACGCAAC GGGCTACGCG CCTATCTTGC GGTTCAAGGA GGGTTTACGG TCTCTCCACA ACTGGGCTCT GTATCTACCC ATGTTCGGCA AGGCTTAGGT GGACTAACGT CGCAAGGGCT GGCGCTACAA ACGGGTGATG ACTTAGCATT TTCCGCGCAG CAGATTGCCC AGAAACCGGT GCTGATGACG TTTCGCTTTC GACCTGATTA CAACTTACCT CTGCGACTGC GCGTGATTGA GAGTTACCAA TATCAAGCGT TTTCACCATC CGCGATGGAA AGCTTCTACC GCAGCGAGTT TATCGTGACC CCAAACAGCG ATCGCATGGG CTATCGCTTG CAAGGTGAAA CCATATCGCC ACCGGATCAA ACCATTCTCT CTGAGGGGAT TGCCTTGGGG GCGATTCAAG TTCCGCCGAA TGGTCAGCCA ATCATTTTGC TCAATGATCG GCAGACGATT GGCGGGTATC CCAAGCTAGG TTGTGTAGCG AGAATCGATC TGCCCCGTTT AGCGCAAGCC AAGCCGGGAC ATTCCGTACG GTTTGTCGCT GGCGATCTCG CTGGGCTTCA AGCGGTGTGG TGTCAGTGGG CGCGATTTTT CGGTTACTGA
|
Protein sequence | MLGKLHVIKA GPLTLLHDLG RYGFSHFGIT PSGPLDEYAY SWANHLLANT VNCATLEITL GPAEFLLRSD AQLAIAGGDL NATLDGRPIA NWSRFYAKQG QTLRFGLPRN GLRAYLAVQG GFTVSPQLGS VSTHVRQGLG GLTSQGLALQ TGDDLAFSAQ QIAQKPVLMT FRFRPDYNLP LRLRVIESYQ YQAFSPSAME SFYRSEFIVT PNSDRMGYRL QGETISPPDQ TILSEGIALG AIQVPPNGQP IILLNDRQTI GGYPKLGCVA RIDLPRLAQA KPGHSVRFVA GDLAGLQAVW CQWARFFGY
|
| |
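The protein sequence above can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch, not the database's own pipeline: it builds the codon-to-amino-acid mapping used by translation table 11 (whose codon assignments are the same as the standard code) and translates a fragment until the first stop codon. The function name `translate` and the variable names are hypothetical, and alternative start codons are not handled.

```python
from itertools import product

# Codon table in NCBI order (first base varies slowest, bases ordered T, C, A, G).
# The amino acid assignments for table 11 are identical to the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)

# First 30 bases of the gene sequence listed above.
gene_start = "ATGCTAGGTA AGCTACACGT GATCAAAGCG"
print(translate(gene_start))  # MLGKLHVIKA

# Last 30 bases of the gene sequence, ending in the TGA stop codon.
gene_end = "TGTCAGTGGG CGCGATTTTT CGGTTACTGA"
print(translate(gene_end))  # CQWARFFGY
```

The two fragments correspond to the first ten and last nine residues of the listed protein sequence. Passing the full gene sequence to `translate` works the same way, since whitespace is stripped before translation.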