Gene Information |
Locus tag | VC0395_A1269 |
Symbol | |
ID | 5137824 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Vibrio cholerae O395 |
Kingdom | Bacteria |
Replicon accession | NC_009457 |
Strand | + |
Start bp | 1347884 |
End bp | 1349032 |
Gene Length | 1149 bp |
Protein Length | 382 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640532727 |
Product | putative ABC transporter solute-binding protein |
Protein accession | YP_001217213 |
Protein GI | 147675331 |
COG category | [R] General function prediction only |
COG ID | [COG4134] ABC-type uncharacterized transport system, periplasmic component |
TIGRFAM ID | |
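The coordinate and length fields above are mutually consistent, and the check is simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in Protein Length; the variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 1347884
end_bp = 1349032

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# A CDS should contain a whole number of codons.
assert gene_length_bp % 3 == 0

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, "bp")     # compare with the Gene Length field
print(protein_length_aa, "aa")  # compare with the Protein Length field
```

Under these assumptions the computed values agree with the Gene Length (1149 bp) and Protein Length (382 aa) fields.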
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0000820878 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAA TGCTGACTTT TCTCACGACC TGCGCCCTCT CATTTGGCGT TTCTGCGCAA TCATGGGAAG AGACTACCGA GAAAGCCCGC GGACAAACGG TTTATTTCCA TGCTTGGGGT GGCAGCCAAG AGATTAATAG CTACTTGCGC TGGTCTGCTG ACCTACTCAA ATCCCGCTAT GGCGTCACCT TGCAGCATGT GAAAGTAACG GATATCGCCG AAACGACTAC CCGTTTGATC GCAGAGAAAG CGGCAGGAAA AAATGAGCAA GGCAGTGTCG ATTTGGTTTG GATTAACGGT GAAAACTTCA AATCAATGAA AAACAACCAG TTACTCTACG GGCCATTTGT CGAAGATTTA CCTAACTGGC CGAAAGTCGA CAAAAGCTTA CCCGTTGACA GCGATTTTTC CGAGCCAACT GAAGGCTTAG AAGCGCCTTG GGGCGTCGGA CAATTGGTGT TTATTCATGA TAAGCAGCAA GTGAACAATC CTCCGGCTTC ATTTGCCGAG CTGCTCAGCT ATGCGCAGGC TTTTCCAAAC CGCATTAGCT ACCCACAGCC CCCTGAATTT CATGGTACCA GCTTCTTGAA AGCAGCCCTG ATCGAACTGA CTCACTATGC GCCAGAACTC AATCAAGCCG TAGAGCCTGC CACATTTGAG CAGCTCACCC AACCACTATG GCAATACTTA GATGAATTGC ATGCTGTGGC TTGGCGCAAA GGCAAACAGT TTCCCGCTGG CTCTGCCCAG ATGATGCAAC TGCTGGATGA TGGTCAACTA GACCTCGCGA TCACCTTTAA CCCGAATGCG GTATTTTCAG CGCAAGCTAC TGGCAAGCTC GCACCGACAG CGCAAAGCTA TGCGATGAAA AAGGGCGCTT TAACCAATAT CCACTTCTTA GCAATTCCTT GGAACGCTAA CGCAAAAGAA GGGGCGCTGG TGGCGATTAA TTTCCTCCTG AGTGACGAAG CACAATCTCG TAAAGGGGAT TTGTCTATTT GGGGCGACCC TTCCATATTG AAACCTCAGT TTCTGACAGG CAGTGCCAAA AATACCATCC TATTCCCTGC GATTGCTGAG CCGCACCCAA GTTGGCAAAG CGCGCTCGAA GCAGAATGGC AAAAGCGCTA CGGTAATAGC TTAAAGTAA
|
Protein sequence | MKKMLTFLTT CALSFGVSAQ SWEETTEKAR GQTVYFHAWG GSQEINSYLR WSADLLKSRY GVTLQHVKVT DIAETTTRLI AEKAAGKNEQ GSVDLVWING ENFKSMKNNQ LLYGPFVEDL PNWPKVDKSL PVDSDFSEPT EGLEAPWGVG QLVFIHDKQQ VNNPPASFAE LLSYAQAFPN RISYPQPPEF HGTSFLKAAL IELTHYAPEL NQAVEPATFE QLTQPLWQYL DELHAVAWRK GKQFPAGSAQ MMQLLDDGQL DLAITFNPNA VFSAQATGKL APTAQSYAMK KGALTNIHFL AIPWNANAKE GALVAINFLL SDEAQSRKGD LSIWGDPSIL KPQFLTGSAK NTILFPAIAE PHPSWQSALE AEWQKRYGNS LK
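The protein sequence can be derived from the gene sequence by codon-by-codon translation. The following is a minimal sketch, not the method used to produce this record. It builds the codon table from the standard amino acid assignments, which translation table 11 shares with the standard code (table 11 differs in which codons may act as start codons). The `translate` function and the constant names are hypothetical helpers, and the sketch does not handle alternative start codons or ambiguous bases.

```python
from itertools import product

# Standard amino acid assignments, codons ordered with bases T, C, A, G.
# Translation table 11 uses the same assignments; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is removed first, so the space-separated 10-base blocks
    used in the Gene sequence field can be pasted in directly.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the Gene sequence field.
prefix = "ATGAAAAAAA TGCTGACTTT TCTCACGACC"
print(translate(prefix))
```

For the first 30 bases of the gene sequence this prints `MKKMLTFLTT`, which matches the first block of the Protein sequence field.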
|
| |