Gene Information |
Locus tag | VC0395_0657 |
Symbol | |
ID | 5134073 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Vibrio cholerae O395 |
Kingdom | Bacteria |
Replicon accession | NC_009456 |
Strand | + |
Start bp | 722565 |
End bp | 724742 |
Gene Length | 2178 bp |
Protein Length | 725 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640530979 |
Product | hypothetical protein |
Protein accession | YP_001215496 |
Protein GI | 147672452 |
COG category | [S] Function unknown |
COG ID | [COG1289] Predicted membrane protein |
TIGRFAM ID | [TIGR01666] hypothetical membrane protein, TIGR01666; [TIGR01667] integral membrane protein, YccS/YhfK family |
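
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the CDS ends in a stop codon that is not counted in the protein length; the record does not state either convention, but both are consistent with its numbers. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming its final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp rows of this record.
length = gene_length_bp(722565, 724742)
print(length)                     # 2178, matching Gene Length
print(protein_length_aa(length))  # 725, matching Protein Length
```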
| |
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTGCTGCG CTGTGCCGAA AATTAACCAA CTGCGCCTGT ATTGGGCAAA TAAAACTGTT AACTACAGTG TCTTGATCTT ATTGACTTTA CTGGGTGTGG TGATCCCCGC TTGGTATTAT CAGCTCAACA CTTGGATCAC CCCTTTGATT CTCGGGGTGA TTGCCGCGGC ACTCGCCGAT CGAGATGACC GCTTTAGTGG CCGTTTGAAA TCCATCATTC TCACTTTGAT CTGCTTTGCG ATTGCCGCTT TCTCGATTGA GATTCTGTTT CAAACCCCGT GGCTGTTTGC GCTTGGTCTA TTCACCTCAA GTTTTGGTTT TATCATGCTC GGCGCTATGG GGGCACGCTA CGCCAGCATC GCGTTTGCTT CTCTACTGGT TGCCGTGTAC ACCATGCTCG GTGCCCATCA AAGCACGAAT ATCTGGTTTC AGCCGTTACT TTTACTCAGC GGCAGCGCTT GGTACTACTT GATGTCGATG CTGTGGCATG CCTTTTGGCC AATGCAGCCC GTACAACAAA ACCTCGCTAA CGTGTTTCTG CAACTGGCCA ATTATTTAGA GGCGAAATCC ACCCTTTTCC ACCCGGTTTC CAACATGATC CCGCAGCCAC ACCGGATCAT TGAGGCCAAT CTTAACGCCG CGACAGTCAA TGCGTTAAAC CAGTGCAAGG CGGTGTTTCT GACCCGCTCC AAACGGGGGC ATGTGGACAG CGCCAGCGAT CGCTTTTTAA ACATCTATTT TCTGGCGCAA GACATTCACG AACGAGTGAG CTCCAGCCAT TACCGTTACC AAGAGCTAGC CGATCACTTT GGCCGCTCAG ACATTCTGTT CCGCTTTAAA TATTTATTGG AGACGCAAGC GAAAGCGTGC CGCGATATCG CTCAATCGAT TCGTCTTGGG CACAGTTACC AGCACGACTC TGCTTCGATT GTCGCGCTGG ATGAACTGCA GTTATCACTC AGCTATCTTC GCCAACAAGA GAGACGAGAT TGGAAAAGTC TTTTGGGTCA ATTAGGCTAT TTATTCAATA ACTTAGCGAC CGTAGAAAAA CAGCTCAATA ACGTCAGTAA CCCGGATGTG GCCAAACCCG AAGAAGGGGT ACTGGATGAC ACTGAAGCGC ACACCTTAGG CAGCATGTGG CAACGTATTC GCGCCAATCT CAATAAAGAT TCGCTGTTAT TTCGCCATGC GCTGCGCTTG TCCATCACGC TCACCTTGGG CTATGCGATC ATCCAAGGGT TCGGCATTGA GCGCGGTTAC TGGATTTTGC TCACCACCCT GTTTGTTTGC CAGCCAAACT ACGCCGCAAC CAAGCAAAAA CTCACCGCGC GAATCATAGG CACCTTGGCC GGCTTGCTGA TTGGCGTGCC GCTACTCACC TTCTTCCCAT CACAAGAGAG CCAATTGGTG TTTATTGTCT TCTCCGGTGT GATGTTTTTT GCGTTTCGCC TCAACAACTA CGGCTACGCG ACCGGTTTTA TCACCTTATT GGTGCTGTTT TGTTTCAATC AGTTAGGTGA AGGCTATGCG GTGGTATTAC CAAGATTGGC AGATACCCTC ATTGGCTGCG CCTTGGCGGT AGCGGCCGTG GTGCTCATTC TGCCAGACTG GCAATCGAAA CGGCTGCATA AAGTGATGGC GGAAGCGATT GATGCCAATA AGCAGTACTT GGCGCAGATC ATTGGTCAAT ATCGGATTGG TAAAAAAGAC AGCTTGAGTT ACCGGATTGC GCGGCGTCAT GCCCACAACC AAGATGCCGC GCTTTCTGCC GCGGTCACCA ATATGCTGGC GGAGCCGGGG 
CGTTATCGTG CGGCGGCCGA TGAGAGCTTT CGTTTCCTGA CCCTTAACCA CGCGCTGCTC AGTTATATCT CGGCGCTTGG AGCACATCGA ACCCGGCTGG ATGATGAAAC CGTGCATCAG TTGGTACTCG ATTCACATCG CGTGATCCAT CAGCATCTCG ACTTGCTGCA TCAACAGTTG TCCAACCACT GTGAAGAGTG TGATACCAGC GGTATTGATA GCTCAGGGCT AGAGAAACGT CTTGCTGAGT GGCGTGAGGA TGATGAAGGC TCGGCGCGCA TGGTACTGCA ACAGCTGCAC TTGATTTATC GCATGCTGCC TGAGCTGCAC ATGCTCGCCA GCAAGTTTGC TGTCAAGGTC GATTCTCAAT CCGACTGA
|
Protein sequence | MCCAVPKINQ LRLYWANKTV NYSVLILLTL LGVVIPAWYY QLNTWITPLI LGVIAAALAD RDDRFSGRLK SIILTLICFA IAAFSIEILF QTPWLFALGL FTSSFGFIML GAMGARYASI AFASLLVAVY TMLGAHQSTN IWFQPLLLLS GSAWYYLMSM LWHAFWPMQP VQQNLANVFL QLANYLEAKS TLFHPVSNMI PQPHRIIEAN LNAATVNALN QCKAVFLTRS KRGHVDSASD RFLNIYFLAQ DIHERVSSSH YRYQELADHF GRSDILFRFK YLLETQAKAC RDIAQSIRLG HSYQHDSASI VALDELQLSL SYLRQQERRD WKSLLGQLGY LFNNLATVEK QLNNVSNPDV AKPEEGVLDD TEAHTLGSMW QRIRANLNKD SLLFRHALRL SITLTLGYAI IQGFGIERGY WILLTTLFVC QPNYAATKQK LTARIIGTLA GLLIGVPLLT FFPSQESQLV FIVFSGVMFF AFRLNNYGYA TGFITLLVLF CFNQLGEGYA VVLPRLADTL IGCALAVAAV VLILPDWQSK RLHKVMAEAI DANKQYLAQI IGQYRIGKKD SLSYRIARRH AHNQDAALSA AVTNMLAEPG RYRAAADESF RFLTLNHALL SYISALGAHR TRLDDETVHQ LVLDSHRVIH QHLDLLHQQL SNHCEECDTS GIDSSGLEKR LAEWREDDEG SARMVLQQLH LIYRMLPELH MLASKFAVKV DSQSD
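
The sequences above are displayed in 10-character blocks separated by spaces, so they need cleaning before any downstream use. The following is a minimal sketch of that step plus a GC-fraction calculation; the helper names are hypothetical, and the short input is just the first two blocks of the gene sequence, not the full record.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (block separators, line breaks) and uppercase the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two 10-base blocks of the gene sequence, exactly as displayed.
blocks = "ATGTGCTGCG CTGTGCCGAA"
seq = clean_sequence(blocks)
print(len(seq), gc_fraction(seq))
```

Running the same two functions on the full Gene sequence field would allow a comparison with the Gene Length and GC content rows.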