Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | VC0395_A1228 |
Symbol | |
ID | 5135872 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Vibrio cholerae O395 |
Kingdom | Bacteria |
Replicon accession | NC_009457 |
Strand | + |
Start bp | 1294188 |
End bp | 1295525 |
Gene Length | 1338 bp |
Protein Length | 445 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 640532686 |
Product | agglutination protein |
Protein accession | YP_001217172 |
Protein GI | 147674567 |
COG category | [M] Cell wall/membrane/envelope biogenesis [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG1538] Outer membrane protein |
TIGRFAM ID | [TIGR01844] type I secretion outer membrane protein, TolC family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.000902456 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCCACA CTGGGGAGAA CGTTTTGAAA GGATTACCAT TAAAGGCGAT TGCGCTCGCT ATCGCATTGA GCAATCCGGT TTCTGCGCAA ACGTTGGAGC AAGCTGTATC CATCACGTTG GCATCCAACC CGGAGCTAAA AAGCGCATTT AATCAATTTA AAAGTCGTGA ATATGATGCA GAGGCATCGT CAGGCGCTTA CTTACCGAAA ATCGATCTTG ATGCAGGTAT CGGTTATGAG GCAATTAATC CTGCAGAATC TAGTGGTAAT AGAAACACAG ATTTAACTCG TAAAGATGCT ACCCTTACCC TCACTCAATT GATTTGGGAT GGGTCTGCCA CACTGAATGA TATGGATAGA ACGGCAGCGG AAGCTGAGGC GGATCGCTAT CAGTTATTAG CCGATGCCTC TAACATGGCT CTGGAAGTCG CCAAGATTTA TCTGGACGCA ACCAAAGCTT CTGAAATTCT AACGCTGTCA GAAAACAATC TGGCTATTCA TAAAGATATC TACCGCGATA TTAAAAAGCG CGCCGATTCA GGGATTGGCT CTACGGCCGA TGTGACTCAA GTTGAGGCTC GTTTAGCAAA AGCGCACAGC AACTTGGTGG CTGCACAAAA CAATCTTTTT GACATCTACA CTCAGTTTCG CCGTTTAGTG GGTCAAGAAC CTGTGAGCTT AGAGTTTCCT CGTGCCGATC AAAATGCGAT ACCGCCGACA TTAGAAAATG CTTTAAATAT GGCGCAAGAA AATCATCCAG TGATTAAGGT TGCTCAAGCG GATGTGGATG CAGCACGCTT CCAGTATAAG CAGTCTAAAG CGCCAAATTA CCCCACACTC TCTTTCGAAG CGGCTCAAAG CTGGCGTAAT GATGCAGGCG GTATTGAGGG CAGCAGTGAT GAGCTCAGTG CGATGCTCCG TTTGCGTTAC AATCTCTACA ATGGCGGCAG TGACAGTGAT CGTACAGAAA GTGCGGCTTA TCAACTCAAT CGTTCCAAGG ATCTGCGTGA AAAAACCTTC CGTACCGTTG AGGAAGGGCT TCGCTTATCT TGGAGTGCCT TGGATCTCAC TCTGCAACAG AAACAGTTTT TGGCAGATCA CGTTGATTCT GCATCCAAAA CGGTTGTTTC GTATCGTAAA CAGTATCAAA TCGGCCAGCG TACATTGCTC GATCTCCTCA ACACTGAGAA TGAGCTTTTC GAGGCACGCA AGGACTACCT CGATGCACGA TATGCTGAAC AATATGCAAA ATATCGTGTC ATGAATGCAT CTGGTAACTT GTTAGATGCC TTAAGAGTCG ATATCCCGCA GGAATGGACA GCTAAGGTGG AGTACTAA
|
Protein sequence | MAHTGENVLK GLPLKAIALA IALSNPVSAQ TLEQAVSITL ASNPELKSAF NQFKSREYDA EASSGAYLPK IDLDAGIGYE AINPAESSGN RNTDLTRKDA TLTLTQLIWD GSATLNDMDR TAAEAEADRY QLLADASNMA LEVAKIYLDA TKASEILTLS ENNLAIHKDI YRDIKKRADS GIGSTADVTQ VEARLAKAHS NLVAAQNNLF DIYTQFRRLV GQEPVSLEFP RADQNAIPPT LENALNMAQE NHPVIKVAQA DVDAARFQYK QSKAPNYPTL SFEAAQSWRN DAGGIEGSSD ELSAMLRLRY NLYNGGSDSD RTESAAYQLN RSKDLREKTF RTVEEGLRLS WSALDLTLQQ KQFLADHVDS ASKTVVSYRK QYQIGQRTLL DLLNTENELF EARKDYLDAR YAEQYAKYRV MNASGNLLDA LRVDIPQEWT AKVEY
|
| |