Gene VC0395_A1228 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVC0395_A1228 
Symbol 
ID5135872 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVibrio cholerae O395 
KingdomBacteria 
Replicon accessionNC_009457 
Strand
Start bp1294188 
End bp1295525 
Gene Length1338 bp 
Protein Length445 aa 
Translation table11 
GC content46% 
IMG OID640532686 
Productagglutination protein 
Protein accessionYP_001217172 
Protein GI147674567 
COG category[M] Cell wall/membrane/envelope biogenesis
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG1538] Outer membrane protein 
TIGRFAM ID[TIGR01844] type I secretion outer membrane protein, TolC family 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.000902456 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCCACA CTGGGGAGAA CGTTTTGAAA GGATTACCAT TAAAGGCGAT TGCGCTCGCT 
ATCGCATTGA GCAATCCGGT TTCTGCGCAA ACGTTGGAGC AAGCTGTATC CATCACGTTG
GCATCCAACC CGGAGCTAAA AAGCGCATTT AATCAATTTA AAAGTCGTGA ATATGATGCA
GAGGCATCGT CAGGCGCTTA CTTACCGAAA ATCGATCTTG ATGCAGGTAT CGGTTATGAG
GCAATTAATC CTGCAGAATC TAGTGGTAAT AGAAACACAG ATTTAACTCG TAAAGATGCT
ACCCTTACCC TCACTCAATT GATTTGGGAT GGGTCTGCCA CACTGAATGA TATGGATAGA
ACGGCAGCGG AAGCTGAGGC GGATCGCTAT CAGTTATTAG CCGATGCCTC TAACATGGCT
CTGGAAGTCG CCAAGATTTA TCTGGACGCA ACCAAAGCTT CTGAAATTCT AACGCTGTCA
GAAAACAATC TGGCTATTCA TAAAGATATC TACCGCGATA TTAAAAAGCG CGCCGATTCA
GGGATTGGCT CTACGGCCGA TGTGACTCAA GTTGAGGCTC GTTTAGCAAA AGCGCACAGC
AACTTGGTGG CTGCACAAAA CAATCTTTTT GACATCTACA CTCAGTTTCG CCGTTTAGTG
GGTCAAGAAC CTGTGAGCTT AGAGTTTCCT CGTGCCGATC AAAATGCGAT ACCGCCGACA
TTAGAAAATG CTTTAAATAT GGCGCAAGAA AATCATCCAG TGATTAAGGT TGCTCAAGCG
GATGTGGATG CAGCACGCTT CCAGTATAAG CAGTCTAAAG CGCCAAATTA CCCCACACTC
TCTTTCGAAG CGGCTCAAAG CTGGCGTAAT GATGCAGGCG GTATTGAGGG CAGCAGTGAT
GAGCTCAGTG CGATGCTCCG TTTGCGTTAC AATCTCTACA ATGGCGGCAG TGACAGTGAT
CGTACAGAAA GTGCGGCTTA TCAACTCAAT CGTTCCAAGG ATCTGCGTGA AAAAACCTTC
CGTACCGTTG AGGAAGGGCT TCGCTTATCT TGGAGTGCCT TGGATCTCAC TCTGCAACAG
AAACAGTTTT TGGCAGATCA CGTTGATTCT GCATCCAAAA CGGTTGTTTC GTATCGTAAA
CAGTATCAAA TCGGCCAGCG TACATTGCTC GATCTCCTCA ACACTGAGAA TGAGCTTTTC
GAGGCACGCA AGGACTACCT CGATGCACGA TATGCTGAAC AATATGCAAA ATATCGTGTC
ATGAATGCAT CTGGTAACTT GTTAGATGCC TTAAGAGTCG ATATCCCGCA GGAATGGACA
GCTAAGGTGG AGTACTAA
 
Protein sequence
MAHTGENVLK GLPLKAIALA IALSNPVSAQ TLEQAVSITL ASNPELKSAF NQFKSREYDA 
EASSGAYLPK IDLDAGIGYE AINPAESSGN RNTDLTRKDA TLTLTQLIWD GSATLNDMDR
TAAEAEADRY QLLADASNMA LEVAKIYLDA TKASEILTLS ENNLAIHKDI YRDIKKRADS
GIGSTADVTQ VEARLAKAHS NLVAAQNNLF DIYTQFRRLV GQEPVSLEFP RADQNAIPPT
LENALNMAQE NHPVIKVAQA DVDAARFQYK QSKAPNYPTL SFEAAQSWRN DAGGIEGSSD
ELSAMLRLRY NLYNGGSDSD RTESAAYQLN RSKDLREKTF RTVEEGLRLS WSALDLTLQQ
KQFLADHVDS ASKTVVSYRK QYQIGQRTLL DLLNTENELF EARKDYLDAR YAEQYAKYRV
MNASGNLLDA LRVDIPQEWT AKVEY