Gene VC0395_A2000 details


Gene Information

Locus tag: VC0395_A2000
Symbol: pilB
ID: 5135107
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Vibrio cholerae O395
Kingdom: Bacteria
Replicon accession: NC_009457
Strand:
Start bp: 2150837
End bp: 2152525
Gene Length: 1689 bp
Protein Length: 562 aa
Translation table: 11
GC content: 53%
IMG OID: 640533457
Product: type IV pilus assembly protein PilB
Protein accession: YP_001217924
Protein GI: 147674247
COG category: [N] Cell motility; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB
TIGRFAM ID: [TIGR02538] type IV-A pilus assembly ATPase PilB
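
The coordinate and length fields above can be cross-checked against each other with a little arithmetic. The following is a minimal sketch, not part of the original record; it assumes the start and end coordinates are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the gene record.
start_bp = 2150837
end_bp = 2152525
gene_length_bp = 1689
protein_length_aa = 562

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
codons, remainder = divmod(span_bp, 3)

# Assumption: the last codon is a stop codon and does not code for a residue.
coded_residues = codons - 1

print(span_bp, codons, remainder, coded_residues)
```

Under these assumptions the span equals the listed gene length, the length is divisible by three, and the codon count minus the stop codon equals the listed protein length.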


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value: 0.758865
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTCACCA ACCTTGTTGC TATCTTGCGT CAGGCTGAGT TAATCAGCGC AACGCAAGAA 
CAAGCGGTGG TTACACAGGT TAGCGCTTCG GGGACTTCGG TGCCCGAGGC GCTGCTTGAG
TTAAGTATTT TTCACGCCCA AGAACTGACC GAACAACTGA GCCATATTTT CGGCTTGCCG
GAAACCGACC TTAGCCGCTA CGACTACGCC AACTTGTGCC AACAGCTCGG GCTGCGTGAA
CTGATTACCC GCTACGATGC CTTGCCGATT GCCAAGCAAG GCAATTTATT GCTGCTTGCG
GTCTCTGACC CGACCTTACT GCAAGCCGAA GAAGAATTTC GTTTTGCCAC AGGATTACAA
GTTGAACTGG CACTGGCCGA TCACCGCGCG CTGCAAGCCG CGATTCGCCG TTTGTATGGC
CGCTCAATTC AAGGCGCAGC CAACCAAGGG AAAGAGATCA GCCAAGATGA GCTCGCCAAT
CTGGTTAAAG TCAGTGACGA CGAGCTGCAA TCCATTGAAG ATCTCAGCCA AGATGACTCT
CCGGTTAGCC GCTTTATCAA CCAAGTGCTG CTCGATGCGG TACGTAAAGG TGCCTCGGAT
ATTCATTTTG AGCCGTATGA AAACCAGTAT CGGATCCGCC TGCGCTGCGA TGGCATCTTG
GTCGAAACTC AGCAACCGGC TAGCCATTTA AGCCGCCGTT TAGCTGCGCG GATTAAAATT
CTCTCCAAAT TAGATATTGC CGAGCGCCGC TTGCCGCAAG ACGGGCGGAT TAAACTGCGC
CTAAGCCGCG ATACCGCCAT TGATATGCGT GTTTCGACAC TTCCCACTTT ATGGGGAGAA
AAAATCGTGC TGCGTCTGCT CGATAGCAGC GCCGCCAATC TGGATATTGA TAAGCTCGGC
TATAACCCGC AGCAAAAGCA ACTCTACCTC AACGCCCTGA AAAGACCGCA AGGAATGATT
TTAATGACCG GCCCCACCGG CAGCGGCAAA ACCGTTTCGC TCTATACTGG GCTGCGCATT
CTCAACACGT CACAGATCAA TATCTCCACC GCGGAAGATC CGGTAGAAAT TAACCTCTCT
GGGATTAACC AAGTGCAAGT GCAGCCGAAA ATCGGCTTTG GCTTTGCCGA AGCGCTACGC
TCGTTTCTGC GCCAAGACCC GGATGTGGTG ATGGTCGGCG AAATCCGCGA TCTGGAAACC
GCAGAAATCG CGGTCAAAGC CGCGCAAACC GGTCACTTAG TGCTTTCCAC CCTGCACACC
AATTCGGCCG CTGAAACCGT AATTCGTTTA GCCAATATGG GGGTGGAGCC GTTTAACCTC
GCGTCCTCAC TCAGTTTAAT CATCGCCCAA CGCCTCGCGC GCCGCCTATG TAAACACTGC
AAAATCGCGG TGCGCCCTTC TGCCCTATTG CAAAGCCAAT TTGCATTTCA ACCCAATGAA
ATCTTGTATG AAGCGAATGC GGCGGGGTGT AACGAGTGTA CGGGCGGCTA TTCAGGGCGC
GTTGGGATCT ATGAAGTGAT GGCGTTTAAT ACCGAGCTGG CGGAGGCCAT TATGCAACGC
GCCAGCATTC ATCAAATTGA ACGTTTAGCC AAAGCCAATG GCATGCAAAC GTTGCAAGAG
TCCGGTCTTG AAAAGCTGCG CGAAGGCATC ACCAGCTTTG CCGAGCTGCA GCGTGTGCTC
TACTTTTAA
 
Protein sequence
MLTNLVAILR QAELISATQE QAVVTQVSAS GTSVPEALLE LSIFHAQELT EQLSHIFGLP 
ETDLSRYDYA NLCQQLGLRE LITRYDALPI AKQGNLLLLA VSDPTLLQAE EEFRFATGLQ
VELALADHRA LQAAIRRLYG RSIQGAANQG KEISQDELAN LVKVSDDELQ SIEDLSQDDS
PVSRFINQVL LDAVRKGASD IHFEPYENQY RIRLRCDGIL VETQQPASHL SRRLAARIKI
LSKLDIAERR LPQDGRIKLR LSRDTAIDMR VSTLPTLWGE KIVLRLLDSS AANLDIDKLG
YNPQQKQLYL NALKRPQGMI LMTGPTGSGK TVSLYTGLRI LNTSQINIST AEDPVEINLS
GINQVQVQPK IGFGFAEALR SFLRQDPDVV MVGEIRDLET AEIAVKAAQT GHLVLSTLHT
NSAAETVIRL ANMGVEPFNL ASSLSLIIAQ RLARRLCKHC KIAVRPSALL QSQFAFQPNE
ILYEANAAGC NECTGGYSGR VGIYEVMAFN TELAEAIMQR ASIHQIERLA KANGMQTLQE
SGLEKLREGI TSFAELQRVL YF
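
The relationship between the gene sequence and the protein sequence can be illustrated by translating the DNA codon by codon. The sketch below is an illustration only, not the database's own method. It builds the codon table from the standard NCBI ordering of bases (T, C, A, G); translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in which start codons are allowed, which does not matter here because the gene begins with ATG. The function name `translate` and the constants are hypothetical.

```python
from itertools import product

# NCBI ordering: first base varies slowest, third base fastest, each over T, C, A, G.
BASES = "TCAG"
# Amino acids for translation table 11 in that codon order ("*" marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break  # stop codon: end of the protein
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence, spaced as in the record.
print(translate("ATGCTCACCA ACCTTGTTGC TATCTTGCGT"))
# Last 9 bases of the gene sequence.
print(translate("TACTTTTAA"))
```

The first call yields the first ten residues of the protein sequence (MLTNLVAILR), and the second yields its last two residues (YF) before the TAA stop codon.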