Gene VC0395_A2001 details


Gene Information

Locus tag: VC0395_A2001
Symbol: pilC
ID: 5137670
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Vibrio cholerae O395
Kingdom: Bacteria
Replicon accession: NC_009457
Strand:
Start bp: 2152568
End bp: 2153794
Gene Length: 1227 bp
Protein Length: 408 aa
Translation table: 11
GC content: 49%
IMG OID: 640533458
Product: type IV pilin biogenesis protein PilC
Protein accession: YP_001217925
Protein GI: 147673666
COG category: [N] Cell motility; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG1459] Type II secretory pathway, component PulF
TIGRFAM ID:
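
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 2152568
end_bp = 2153794

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1227 408
```

Both results agree with the Gene Length and Protein Length fields listed in the record.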


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value: 0.148255
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAGCGA CCCAAACCTT ACCTCTGAAA AATTATCGCT GGAAAGGCAT CAACAGCAAC 
GGCAAAAAAG TTTCCGGCCA GATGCTCGCC ATCTCCGAAA TCGAGGTGCG CGATAAGCTC
AAAGATCAGC ATATTCAGAT CAAAAAACTC AAAAAAGGCA GTGTATCTCT TTTGGCACGC
CTAACCCATC GCGTGAAAAG TAAAGATATT ACGATTTTGA CTCGGCAGTT GGCGACCATG
CTCACCACGG GCGTACCCAT TGTGCAAGCC CTCAAGTTGG TGGGCGATAA TCACCGTAAA
GCTGAGATGA AATCGATTCT GGCGCAAATC ACCAAAAGCG TGGAAGCGGG CACGCCACTT
TCCAAGGCGA TGCGCACCGC CAGCGCCCAT TTTGATACCT TGTATGTCGA TTTAGTGGAA
ACCGGAGAGA TGTCCGGTAA CTTACCTGAG GTGTTTGAGC GTTTGGCCAC CTACCGCGAG
AAAAGCGAGC AACTACGCGC CAAGGTGATT AAAGCGCTCA TCTACCCCAG CATGGTTGTG
TTGGTCGCGC TCGGGGTATC TTACTTAATG CTCACCATGG TCATCCCAGA GTTTGAAAGC
ATGTTTAAAG GCTTTGGTGC TGAACTGCCT TGGTTTACGC AGCAAGTGCT GAAACTCTCA
CACTGGGTGC AGGCTTACAG TTTATGGGCA TTTATCGCCA TCGCAGCAGC CATTTTTGGC
TTGAAAGCGC TGCGTAAAAA CTCTTTCCAG ATCCGTTTAA AAACCAGCCG CTTAGGGCTG
AAATTTCCGA TTATTGGTAA TGTGCTCGCT AAGGCTTCCA TCGCCAAATT CAGCCGTACC
CTCGCCACCA GCTTTGCCGC GGGGATCCCA ATTCTCGCCA GTTTAAAAAC CACGGCCAAA
ACCTCCGGCA ATGTGCACTT TGAAACCGCG ATTAATGAGG TGTACCGCGA TACCGCTGCG
GGTATGCCGA TGTACATTGC TATGCGCAAT ACCGAAGCTT TTCCCGAAAT GGTGCTGCAA
ATGGTGATGA TCGGTGAAGA GTCTGGGCAA TTAGACGACA TGCTCAACAA GGTCGCGACC
ATCTATGAAT TTGAAGTCGA TAACACGGTC GATAACTTGG GCAAGATTCT TGAACCACTG
ATCATCGTCT TTCTTGGGAC GGTTGTGGGC GGCTTAGTGG TGGCGATGTA CTTACCTATC
TTTAACTTAA TGAGTGTATT GGGTTAG
 
Protein sequence
MKATQTLPLK NYRWKGINSN GKKVSGQMLA ISEIEVRDKL KDQHIQIKKL KKGSVSLLAR 
LTHRVKSKDI TILTRQLATM LTTGVPIVQA LKLVGDNHRK AEMKSILAQI TKSVEAGTPL
SKAMRTASAH FDTLYVDLVE TGEMSGNLPE VFERLATYRE KSEQLRAKVI KALIYPSMVV
LVALGVSYLM LTMVIPEFES MFKGFGAELP WFTQQVLKLS HWVQAYSLWA FIAIAAAIFG
LKALRKNSFQ IRLKTSRLGL KFPIIGNVLA KASIAKFSRT LATSFAAGIP ILASLKTTAK
TSGNVHFETA INEVYRDTAA GMPMYIAMRN TEAFPEMVLQ MVMIGEESGQ LDDMLNKVAT
IYEFEVDNTV DNLGKILEPL IIVFLGTVVG GLVVAMYLPI FNLMSVLG
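
The protein sequence corresponds to the gene sequence read codon by codon. The sketch below illustrates this on the first 60 bases of the gene sequence only. It assumes the amino acid assignments of NCBI translation table 11, which are the same as those of the standard genetic code; it does not handle the alternative start codons that table 11 permits, which is not an issue here because the gene begins with ATG. The names `CODON_TABLE`, `translate`, and `first_line` are hypothetical helpers introduced for illustration.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
# Assumption: amino acid assignments of translation table 11,
# identical to the standard genetic code ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    # Remove the spaces and line breaks used to group bases in the record.
    cleaned = "".join(dna.split()).upper()
    protein = []
    # Step through complete codons only; a trailing partial codon is ignored.
    for i in range(0, len(cleaned) - 2, 3):
        aa = CODON_TABLE[cleaned[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence, copied from the record above.
first_line = "ATGAAAGCGA CCCAAACCTT ACCTCTGAAA AATTATCGCT GGAAAGGCAT CAACAGCAAC"
print(translate(first_line))  # MKATQTLPLKNYRWKGINSN
```

The output matches the first 20 residues of the protein sequence. Passing the full gene sequence to the same function would be expected to stop at the final TAG codon.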