Gene VC0395_A1726 details


Gene Information

Locus tag: VC0395_A1726
Symbol: flaB
ID: 5135069
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Vibrio cholerae O395
Kingdom: Bacteria
Replicon accession: NC_009457
Strand:
Start bp: 1849386
End bp: 1850516
Gene Length: 1131 bp
Protein Length: 376 aa
Translation table: 11
GC content: 50%
IMG OID: 640533183
Product: flagellin
Protein accession: YP_001217665
Protein GI: 147674837
COG category: [N] Cell motility
COG ID: [COG1344] Flagellin and related hook-associated proteins
TIGRFAM ID:
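
The start, end, gene length, and protein length fields are related by simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (consistent with the reported 1131 bp) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Coordinates copied from the record; assumed to be 1-based and inclusive.
START_BP = 1849386
END_BP = 1850516


def gene_length(start: int, end: int) -> int:
    """Length in bp of an inclusive 1-based interval."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming one trailing stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1131
print(protein_length(length_bp))  # 376
```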


Plasmid Coverage information

Num covering plasmid clones: 35
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCAATTA ATGTAAACAC GAACGTGTCT GCCATGACCG CTCAGCGCTA TTTAAATGGT 
GCTGCTGATG GTATGCAGAA ATCGATGGAG CGTTTGTCGT CCGGCTACAA AATCAACAGT
GCCCGAGACG ATGCCGCAGG TCTGCAAATT TCTAACCGTT TGACATCGCA AAGTCGTGGT
TTGGACATGG CGGTGAAAAA CGCCAACGAT GGTATTTCCA TCGCCCAAAC TGCAGAAGGG
GCGATGAACG AAACGACCAA CATCTTACAA CGGATGCGCG ATCTTGCGTT GCAATCCTCT
AACGGCTCAA ACTCTTCTTC GGAACGCCGC GCGATTCAAG AAGAAGTGTC TGCCCTCAAT
GACGAGTTGA ACCGTATTGC AGAAACCACC TCTTTTGGTG GCAACAAACT GCTGAATGGT
TCGTTTGGTA GTAAATCGTT CCAGATTGGT GCGGATTCGG GTGAAGCGGT CATGCTTAGC
ATGGGCAGTA TGCGCTCGGA TACTCAAGCT ATGGGCGGAA AAAGCTATCG AGCTCAAGAA
GGCAAGGCCG CAGACTGGCG TGTCGGCGCA GCAACCGATT TGACCCTGAG CTATACTAAT
AAGCAGGGTG AAGCACGTGA AGTGACCATT AATGCCAAAC AAGGTGACGA CTTAGAAGAG
CTTGCGACTT ACATCAACGG TCAAACTGAA GACGTTAAAG CGTCGGTCGG TGAAGACGGT
AAGCTACAAC TGTTTGCTTC ATCACAAAAA GTCAATGGTG ATGTGACCAT TGGTGGTGGA
CTGGGTGGTG AAATCGGTTT TGATGCTGGC CGTAATGTGA CGGTGGCGGA TGTGAACGTT
TCAACCGTGG CCGGTTCGCA AGAAGCGGTA TCTATTCTGG ATGGGGCTCT GAAGGCGGTG
GATAGCCAAC GCGCTTCATT GGGTGCATTC CAGAACCGTT TCGGTCATGC GATCAGTAAC
TTGGATAACG TTAACGAAAA CGTCAACGCG TCTCGTAGCC GTATCCGTGA TACCGATTAT
GCTCGTGAAA CCACGGCGAT GACGAAGGCG CAAATATTGC AGCAGGCGAG TACCTCTGTG
TTGGCGCAAG CGAAGCAGTC ACCATCTGCA GCTCTGAGCT TATTGGGATA A
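
A GC content figure like the one reported in the record can be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted in as text with the 10-base spacing shown above. The helper name `gc_content` is hypothetical, and only the first sequence line is used as sample input, so the printed value is for that fragment, not the whole gene.

```python
def gc_content(dna: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and case."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied with its original spacing.
first_line = (
    "ATGGCAATTA ATGTAAACAC GAACGTGTCT "
    "GCCATGACCG CTCAGCGCTA TTTAAATGGT"
)
print(f"{gc_content(first_line):.1f}% GC in the first 60 bases")
```

To check the whole gene, pass all of the sequence lines joined into one string.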
 
Protein sequence
MAINVNTNVS AMTAQRYLNG AADGMQKSME RLSSGYKINS ARDDAAGLQI SNRLTSQSRG 
LDMAVKNAND GISIAQTAEG AMNETTNILQ RMRDLALQSS NGSNSSSERR AIQEEVSALN
DELNRIAETT SFGGNKLLNG SFGSKSFQIG ADSGEAVMLS MGSMRSDTQA MGGKSYRAQE
GKAADWRVGA ATDLTLSYTN KQGEAREVTI NAKQGDDLEE LATYINGQTE DVKASVGEDG
KLQLFASSQK VNGDVTIGGG LGGEIGFDAG RNVTVADVNV STVAGSQEAV SILDGALKAV
DSQRASLGAF QNRFGHAISN LDNVNENVNA SRSRIRDTDY ARETTAMTKA QILQQASTSV
LAQAKQSPSA ALSLLG
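
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below uses a hand-built codon table rather than a library call. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as starts, which does not matter here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with their amino acids.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence from the record.
print(translate("ATGGCAATTA ATGTAAACAC GAACGTGTCT"))  # MAINVNTNVS
```

The output matches the first ten residues of the protein sequence above. The stop codon is dropped, which is why the protein listing has no trailing terminator.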