Gene Information |
Locus tag | VC0395_A1726 |
Symbol | flaB |
ID | 5135069 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Vibrio cholerae O395 |
Kingdom | Bacteria |
Replicon accession | NC_009457 |
Strand | - |
Start bp | 1849386 |
End bp | 1850516 |
Gene Length | 1131 bp |
Protein Length | 376 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640533183 |
Product | flagellin |
Protein accession | YP_001217665 |
Protein GI | 147674837 |
COG category | [N] Cell motility |
COG ID | [COG1344] Flagellin and related hook-associated proteins |
TIGRFAM ID | |
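The length fields above follow from the coordinates. With 1-based inclusive positions, the gene length is End bp minus Start bp plus one, and the protein length is the codon count minus the stop codon. A minimal Python sketch of that arithmetic, assuming inclusive coordinates and a single untranslated terminal stop codon (the function names are illustrative and not part of any database API):

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Start bp and End bp as listed in this record.
length_bp = gene_length(1849386, 1850516)
print(length_bp, protein_length(length_bp))  # 1131 376
```

Both results agree with the Gene Length and Protein Length rows of this record.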
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAATTA ATGTAAACAC GAACGTGTCT GCCATGACCG CTCAGCGCTA TTTAAATGGT GCTGCTGATG GTATGCAGAA ATCGATGGAG CGTTTGTCGT CCGGCTACAA AATCAACAGT GCCCGAGACG ATGCCGCAGG TCTGCAAATT TCTAACCGTT TGACATCGCA AAGTCGTGGT TTGGACATGG CGGTGAAAAA CGCCAACGAT GGTATTTCCA TCGCCCAAAC TGCAGAAGGG GCGATGAACG AAACGACCAA CATCTTACAA CGGATGCGCG ATCTTGCGTT GCAATCCTCT AACGGCTCAA ACTCTTCTTC GGAACGCCGC GCGATTCAAG AAGAAGTGTC TGCCCTCAAT GACGAGTTGA ACCGTATTGC AGAAACCACC TCTTTTGGTG GCAACAAACT GCTGAATGGT TCGTTTGGTA GTAAATCGTT CCAGATTGGT GCGGATTCGG GTGAAGCGGT CATGCTTAGC ATGGGCAGTA TGCGCTCGGA TACTCAAGCT ATGGGCGGAA AAAGCTATCG AGCTCAAGAA GGCAAGGCCG CAGACTGGCG TGTCGGCGCA GCAACCGATT TGACCCTGAG CTATACTAAT AAGCAGGGTG AAGCACGTGA AGTGACCATT AATGCCAAAC AAGGTGACGA CTTAGAAGAG CTTGCGACTT ACATCAACGG TCAAACTGAA GACGTTAAAG CGTCGGTCGG TGAAGACGGT AAGCTACAAC TGTTTGCTTC ATCACAAAAA GTCAATGGTG ATGTGACCAT TGGTGGTGGA CTGGGTGGTG AAATCGGTTT TGATGCTGGC CGTAATGTGA CGGTGGCGGA TGTGAACGTT TCAACCGTGG CCGGTTCGCA AGAAGCGGTA TCTATTCTGG ATGGGGCTCT GAAGGCGGTG GATAGCCAAC GCGCTTCATT GGGTGCATTC CAGAACCGTT TCGGTCATGC GATCAGTAAC TTGGATAACG TTAACGAAAA CGTCAACGCG TCTCGTAGCC GTATCCGTGA TACCGATTAT GCTCGTGAAA CCACGGCGAT GACGAAGGCG CAAATATTGC AGCAGGCGAG TACCTCTGTG TTGGCGCAAG CGAAGCAGTC ACCATCTGCA GCTCTGAGCT TATTGGGATA A
|
Protein sequence | MAINVNTNVS AMTAQRYLNG AADGMQKSME RLSSGYKINS ARDDAAGLQI SNRLTSQSRG LDMAVKNAND GISIAQTAEG AMNETTNILQ RMRDLALQSS NGSNSSSERR AIQEEVSALN DELNRIAETT SFGGNKLLNG SFGSKSFQIG ADSGEAVMLS MGSMRSDTQA MGGKSYRAQE GKAADWRVGA ATDLTLSYTN KQGEAREVTI NAKQGDDLEE LATYINGQTE DVKASVGEDG KLQLFASSQK VNGDVTIGGG LGGEIGFDAG RNVTVADVNV STVAGSQEAV SILDGALKAV DSQRASLGAF QNRFGHAISN LDNVNENVNA SRSRIRDTDY ARETTAMTKA QILQQASTSV LAQAKQSPSA ALSLLG
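The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The gene sequence as listed already reads from the start codon to the stop codon, so no reverse complement is applied here even though the gene lies on the minus strand. A minimal sketch, assuming the standard codon-to-residue assignments (translation table 11 uses the same assignments as the standard code and differs in the permitted start codons); `clean`, `translate`, and `gc_percent` are hypothetical helper names:

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces that group bases in tens and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 and last 12 bases of the gene sequence in this record.
print(translate("ATGGCAATTA ATGTAAACAC GAACGTGTCT"))  # MAINVNTNVS
print(translate("TTATTGGGAT AA"))  # LLG
```

The two fragments translate to the first ten and last three residues of the listed protein sequence. Passing the full gene sequence to `gc_percent` gives a value to compare against the GC content row.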