Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | VC0395_A1780 |
Symbol | flaA |
ID | 5135745 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Vibrio cholerae O395 |
Kingdom | Bacteria |
Replicon accession | NC_009457 |
Strand | - |
Start bp | 1887997 |
End bp | 1889136 |
Gene Length | 1140 bp |
Protein Length | 379 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640533237 |
Product | flagellin |
Protein accession | YP_001217705 |
Protein GI | 147674518 |
COG category | [N] Cell motility |
COG ID | [COG1344] Flagellin and related hook-associated proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 0.315393 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCATTA ACGTAAATAC CAACGTGTCG GCGATGACCG CACAACGTTA TCTGACCAAG GCGACGGGAG AGCTTAACAC CTCCATGGAA CGCCTCTCAT CAGGTAATCG CATTAACAGT GCAAAAGATG ACGCGGCAGG CCTGCAGATT TCAAACCGTT TAACGGCGCA ATCTCGTGGT TTGGATGTGG CAATGCGTAA CGCCAACGAT GGTATTTCGA TTGCTCAAAC CGCAGAAGGT GCGATGAATG AATCGACCAG CATTTTGCAG CGTATGCGTG ACCTCGCCTT ACAATCGGCG AACGGTACCA ACTCAGCGTC AGAGCGTCAG GCTCTGAATG AAGAGTCGGT GGCACTGCAA GATGAACTGA ACCGTATCGC TGAAACCACG TCATTTGGTG GTCGTAAGCT ACTCAATGGT TCGTTTGGTG AAGCTTCGTT CCAAATCGGT TCTAGCTCGG GTGAAGCGAT CATTATGGGA CTGACCAGTG TACGTGCTGA TGATTTCCGC ATGGGTGGCC AATCCTTTAT TGCCGAACAA CCTAAGACTA AAGAGTGGGG GGTACCACCT ACCGCTCGTG ACCTGAAGTT TGAATTCACC AAGAAAGACG GTGAGGCAGT CGTGCTTGAT ATCATTGCCA AAGATGGTGA TGACATTGAA GAGCTGGCCA CTTACATCAA CGGTCAAACG GATCTGTTCA AAGCTTCGGT TGACCAAGAA GGCAAACTGC AGATTTTTGT TGCTGAACCC AATATTGAAG GCAACTTCAA TATCTCCGGT GGTTTGGCAA CCGAACTTGG CCTCAATGGT GGCCCTGGTG TGAAAACCAC AGTTCAAGAC ATTGATATCA CCAGTGTCGG TGGTTCACAG AACGCCGTGG GTATCATCGA TGCCGCATTA AAATACGTCG ATTCGCAACG AGCTGACCTC GGTGCTAAAC AGAACCGACT CAGTCACAGC ATCAGTAACC TGTCGAATAT TCAGGAGAAC GTGGAAGCGT CGAAAAGTCG GATTAAAGAT ACGGATTTTG CGAAGGAAAC AACGCAACTT ACCAAATCTC AGATTCTGCA ACAGGCGGGG ACTTCAATTC TTGCCCAAGC GAAACAGTTG CCAAACTCTG CAATCTCGTT ATTGCAGTAG
|
Protein sequence | MTINVNTNVS AMTAQRYLTK ATGELNTSME RLSSGNRINS AKDDAAGLQI SNRLTAQSRG LDVAMRNAND GISIAQTAEG AMNESTSILQ RMRDLALQSA NGTNSASERQ ALNEESVALQ DELNRIAETT SFGGRKLLNG SFGEASFQIG SSSGEAIIMG LTSVRADDFR MGGQSFIAEQ PKTKEWGVPP TARDLKFEFT KKDGEAVVLD IIAKDGDDIE ELATYINGQT DLFKASVDQE GKLQIFVAEP NIEGNFNISG GLATELGLNG GPGVKTTVQD IDITSVGGSQ NAVGIIDAAL KYVDSQRADL GAKQNRLSHS ISNLSNIQEN VEASKSRIKD TDFAKETTQL TKSQILQQAG TSILAQAKQL PNSAISLLQ
|
| |