Gene VC0395_A1727 details

Gene Information

Locus tag: VC0395_A1727
Symbol: flaD
ID: 5137844
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Vibrio cholerae O395
Kingdom: Bacteria
Replicon accession: NC_009457
Strand:
Start bp: 1850752
End bp: 1851885
Gene Length: 1134 bp
Protein Length: 377 aa
Translation table: 11
GC content: 49%
IMG OID: 640533184
Product: flagellin
Protein accession: YP_001217666
Protein GI: 147675484
COG category: [N] Cell motility
COG ID: [COG1344] Flagellin and related hook-associated proteins
TIGRFAM ID:
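The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon; the function names (gene_length, expected_protein_length, gc_percent) are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (an assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace in the sequence."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    length = gene_length(1850752, 1851885)
    print(length)                           # gene length in bp
    print(expected_protein_length(length))  # protein length in aa
```

With the values from this record, the first helper gives 1134 bp and the second 377 aa, matching the Gene Length and Protein Length fields. gc_percent can be applied to the gene sequence block to compare against the reported GC content.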


Plasmid Coverage information

Num covering plasmid clones: 46
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCAGTGA ATGTAAATAC CAACGTAGCA GCAATGACAG CTCAACGTTA TTTGACTGGT 
GCAACCAATG CACAACAAAC TTCAATGGAG CGTCTATCTT CAGGCTTTAA AATCAATAGT
GCTAAAGATG ATGCTGCCGG CCTACAAATC TCTAACCGCT TGAACGTACA AAGCCGCGGT
CTGGATGTGG CAGTACGCAA CGCGAATGAT GGTATTTCAA TTGCTCAAAC CGCAGAAGGC
GCGATGAATG AAACTACCAA CATTCTGCAA CGTATGCGTG ACTTGTCACT GCAATCTGCG
AACGGCTCGA ACTCCAAATC TGAGCGTGTG GCAATCCAAG AAGAGATCAC CGCACTGAAT
GATGAGCTGA ACCGTATTGC AGAAACCACG TCATTCGGTG GTAACAAGTT GCTCAACGGT
ACCTTCTCAA CCAAGTCGTT CCAAATCGGT GCTGACAACG GTGAGGCGGT CATGCTGACC
TTGAAAGACA TGCGCAGTGA TAACCGCATG ATGGGTGGTA CCAGCTATGT CGCGGCAGAA
GGCAAAGACA AAGACTGGAA AGTACAAGCG GGCGCGAACG ACATCACTTT CACGCTGAAA
GACATCGACG GCAATGACCA AACCATTACC GTGAACGCTA AAGAAGGCGA TGATATCGAA
GAAGTGGCGA CTTACATCAA CGGTCAAACC GACATGGTGA AAGCGTCTGT CAACGAGAAA
GGTCAGCTAC AAATCTTTGC TGGTAACAAC AAAGTCACCG GTGATGTAGC CTTCTCTGGT
GGTCTAGCGG GTGCTCTGAA CATGCAAGCG GGTACAGCAG AAACCGTTGA CACTATCGAT
GTGACTTCAG TTGGTGGCGC GCAACAATCG GTTGCAGTTA TCGACTCTGC GCTGAAGTAT
GTAGATAGTC ACCGTGCTGA ACTGGGTGCG TTCCAGAACC GTTTCAACCA TGCTATCAGC
AACTTGGATA ACATTAACGA AAACGTGAAT GCTTCTAAGA GCCGTATCAA AGACACCGAT
TTCGCGAAAG AGACTACAGC GCTCACCAAA TCGCAAATCC TGTCTCAAGC ATCAAGCTCT
GTGCTGGCTC AAGCCAAACA AGCGCCAAAC GCCGCACTCA GCCTGTTGGG TTAA
 
Protein sequence
MAVNVNTNVA AMTAQRYLTG ATNAQQTSME RLSSGFKINS AKDDAAGLQI SNRLNVQSRG 
LDVAVRNAND GISIAQTAEG AMNETTNILQ RMRDLSLQSA NGSNSKSERV AIQEEITALN
DELNRIAETT SFGGNKLLNG TFSTKSFQIG ADNGEAVMLT LKDMRSDNRM MGGTSYVAAE
GKDKDWKVQA GANDITFTLK DIDGNDQTIT VNAKEGDDIE EVATYINGQT DMVKASVNEK
GQLQIFAGNN KVTGDVAFSG GLAGALNMQA GTAETVDTID VTSVGGAQQS VAVIDSALKY
VDSHRAELGA FQNRFNHAIS NLDNINENVN ASKSRIKDTD FAKETTALTK SQILSQASSS
VLAQAKQAPN AALSLLG
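
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a sketch only: it assumes the codon-to-amino-acid assignments of translation table 11, which match the standard genetic code (the two tables differ in which codons may act as start codons), and it stops at the first stop codon. The name translate is a hypothetical helper, not a library function.

```python
BASES = "TCAG"
# Amino acids for the 64 codons, ordered with T, C, A, G at each position.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, ignoring whitespace, up to the first stop."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence listed above.
    print(translate("ATGGCAGTGA ATGTAAATAC CAACGTAGCA"))  # MAVNVNTNVA
```

The first 30 bases of the gene sequence translate to MAVNVNTNVA, the first ten residues of the protein sequence, and the final codons CTG TTG GGT TAA yield LLG followed by a stop.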