Gene VC0395_A1728 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVC0395_A1728 
SymbolflaE 
ID5136184 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVibrio cholerae O395 
KingdomBacteria 
Replicon accessionNC_009457 
Strand
Start bp1852175 
End bp1853311 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content50% 
IMG OID640533185 
Productflagellin 
Protein accessionYP_001217667 
Protein GI147675318 
COG category[N] Cell motility 
COG ID[COG1344] Flagellin and related hook-associated proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones50 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCATGA CGGTAAATAC CAATGTGTCT GCGCTGGTAG CACAGCGACA TCTTAATTCT 
GCGTCCGAGA TGCTCAATCA GTCTCTGGAG CGGCTCTCTT CTGGCAATCG AATCAACAGT
GCCAAAGATG ATGCGGCAGG GCTGCAGATC TCCAATCGTT TGGAAACGCA AATGCGTGGC
CTCGGCATTG CTGTGCGCAA TGCCAACGAT GGGATTTCGA TCATGCAGAC GGCTGAAGGG
GCAATGCAGG AAACCACTCA GCTATTGCAA CGCATGCGCG ACCTCTCTTT GCAATCGGCC
AACGGTTCAA ACAGTGCAGC CGAAAGAGTC GCATTACAAG AGGAAATGGC TGCTTTAAAC
GATGAATTGA ATCGAATCGC TGAAACCACC TCTTTTGCAG GGCGCAAGCT GCTCAATGGC
CAATTTATGA AAGCCAGTTT CCAAATTGGT GCCAGCAGTG GTGAAGCCGT ACAGCTTTCA
CTGCGCAATA TGCGATCAGA CAGTTTGGAG ATGGGCGGGT TTAGCTATGT TGCGGCTGCG
CTAGCCGATA AACAGTGGCA AGTTACAAAA GGTAAACAAC AGCTCAATAT CAGCTACGTC
AATGCGCAAG GGGAGAATGA GAACATTCAG ATCCAAGCCA AAGAGGGAGA CGATATTGAA
GAGTTGGCGA CTTACATCAA TGGCAAAACC GATAAAGTCT CTGCGTCCGT GAATGAAAAG
GGACAACTCC AGTTGTACAT CGCGGGGAAA GAGACGTCAG GCACCTTGTC GTTCAGTGGC
AGTTTAGCCA ACGAATTACA GATGAACTTA TTGGGTTATG AAGCGGTAGA TAATCTTGAT
ATCAGCAGTG CTGGCGGAGC GCAGCGCGCC GTCTCGGTGA TTGATACGGC ACTCAAGTAT
GTCGATGGGC ATCGCTCAGA GCTAGGGGCG ATGCAAAATC GTTTCCAACA CGCGATCAGT
AACCTCGATA ACGTGCATGA AAACCTAGCG GCCTCGAACA GCCGGATTAA AGATGCGGAT
TACGCCAAAG AAACCACGCA AATGATTAAG CAGCAAATTT TGCAGCAAGT CAGCACTTCT
GTGCTCGCTC AAGCGAAACG CCAGCCGAAG TTTGTGCTGT TTTTGCTGCG TAATTAA
 
Protein sequence
MAMTVNTNVS ALVAQRHLNS ASEMLNQSLE RLSSGNRINS AKDDAAGLQI SNRLETQMRG 
LGIAVRNAND GISIMQTAEG AMQETTQLLQ RMRDLSLQSA NGSNSAAERV ALQEEMAALN
DELNRIAETT SFAGRKLLNG QFMKASFQIG ASSGEAVQLS LRNMRSDSLE MGGFSYVAAA
LADKQWQVTK GKQQLNISYV NAQGENENIQ IQAKEGDDIE ELATYINGKT DKVSASVNEK
GQLQLYIAGK ETSGTLSFSG SLANELQMNL LGYEAVDNLD ISSAGGAQRA VSVIDTALKY
VDGHRSELGA MQNRFQHAIS NLDNVHENLA ASNSRIKDAD YAKETTQMIK QQILQQVSTS
VLAQAKRQPK FVLFLLRN