Gene VC0395_A1789 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVC0395_A1789 
SymbolflgE 
ID5137908 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVibrio cholerae O395 
KingdomBacteria 
Replicon accessionNC_009457 
Strand
Start bp1897469 
End bp1898773 
Gene Length1305 bp 
Protein Length434 aa 
Translation table11 
GC content47% 
IMG OID640533246 
Productflagellar hook protein FlgE 
Protein accessionYP_001217714 
Protein GI147674037 
COG category[N] Cell motility 
COG ID[COG1749] Flagellar hook protein FlgE 
TIGRFAM ID[TIGR03506] fagellar hook-basal body proteins 


Plasmid Coverage information

Num covering plasmid clones54 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCATATG TCTCTCTTAG TGGTTTATCC GCCGCGCAGA TGGATTTAAA TACCACCAGT 
AACAACATTG CGAACGCCAA TACGTTTGGT TTTAAAGAGT CGCGTGCCGA ATTTGGTGAT
GTGTATTCCA CCTCACTCTT TACTAACGCG AAGACGACAC CAGGGCAAGG TGTGCAAGCG
GCCAAAGTGG CACAGCAGTT CCACGAAGGT TCTAGCATCT ATACCAATAA CCCACTGGAT
TTGCGTATCG CGGGTACGGG CTTCTTCGCT GTCGCGAAAG ATCGCTTAGT TCCACAGCAA
AATGAGCTGA CTCGTAACGG CGCATTCCAC TTGGATAAAA ATAGCTTCAT GGTCACAGCG
AATGATGAGT TCCTGCTGGG TTATGAAGTA AACCCTGATA CGGGTGATGT CTTATCTTAT
GAGCCTAAGC CAATTAATAT TCCGCCGCAG TTCGGTAAAC CGAAACAGAC CGCGAATATT
GACTTAGGAG CAAACTTGCC TGCTAACGGT GATCTCAAAG ACCCTGCGCT GTTTGATATT
ACCGATCCTG AGACTTATAA CCGTACGACC TCATCGACCA TTTACGATTC TATGGGGCAA
CCTTATAAGT TAACGACGTA TTACTTGAAA GATATGAATC AAGCTAATAC ATGGCAGACC
TACTACACCG TCACTGACAA AACCGGTGAA AAGCCGATTA ACGTTGTCGG TGGTGATGCG
GCAAGCCCAA CAGGCCACGT GGGACATACC ATGCGTTTTA ACAACGATGG TACCTTGTCA
AGCTTAAACA ACGGTCAACC AATTGTGACT GAGCCACTGG GTGGCGGTGC CAATCCAGTC
GATCTGAACG GTGCGGATGT TAACCAAACC TTGTCGTTTA GCTTGGATTC TGCCACGCAG
TTTGCCGCAC CTTTTGAGTT GACCAAGTTT GATCAAGATG GTGCGACGAC AGGCTTCCTG
ACGAAGATCG ACTTTGATGA AAATGGTAGC GTACTGGCTA CTTACTCAAA CGGCATTAAC
ACCACTCTAG GCCGTGTTGC GCTGGTGCGT GTTGCGAACG AGCAAGGACT GGACAAAAAA
GGTGGTACTC AGTGGGATGC TACTCAGTTC TCGGGCGCGA AAATCTGGGG TGAATCGAAT
AAAGGTTCGT TTGGGTCGAT CAGTAATGGT TCATTAGAGC AGTCGAACAT CGATATGACG
CAAGAGCTGG TGGATTTGAT TTCCGCTCAG CGTAACTTCC AAGCCAACTC ACGTGCTTTA
GAAGTACACA ACGGTCTGCA ACAAAATATC CTGCAGATTC GTTAA
 
Protein sequence
MSYVSLSGLS AAQMDLNTTS NNIANANTFG FKESRAEFGD VYSTSLFTNA KTTPGQGVQA 
AKVAQQFHEG SSIYTNNPLD LRIAGTGFFA VAKDRLVPQQ NELTRNGAFH LDKNSFMVTA
NDEFLLGYEV NPDTGDVLSY EPKPINIPPQ FGKPKQTANI DLGANLPANG DLKDPALFDI
TDPETYNRTT SSTIYDSMGQ PYKLTTYYLK DMNQANTWQT YYTVTDKTGE KPINVVGGDA
ASPTGHVGHT MRFNNDGTLS SLNNGQPIVT EPLGGGANPV DLNGADVNQT LSFSLDSATQ
FAAPFELTKF DQDGATTGFL TKIDFDENGS VLATYSNGIN TTLGRVALVR VANEQGLDKK
GGTQWDATQF SGAKIWGESN KGSFGSISNG SLEQSNIDMT QELVDLISAQ RNFQANSRAL
EVHNGLQQNI LQIR