Gene VEA_004161 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVEA_004161 
Symbol 
ID8557900 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVibrio sp. Ex25 
KingdomBacteria 
Replicon accessionNC_013456 
Strand
Start bp2832149 
End bp2833282 
Gene Length1134 bp 
Protein Length377 aa 
Translation table11 
GC content47% 
IMG OID646407261 
Productflagellin protein FlaD 
Protein accessionYP_003286784 
Protein GI262394930 
COG category[N] Cell motility 
COG ID[COG1344] Flagellin and related hook-associated proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0793606 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAGTGA ATGTAAACAC TAACGTATCA GCGATGACCG CTCAGCGTTA CCTAAACAAC 
GCAAACTCAG CTCAACAAAC TTCTATGGAG CGTTTGTCTT CGGGGTTCAA AATCAACAGC
GCAAAAGATG ACGCTGCGGG TCTTCAAATT TCGAACCGTT TGAACGTACA AAGCCGTGGT
TTGGATGTGG CGGTTCGCAA CGCGAACGAC GGTATCTCAA TTGCACAAAC CGCTGAAGGT
GCAATGAACG AGACCACTAA CATTCTTCAA CGTATGCGTG ACTTGTCTCT ACAATCAGCA
AACGGCTCAA ACTCGAAATC AGAGCGTGTG GCGATTCAAG AAGAAGTGAC CGCGCTAAAT
GACGAACTAA ACCGTATCGC AGAAACCACG TCTTTCGGTG GTAACAAGCT ACTAAACGGC
ACACACGGTG CGAAATCGTT CCAAATCGGT GCGGATAACG GTGAAGCAGT GATGCTTGAG
CTAAAAGACA TGCGTTCAGA CAACAAAATG ATGGGCGGTG TGAGCTACCA AGCTGAAAGT
GGTAAAGGTA AAGACTGGAA CGTTGCTGAA GGTAAGAACG ACCTAAAAAT CAACCTAACG
GACAGCTACG GTCAAGAGCA AGAAATCAAC ATCAGCGCAA AAGCGGGTGA CGATATCGAA
GAGCTTGCGA CTTACATTAA CGGTCAAACT GACCTAGTGA AAGCATCAGT AGACCAAGAT
GGTAAACTGC AAATCTTTGC TGGCAACAAC AAAGTCGAAG GCGAAGTCGA GTTTTCAGGC
GGCCTATCTG GCGAGCTAGG TTTGGGCGAA GGTAAAAAAG TGACGGTAGA TACTATTGAC
GTAACCTCAG TTGGTGGCGC ACAAGAATCT GTGGCTATCA TCGATGCGGC ACTGAAATAC
GTAGACAGCC ACCGCGCAGA GCTGGGTGCA TTCCAGAACC GTTTCAACCA TGCAATCAGC
AACTTGGACA ACATTAACGA GAACGTGAAC GCGTCGAAGA GCCGTATCAA AGATACTGAC
TTCGCGAAAG AAACGACTCA AATGACCAAG TCACAAATTC TATCGCAAGC GTCAAGCTCA
ATCCTTGCGC AAGCGAAACA AGCGCCAAAC TCAGCGCTAA GTCTGCTAGG TTAA
 
Protein sequence
MAVNVNTNVS AMTAQRYLNN ANSAQQTSME RLSSGFKINS AKDDAAGLQI SNRLNVQSRG 
LDVAVRNAND GISIAQTAEG AMNETTNILQ RMRDLSLQSA NGSNSKSERV AIQEEVTALN
DELNRIAETT SFGGNKLLNG THGAKSFQIG ADNGEAVMLE LKDMRSDNKM MGGVSYQAES
GKGKDWNVAE GKNDLKINLT DSYGQEQEIN ISAKAGDDIE ELATYINGQT DLVKASVDQD
GKLQIFAGNN KVEGEVEFSG GLSGELGLGE GKKVTVDTID VTSVGGAQES VAIIDAALKY
VDSHRAELGA FQNRFNHAIS NLDNINENVN ASKSRIKDTD FAKETTQMTK SQILSQASSS
ILAQAKQAPN SALSLLG