Gene VIBHAR_01301 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVIBHAR_01301 
Symbol 
ID5553172 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVibrio harveyi ATCC BAA-1116 
KingdomBacteria 
Replicon accessionNC_009783 
Strand
Start bp1308150 
End bp1309283 
Gene Length1134 bp 
Protein Length377 aa 
Translation table11 
GC content47% 
IMG OID640906793 
Productflagellin 
Protein accessionYP_001444505 
Protein GI156973598 
COG category[N] Cell motility 
COG ID[COG1344] Flagellin and related hook-associated proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAGTGA ATGTAATCAC AAACGTATCA GCGATGACCG CTCAGCGTTA CTTAAACAAC 
GCAAACTCAG CACAACAAAC TTCTATGGAG CGTTTGTCTT CAGGTTTCAA AATCAACAGC
GCGAAAGACG ACGCTGCGGG TCTTCAAATC TCGAACCGCT TGAACGTACA AAGCCGTGGC
CTGGATGTGG CTGTTCGTAA CGCGAACGAC GGTATCTCTA TTGCACAAAC TGCTGAAGGT
GCAATGAACG AGACCACTAA CATCCTACAA CGTATGCGTG ACCTGTCTCT TCAATCAGCA
AACGGTTCTA ACTCAAAAGC AGAACGTGTT GCGATTCAAG AAGAAGTAAC AGCACTGAAC
GATGAACTAA ACCGTATCGC TGAAACGACT TCTTTCGGTG GTAACAAGCT TCTAAACGGT
ACTTACGGTA CTCAGTCTTT CCAAATCGGT GCGGACAACG GTGAAGCAGT AATGCTTAAC
CTAAAAGACA TGCGCTCAGA CAATAAAATG ATGGGCGGCG TGAGCTACCA AGCTGACAAC
GGTAAAGACA AAAACTGGAA CGTTGAAGCG GGCAAAAACG ACCTGAAAAT CAGCCTGACT
GACAGCTTTG GTCAAGAGCA AGAAATCAAC ATCAGCGCGA AAGCGGGTGA CGATATCGAA
GAGCTAGCGA CGTACATCAA TGGTCAAACT GACCTAGTAA AAGCGTCAGT AGACCAAGAC
GGTAAACTGC AAGTTTTCGC TGGTAACAAC AAAGTTGAAG GTGACGTTGA ATTCTCTGGC
GGCCTATCTG GTGAGCTAGG TCTAAACGAC GGCAAGAAAG TAACGGTTGA TACTATCGAT
GTAACATCAG TAGGTGGCGC ACAAGAATCT GTGGCTATCA TTGATGCGGC TTTGAAATAC
GTAGACAGCC ACCGTGCAGA GTTGGGTGCT TTCCAAAACC GTTTCAACCA CGCAATCAGC
AACTTGGATA ACATCAACGA GAACGTGAAC GCGTCGAAGA GCCGTATCAA AGATACTGAC
TTCGCGAAAG AAACGACTCA AATGACCAAA TCACAAATTC TATCGCAAGC GTCAAGCTCT
ATCCTTGCGC AAGCGAAACA AGCGCCAAAC TCGGCGCTAA GCCTACTAGG TTAA
 
Protein sequence
MAVNVITNVS AMTAQRYLNN ANSAQQTSME RLSSGFKINS AKDDAAGLQI SNRLNVQSRG 
LDVAVRNAND GISIAQTAEG AMNETTNILQ RMRDLSLQSA NGSNSKAERV AIQEEVTALN
DELNRIAETT SFGGNKLLNG TYGTQSFQIG ADNGEAVMLN LKDMRSDNKM MGGVSYQADN
GKDKNWNVEA GKNDLKISLT DSFGQEQEIN ISAKAGDDIE ELATYINGQT DLVKASVDQD
GKLQVFAGNN KVEGDVEFSG GLSGELGLND GKKVTVDTID VTSVGGAQES VAIIDAALKY
VDSHRAELGA FQNRFNHAIS NLDNINENVN ASKSRIKDTD FAKETTQMTK SQILSQASSS
ILAQAKQAPN SALSLLG