Gene VIBHAR_03171 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVIBHAR_03171 
Symbol 
ID5553942 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVibrio harveyi ATCC BAA-1116 
KingdomBacteria 
Replicon accessionNC_009783 
Strand
Start bp3200524 
End bp3201654 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content45% 
IMG OID640908652 
Productflagellin 
Protein accessionYP_001446347 
Protein GI156975440 
COG category[N] Cell motility 
COG ID[COG1344] Flagellin and related hook-associated proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGATTA ACGTTAATAC TAACGTTTCT GCGATGACCG CACAGCGTTA CCTAAACCAA 
GCGGCTGAAG GTCAACAAAA ATCAATGGAG CGTTTGTCTT CGGGTTATAA AATCAATAGC
GCGAAAGATG ATGCTGCAGG TCTACAGATT TCTAACCGTT TGAATGCACA GAGCCGTGGT
CTAGACATGG CTGTGAAAAA CGCGAACGAC GGTATTTCTA TTGCACAGGT TGCTGAAGGT
GCAATGAATG AATCTACCAA CATCCTACAA CGTATGCGTG ACCTATCGCT TCAATCTGCG
AACGGTTCAA ACTCGCGTTC TGAGCGTGTA GCGATTCAAG AAGAAGTAAC AGCACTTAAC
GACGAACTAA ACCGTATCGC TGAAACAACT TCATTTGGTG GTAACAAGCT TCTTAACGGT
ACTTACGGTA CTCAATCTTT CCAAATCGGT GCGGACTCTG GTGAAGCAGT AATGCTTTCT
ATGGGTAACT TACGTTCTGA TACTTCTGCA ATGGGCGGTA AGAGCTACTC AGCAGAAGAT
GGCAAAGATG CATCTTGGGC AGTAGGTGAT AACACTGAAC TTAAGATGAC TTACACCAAC
AAGCAAGGTG AAGAGAAAGA GCTGACTATC AACGCGAAAC AAGGCGATGA TATCGAGCAG
CTAGCAACTT ACATCAACGG TCAAAGCGAA GATGTAAAAG CATCTGTTGG TGAAGATGGC
AAGCTACAAG TATTCGCTGC TACTCAAAAA GTAACAGGCG ATGTAGAGTT CTCTGGCAAC
CTAGCGGGTG AAATTGGCTT CGGCGATGCA AAAGACGTAA CGGTTAAAGA CATCGACGTA
ACCACAGTTG CAGGCTCTCA AGAAGCAGTA GCGATCATTG ACGGCGCACT AAAATCAGTA
GACAGCCAAC GTGCGTCTCT TGGTGCATTC CAAAACCGTT TCAACCACGC TATCAGCAAC
CTAGACAACA TTAACGAGAA TGTTAACGCG TCTAACAGCC GTATTAAAGA TACTGACTAC
GCGAAGGAAA CGACAGCAAT GACTAAGTCG CAAATCCTTC AACAAGCAAG TACTTCAATC
CTGGCACAAG CGAAGCAGTC ACCATCTGCA GCGCTAAGCT TGTTGGGCTA A
 
Protein sequence
MAINVNTNVS AMTAQRYLNQ AAEGQQKSME RLSSGYKINS AKDDAAGLQI SNRLNAQSRG 
LDMAVKNAND GISIAQVAEG AMNESTNILQ RMRDLSLQSA NGSNSRSERV AIQEEVTALN
DELNRIAETT SFGGNKLLNG TYGTQSFQIG ADSGEAVMLS MGNLRSDTSA MGGKSYSAED
GKDASWAVGD NTELKMTYTN KQGEEKELTI NAKQGDDIEQ LATYINGQSE DVKASVGEDG
KLQVFAATQK VTGDVEFSGN LAGEIGFGDA KDVTVKDIDV TTVAGSQEAV AIIDGALKSV
DSQRASLGAF QNRFNHAISN LDNINENVNA SNSRIKDTDY AKETTAMTKS QILQQASTSI
LAQAKQSPSA ALSLLG