Gene BMASAVP1_A3449 details


Gene Information

Locus tag: BMASAVP1_A3449
Symbol: fliD-1
ID: 4679384
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia mallei SAVP1
Kingdom: Bacteria
Replicon accession: NC_008785
Strand:
Start bp: 3413385
End bp: 3414905
Gene Length: 1521 bp
Protein Length: 506 aa
Translation table: 11
GC content: 67%
IMG OID: 639847702
Product: flagellar hook-associated protein 2
Protein accession: YP_994727
Protein GI: 121599739
COG category: [N] Cell motility
COG ID: [COG1345] Flagellar capping protein
TIGRFAM ID:
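
The coordinate and length fields above can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. Neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 3413385
END_BP = 3414905
GENE_LENGTH_BP = 1521
PROTEIN_LENGTH_AA = 506


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions both comparisons hold for this record.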


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.0369122
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCCACGC CCGTCACCAG CACGACGCAG CAGCAAACCA ACTCGGCGCT GCAGCAAGCA 
GCGCAGTCGA TCATCAGCGG CTCGACAGGC AATTCGTCGA TGGACGTCAA CTCGCTCGTC
ACCGCGCTCG TCAACGCGAA GACGGCCGGC CAGAGCGCGG CGCTGTCGAC TTCGATCGCG
ACCGACCAGA CGACGCTTTC CGCGCTCGGC ACGCTGAAAG CCGCGCTCAC CGCGCTGCAA
GCGGGGATCG GCTCGCTCAG CGACGGCACG CTGACCCAGA AATTCACCGC CACGGCAACG
GGCACCGGGC TCACCGCGAC GACGGGCGCG GGTGCGGTGG CCGGCAGCTA CTCGGTCGCC
GTCACGCAGA TCGCCACGTC GCAGACGCTG TCTTCCGGCG CATTCAATGC GACGCAGCAG
CTCGGCACCG GCACGCTGAC GCTGAGCGTC GGCGGCAAAT CGACGTCGAT CTCGATCGAT
TCGACGAACA ACACGCTTTC CGGCATCACC GCCGCGATCA ACTCCGCGTC GAACAACCCC
GGCGTGACGG CGACGATCGT CACGGGCACC GACGGCGCGC ACCTCGTGCT GCGCTCGGCA
AGCACGGGCG CGGCCAACGT GATCAACGTC GGCGTGAGCA ACCTGTCCGG CGACAACGGG
CTGTCGAGCC TCGCCGTCAC GTCGACGGCG AGCACGACGG GCGGCCAGTC GACGATCCGC
TCGGGCGGCA GCGTCGCATG GTCGCAAAGC ACCTCCGCTC AGGACGCCGA ATTCACGGTG
GGCGGCATCG CCGCGTCGAG CGCGAGCAAT GCGGTGTCGG GCGCGATCGC CGGCGTCACG
CTGAACCTCA CGCAAGCCGC CGTCGGCGCC ACGCAGACGC TGAACGTGAC CACCGACACC
ACCGCGCAGG CCACCGCGAT CACGAACGTC GTCAACCTGT ACAACACGGT GATCACGACG
ATGTCGTCGC TGTCGTCGCT ATCCGGCGCG GGCACCAGCT CGCAAAGCGC GGGGCCGCTC
CTCGGCGACT CCACGCTCAA CATGATCCGC AACTCGCTCG CGCGCGTGGT GGGCGCGGGC
GTGACGACGG GCGGCTCGAC CACGTCGCTC GCGTCGATCG GCATCAAGTT CGCCGACGGC
TCGTCGTCGT CGCAGACGGA CGGCGCACTG ACCATCGATA CGGCCAAGCT CAACGCCGCG
CTGCAAAACA GCCCGTCGAC CGTCGCCGCG CTCTTCAATT CGACGAACGG CATCGGCGCG
CAGCTGAACA CCACGATCCA GAACTATGTG CAGACGGGCG GCGTCTTCGA TACGCGCTCG
AACGCGCTGA ACCAGGACCT GAAGAGCCTC GCGCAGCAGC AGACGCGGCT CGCGTCGTAC
GCGTCGCAAC TCACGTCGCA ATACAACGCG CAGTTCACCG CGCTCAACAC GCTGATGGCG
CAGATGAACA GCAACTCGAA CTACCTGACG CAGCTGTTCG GCGGCAGCAA CAGCTCGGGC
GCGATGGCGA ACAACAAGTA A
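
The gene sequence is printed in space-separated blocks of 10 bases, so the whitespace has to be removed before any computation. The following is a minimal sketch of that cleanup and of a GC-fraction calculation comparable to the GC content field. The helper names are hypothetical, and only the first line of the sequence is used as demo input.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above, used as a small demo input.
first_line = "ATGTCCACGC CCGTCACCAG CACGACGCAG CAGCAAACCA ACTCGGCGCT GCAGCAAGCA"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the full gene sequence through `clean_sequence` should give a string whose length can be compared with the Gene Length field.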
 
Protein sequence
MSTPVTSTTQ QQTNSALQQA AQSIISGSTG NSSMDVNSLV TALVNAKTAG QSAALSTSIA 
TDQTTLSALG TLKAALTALQ AGIGSLSDGT LTQKFTATAT GTGLTATTGA GAVAGSYSVA
VTQIATSQTL SSGAFNATQQ LGTGTLTLSV GGKSTSISID STNNTLSGIT AAINSASNNP
GVTATIVTGT DGAHLVLRSA STGAANVINV GVSNLSGDNG LSSLAVTSTA STTGGQSTIR
SGGSVAWSQS TSAQDAEFTV GGIAASSASN AVSGAIAGVT LNLTQAAVGA TQTLNVTTDT
TAQATAITNV VNLYNTVITT MSSLSSLSGA GTSSQSAGPL LGDSTLNMIR NSLARVVGAG
VTTGGSTTSL ASIGIKFADG SSSSQTDGAL TIDTAKLNAA LQNSPSTVAA LFNSTNGIGA
QLNTTIQNYV QTGGVFDTRS NALNQDLKSL AQQQTRLASY ASQLTSQYNA QFTALNTLMA
QMNSNSNYLT QLFGGSNSSG AMANNK
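
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below uses the amino-acid assignments of NCBI translation table 11, which match the standard code. It is a simplified illustration: it does not handle the alternative start codons of table 11 (this gene starts with ATG), and it raises `KeyError` on any character other than A, C, G, or T. The `translate` helper is hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in the order TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a block-formatted CDS, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGTCCACGC CCGTCACCAG CACGACGCAG"))  # prints MSTPVTSTTQ
```

The output matches the first 10 residues of the protein sequence listed above.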