Gene Vapar_4166 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVapar_4166 
Symbol 
ID7971903 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVariovorax paradoxus S110 
KingdomBacteria 
Replicon accessionNC_012791 
Strand
Start bp4407677 
End bp4408915 
Gene Length1239 bp 
Protein Length412 aa 
Translation table11 
GC content66% 
IMG OID644794752 
Productflagellin 
Protein accessionYP_002946045 
Protein GI239817135 
COG category[N] Cell motility 
COG ID[COG1344] Flagellin and related hook-associated proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGCAAG TCATCAACAC CAACAGCCTT TCCCTGCTCA CGCAGAACAA CCTCAACGCG 
TCGCAATCGT CGTTGAACAC GGCGATCCAG CGCCTGTCCT CCGGCCTGCG CATCAACAGC
GCCAAGGACG ACGCCGCCGG CCAGGCCATC GCCAACCGCT TCACGGCCAA CATCCGCGGC
CTGACGCAAG CCTCGCGCAA TGCCAACGAC GGCATCTCGC TGGCGCAGAC GACGGAAGGC
GCGCTCAAGG AAGTGAACAA CAACCTGCAG CGCGTTCGCG AACTGTCGGT GCAAGCCGCC
AACGGCAGCA ACTCGGGCAG CGACCTGCAA TCGATCCAGG ACGAAATCAA GCTGCGCCTC
GGTGAAATCG ACCGCGTGTC CAAGCAGACC GACTTCAACG GCGTGAAGGT CCTGTCGTCG
TCGGCCAAGC CGCTGACCGT GCAGGTCGGC GCCAACGACG GCGAGACCAT CGACATCGAC
CTGAAGGAAA TCAGCTCGAA GACGCTCGGC ATGCAAGGCT TCAACGTGGC CGGCCCCGGC
GCCACGGCTG CCTTCGCATT CGACGGCGTG ACCGGTTCGG CCGCTGGCGA TGCGCCCACG
GTTGCCCAGC TGAAGTCGCT CTACGGTTCG ACCACGGCCG TCACCACGAC CACCGTCGCC
GAGAAGACCA CCGACGACCT GTCGACCAAG CTCGGCCTGG CCGCAGGCGG CGCCACGCTC
ACCGGCAACA CCGTTGCCGA CAAGAACGGC AACCTGTTCG CCGAAGTCTC GATCACGCCC
ACCGGCGCCG GCGAAACCTC GTCGCTCATC AACCAGGGCT TCACCGGCGC GGTCGACGGC
ACCGCCATGT TCCGCTACAT CGCGCTCGAC CCGGCTTCGG CCGACACGGC CACGACCGCC
GGCACGGCTG CCTTCACCGT CGACACCTCG AAGGTCTCGG TCGCCAGCCT GCAGACCGGC
TCGACCGCCA GCCCGCTCGA AGCCATCGAC GCAGCGCTCA AGCAAGTCGA CGACCTGCGC
AGCTCGCTGG GTGCGGTGCA GAACCGTTTC GACTCGGTGA TCTCCAACCT GGGCACCGCC
ATCACCAACC TGTCGTCCTC GCGCTCGCGC ATCGAAGACG CCGACTACGC AACCGAAGTG
TCGAACATGA CCCGTGCGCA GATCCTGCAG CAAGCCGGTA CCTCGGTGCT GGCCCAGGCG
AACCAGACCA CGCAAGGCGT GCTGTCGCTC CTGCGCTGA
 
Protein sequence
MAQVINTNSL SLLTQNNLNA SQSSLNTAIQ RLSSGLRINS AKDDAAGQAI ANRFTANIRG 
LTQASRNAND GISLAQTTEG ALKEVNNNLQ RVRELSVQAA NGSNSGSDLQ SIQDEIKLRL
GEIDRVSKQT DFNGVKVLSS SAKPLTVQVG ANDGETIDID LKEISSKTLG MQGFNVAGPG
ATAAFAFDGV TGSAAGDAPT VAQLKSLYGS TTAVTTTTVA EKTTDDLSTK LGLAAGGATL
TGNTVADKNG NLFAEVSITP TGAGETSSLI NQGFTGAVDG TAMFRYIALD PASADTATTA
GTAAFTVDTS KVSVASLQTG STASPLEAID AALKQVDDLR SSLGAVQNRF DSVISNLGTA
ITNLSSSRSR IEDADYATEV SNMTRAQILQ QAGTSVLAQA NQTTQGVLSL LR