Gene Dvul_2421 details

Gene Information

Locus tag: Dvul_2421
Symbol:
ID: 4664224
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfovibrio vulgaris DP4
Kingdom: Bacteria
Replicon accession: NC_008751
Strand:
Start bp: 2816888
End bp: 2818459
Gene Length: 1572 bp
Protein Length: 523 aa
Translation table: 11
GC content: 63%
IMG OID: 639820669
Product: flagellar hook-associated protein 3
Protein accession: YP_967864
Protein GI: 120603464
COG category: [N] Cell motility
COG ID: [COG1344] Flagellin and related hook-associated proteins
TIGRFAM ID: [TIGR02550] flagellar hook-associated protein 3
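
The reported lengths follow from the coordinates in the record. A minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive (the record does not state this, but it is the convention that reproduces the listed gene length); the variable names are illustrative only:

```python
# Values copied from the Gene Information record.
start_bp = 2816888
end_bp = 2818459

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS ends with a stop codon, which is not part of the protein product.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)     # 1572
print(protein_length)  # 523
```

Both results agree with the Gene Length (1572 bp) and Protein Length (523 aa) fields.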


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.303682
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCGTTT CGCAACGGAT GATGTACCAG AACTTTCTGG GCAATATGAA CTCGTCGCTG 
GCCGACCTTG TGGAGTCGAA CATCCAGTCG TCGAGTCAGA AGCAGATCAA TCGCCCCTCC
GACGACCCCG TGGGCATGGC GCGAGTGCTC AACTACCGGG CTTCGCTCGG TGACGTGGAG
CAGTACAAGA AGAACGTGGA CACGGCCAAG GGGTGGCTCA ACCTCGCCGA CTCGACGCTC
ACGAGCATGA ACACCGTGCT CACACGCATC AAGGGGCTGG CCGAACAGGC CGCCACCAGC
AGCGTCAGTC CCGAGAACCG CAAACAGATC AGTTTCGAGT TGCGGCAGCA GTACGAGCAA
CTCATCAACC TCGCCAACAC CAAGTTCGAG AACAGGCACA TCTTCGCCGG TCACAAGTAC
GACGAGGCTC CCTTCGTGCA AGGGTTGCAG GTGACGACGA ACGACCCCAA CCTCCAGTCC
GCCGCACTGC CTGCGGGGAC GGAGTTTTCG GTCGAGGGCG CGTCGCCGAC CAGCATCATG
GTGCAGTTCC TCGATTCGGG CACCGTGGGC GGGGCCACAC CGCTGAACTA CCGCTTTTCC
GCCGACGGCG GGGCCTCATG GCAGACGGGT ACGCTTGCCG CGGCTGCCAC GTCCATCGAC
CTCGGCACCA CAGGTGTACG CCTCAACCTG CCCAACGGGG CGACGCTGAC CGCAGCCAAC
CCGCCCGACC CTGTGGATGC CGACAACGGT TCGCTCATCT ATGTGCGCCC CACGGCACTC
TACCGCGGTG ACGACAACGA CCTGCCCCCG CAGGTCGACG CCTACGGTGC TGCCGCCGGC
ACCACGGCAC AGGCGCGCGG GACGTTCTCG GGTGACGTGC TGGTGAAGGT GACGAATACA
GGCGGCGTGG ATGTGGCCGC CCCGCCCGCC ACCGTCAGCT ACGCCTACAG CACCGATGGG
GGCAGCAACT GGGTGGCGGC GACCTCCACC ACGTCGGTGG CGAATCAGGC GACCCTCTCC
ATACCCGGCG GTTTTCTCAC CGTGGATGTG CCAGCGGGTG ACACCACGCT GGACGAAGGC
CAGCAGTTTC TCGTAAGACC CCGCCGTGCC GACCTCGGTT TCGAGGTGTC GGAAGGCGAG
TACATCACCG TCAACTCGGT GGGTAAGGAC ATCTTCGGGG GTATCTACAA GAATCCTCTC
GACCCCGCTG CCACGGCACA GCCCGTGGGC GGCGGCACCA CGCCCAACAT GTTCGAGACG
GTGGGCAGAC TGGTCGCCTT CGCCGAGACG AACAATCAGG ACGGCGTCCA GAAGTGCCTT
GACGAACTCA ACGGGGTGAG CAAGACCATA CTGACGGCGG CTGCGAGCAT CGGCGGGCGC
GAGAACAGGC TCGATGTCGT CTCCGGGGTG CTCGACAACC AGAAGCTTGA TCAGACGACA
CGCATGAGCG CCGTGGAGGA CGTTGACTTC TCCGAACTCA TGACGCGTCT TGCGCAGGAG
CAGCTCATCT ACAACAGCGT CCTCAAGTCG TCGTCGATGA TCATGCAGAT GAACCTTTCG
AACTTCCTCT AG
 
Protein sequence
MRVSQRMMYQ NFLGNMNSSL ADLVESNIQS SSQKQINRPS DDPVGMARVL NYRASLGDVE 
QYKKNVDTAK GWLNLADSTL TSMNTVLTRI KGLAEQAATS SVSPENRKQI SFELRQQYEQ
LINLANTKFE NRHIFAGHKY DEAPFVQGLQ VTTNDPNLQS AALPAGTEFS VEGASPTSIM
VQFLDSGTVG GATPLNYRFS ADGGASWQTG TLAAAATSID LGTTGVRLNL PNGATLTAAN
PPDPVDADNG SLIYVRPTAL YRGDDNDLPP QVDAYGAAAG TTAQARGTFS GDVLVKVTNT
GGVDVAAPPA TVSYAYSTDG GSNWVAATST TSVANQATLS IPGGFLTVDV PAGDTTLDEG
QQFLVRPRRA DLGFEVSEGE YITVNSVGKD IFGGIYKNPL DPAATAQPVG GGTTPNMFET
VGRLVAFAET NNQDGVQKCL DELNGVSKTI LTAAASIGGR ENRLDVVSGV LDNQKLDQTT
RMSAVEDVDF SELMTRLAQE QLIYNSVLKS SSMIMQMNLS NFL
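
The protein sequence can be checked against the gene sequence by translating it codon by codon. The following is a rough sketch rather than the database's own procedure: it builds the standard codon assignments (which translation table 11 shares; table 11 differs mainly in which start codons it permits), strips the spacing used in the listing, and translates until the first stop codon. The names `CODON_TABLE`, `clean`, and `translate` are hypothetical helpers introduced here for illustration.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G); "*" marks a stop.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last rows of the gene sequence as listed; both start in frame,
# because every full row holds 60 bases.
first_row = "ATGCGCGTTT CGCAACGGAT GATGTACCAG AACTTTCTGG GCAATATGAA CTCGTCGCTG"
last_row = "AACTTCCTCT AG"

print(translate(first_row))  # MRVSQRMMYQNFLGNMNSSL
print(translate(last_row))   # NFL
```

The two outputs match the first 20 and the last 3 residues of the listed protein sequence. Passing the full gene sequence to `translate` should work the same way.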