Gene DvMF_0477 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDvMF_0477 
Symbol 
ID7172363 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio vulgaris str. 'Miyazaki F' 
KingdomBacteria 
Replicon accessionNC_011769 
Strand
Start bp560337 
End bp561761 
Gene Length1425 bp 
Protein Length474 aa 
Translation table11 
GC content65% 
IMG OID643538976 
Productputative RNA polymerase, sigma 70 family subunit 
Protein accessionYP_002434902 
Protein GI218885581 
COG category[K] Transcription 
COG ID[COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) 
TIGRFAM ID[TIGR02392] alternative sigma factor RpoH
[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones59 
Fosmid unclonability p-value0.0144971 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGATA CAGTGAAAAC GCCGCGCCGC GCCGCGCGCA CGCGCGGTGC CAAGGCCGCC 
CAATCCGCCC AATCCGTCGG ACACGCAGGG GCGACCGGCG CCCTCTCCTC TGCCCCAGGA
TATTCCGAAG CACCCGAAGC GCCCGTCACG CCGGAAGTCG TGCTTGAGGC CGAGGTGCTT
CTGGATCAGC AGCCGGAAGT TGCCACCAGG GCCCGCGCCA AGGCGGGCTC CGGCAAATCC
TCTTCCGGCA AGGCCACTGC GGGCAAGGCC GCGCGGGGCC GCAAAGCCGC CTCCGCCCCC
GCTGGCGACG GCGACGAGGA CGCACCCGCC TCCGCCCCCC TGGCCGGCGA AGCCCCCGAC
GAAGCGGACG ACGATGAAAG TGCCGACGAC GAACTGGACA TCGACTTCGA CGGCCCCGGT
GACGACGCCA TCGACGTCGA CCTGCTGCAC GGCGACGACG ATTCGCCGCC TGCCCGGCAT
GACGCACATG ACGGGGCGGA CGCATCTGAC GAGAGCGCAG GCCGCCACCG TTCCCTGCCC
GTACTGCGCC CGGCCATGCC CGGCGCCTCC ACGCGCGACT CGCTGCACCT GTACCTGCGT
GAAATCAGCC GGTTTCCGCT GCTGAAGCCG GACGAGGAAT TCGACCTTGC CCGGCGCGTG
CAGGAACAGG GCGACAGCGA TGCCGCCTTC CGGCTGGTCA CCTCGCACCT GCGCCTGGTG
GTCAAGATCG CCATGGACTT CCAGCGGCGC TGGATGCAGA ATGTGCTGGA TCTGATTCAG
GAAGGCAACG TGGGCCTGAT GCGCGCGGTC AACAAATTCG ACCCGGAAAA GGGCATCAAG
TTCTCGTACT ACGCCGCCTT CTGGATCAAG GCCTACATCC TCAAGTTCAT CATGGACAAC
TGGAGGATGG TCAAGATCGG CACCACGCAG GCCCAGCGCA AGCTGTTCTA CAATCTCAAC
AAGGAACGCC AGCGCCTTAT CGCGCAGGGG TATGACCCCG ACGCCACCAT CCTGTCGGAA
AAGCTCAACG TGACCGTCGA GCAGGTCATA GAGATGGAGC AGCGGCTTGA TTCGTCCGAC
ATGTCGCTGG ACATCACCGT GGGCGAGGAT TCCGGCGGGG CTACCCGCAT GGACTTCCTG
CCCGCCCTGG GCCCCGGCAT AGAGGAAACG CTGGCCAACA GCGAAATCGC GCGCATGGTG
CAGGACAGGG TGCAGACCAT CCTGCCGAAG CTGTCGGACA AGGAGGCGTA CATCCTCCAG
CACCGCCTGC TTTCGGAACA GCCGGTCACC CTGCGCGAGA TCGGCGAGAA GTACGACATC
ACCCGGGAAC GCGTCCGCCA GATAGAAGCG CGCCTGCTGC AAAAGCTGCG CGACCATCTG
TTCAAGGAAA TCCGCGACTT CTCCTCGGAC TGGATATCAC AGTAG
 
Protein sequence
MTDTVKTPRR AARTRGAKAA QSAQSVGHAG ATGALSSAPG YSEAPEAPVT PEVVLEAEVL 
LDQQPEVATR ARAKAGSGKS SSGKATAGKA ARGRKAASAP AGDGDEDAPA SAPLAGEAPD
EADDDESADD ELDIDFDGPG DDAIDVDLLH GDDDSPPARH DAHDGADASD ESAGRHRSLP
VLRPAMPGAS TRDSLHLYLR EISRFPLLKP DEEFDLARRV QEQGDSDAAF RLVTSHLRLV
VKIAMDFQRR WMQNVLDLIQ EGNVGLMRAV NKFDPEKGIK FSYYAAFWIK AYILKFIMDN
WRMVKIGTTQ AQRKLFYNLN KERQRLIAQG YDPDATILSE KLNVTVEQVI EMEQRLDSSD
MSLDITVGED SGGATRMDFL PALGPGIEET LANSEIARMV QDRVQTILPK LSDKEAYILQ
HRLLSEQPVT LREIGEKYDI TRERVRQIEA RLLQKLRDHL FKEIRDFSSD WISQ