Gene Dvul_1549 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDvul_1549 
Symbol 
ID4662551 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio vulgaris DP4 
KingdomBacteria 
Replicon accessionNC_008751 
Strand
Start bp1839178 
End bp1840248 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content58% 
IMG OID639819782 
Productsigma-70 region 2 domain-containing protein 
Protein accessionYP_966993 
Protein GI120602593 
COG category[K] Transcription 
COG ID[COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) 
TIGRFAM ID[TIGR02392] alternative sigma factor RpoH
[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000354786 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCAAAC GCCCGAAAGC GCAACACGAA GACATCGCCG ATGAGAAGCA CGTCACCGTA 
GTCGACGACG TCGAGATCAT CGACGGCCCG GACGATTCGG CTGACGACAC CGAAGATACC
GAAGATTTCG TCGACGAAGA CGATGTCATA GACATCGGCG ACGACGACCA CGCGCCCGAC
ACGTTCCACC TCAACGCTCC TGCCACGGTA TCCACCGGGA AGGACAGCCT GCACCTCTAC
CTGCGCGAGA TAAGCCGCTT TCCCATGCTC AAGCCCGAAG AGGAGTATGA GCTGGCGAAG
CGTGTCCGCG AAACGGGCGA CGGTGATGCG GCCTTTCGCC TCGTCTCGTC GCACCTGCGT
CTCGTGGTGA AGATCGCCAT GGACTTCCAG CGGCGCTGGA TGCAGAACGT GCTCGACCTC
ATCCAGGAGG GCAACGTCGG CCTCATGCGC GCGGTGAACA AGTTCGATCC CGAAAAGGGC
ATCAAGTTCT CGTATTACGC CGCCTTCTGG ATCAAGGCCT ACATCCTCAA GTTCATCATG
GACAACTGGC GGATGGTCAA GATCGGCACC ACGCAGGCGC AACGCAAGCT GTTCTACAAT
CTCAACAAGG AACGACAGAA GCTCATCCTG CAGGGCTACG ACCCGGACGC AGCCACCCTG
TCGGAACGCC TGAACGTGAC CAAGGAACAG GTCGTGGAGA TGGAACAGCG CCTCGACGCT
TCCGACGTGT CACTCGACAT CCCGGTGGGT GACGAGGGCG GCGGGGCTTC GCGCATGGAC
TTCCTGCCCG CACTCGGCCC CGGCATCGAG GACGCACTGT CGAACCATGA GATTGCCAGC
ATGGTGCAAA ACCGTCTGCA ATCCATCATT CCCAAGCTTT CCGACAAGGA AGTGGACATC
CTGCAGAACA GGCTTCTTTC TGAAGAACCA GTCACCTTGC GCGAGATTGG CGAGAAATAC
GACATCACCC GTGAACGCGT CCGCCAGATA GAGGCGCGTC TGCTGCAAAA GATACGCGAC
CACCTGTTCA AGGAAATCAA GGACTTTTCA TCCGACTGGA TCAACCAGTA G
 
Protein sequence
MTKRPKAQHE DIADEKHVTV VDDVEIIDGP DDSADDTEDT EDFVDEDDVI DIGDDDHAPD 
TFHLNAPATV STGKDSLHLY LREISRFPML KPEEEYELAK RVRETGDGDA AFRLVSSHLR
LVVKIAMDFQ RRWMQNVLDL IQEGNVGLMR AVNKFDPEKG IKFSYYAAFW IKAYILKFIM
DNWRMVKIGT TQAQRKLFYN LNKERQKLIL QGYDPDAATL SERLNVTKEQ VVEMEQRLDA
SDVSLDIPVG DEGGGASRMD FLPALGPGIE DALSNHEIAS MVQNRLQSII PKLSDKEVDI
LQNRLLSEEP VTLREIGEKY DITRERVRQI EARLLQKIRD HLFKEIKDFS SDWINQ