Gene DvMF_1158 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDvMF_1158 
Symbol 
ID7173060 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio vulgaris str. 'Miyazaki F' 
KingdomBacteria 
Replicon accessionNC_011769 
Strand
Start bp1422747 
End bp1424039 
Gene Length1293 bp 
Protein Length430 aa 
Translation table11 
GC content68% 
IMG OID643539670 
Producttransglutaminase family protein cysteine peptidase BTLCP 
Protein accessionYP_002435580 
Protein GI218886259 
COG category[S] Function unknown 
COG ID[COG3672] Predicted periplasmic protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones62 
Fosmid unclonability p-value0.0287549 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGAAAG GGGACAAGGG TGAGGGATTC GGCGCGGGAA GTGCGCCGCA TCATCCCGGT 
CCGCCGGTTG CGGCGGGCGG GGCAGGGGGT CTTTTCGGGC GTTGCCTGCG GGCGCTGCCG
GGACTTGCCC TGCTTGCCGT GTGTTCCCTG AGTATGGCGG CGGGCGTCGG CCCGTGGGAT
GCCATGCGGC CCGTCGCGGC CATGGCGGGC GGTTCCGCCC CCAAGGCCGC ACCGATGCAG
GCGGAGGATG CGGGCAGGGC CGCACGCACC GGGGCACCCG CCCCCGCGCA AGTGCGCGGC
GCGGCTGCAG GTGCCGCCAG TGATGCAGGC GAACCGGCGC GGGAAGCGGC GGGTGCGGAC
GCCGGCGTAC AGGCAGCGCG TGCCGTTGCC GATACGGCTG CCATGCCCGA CGTGCCCGAT
GCCGCGGCCA CACCCGGGGA GACCCGGCCC GCCCCGAAGA TGGACGGCGC CAGGACCGCC
GACGGAAAGG GAGCGCCCCC CGTGGCTGCG GACACCGCCC CGGCGGGCAC GAAAAAGGCA
GACGCGGAGA CGGTGGGCCT GCCCGATGCG CCACTTGTGG CCCAGGCGGA ACAGGCCGGA
CAGGCAGGAC AGGCCGGACA GGCGAAACCG GCAGGTCAGA CCGCGCAGCC GAGGCAGTCG
GCAGAATCCG AGCAGGCAGC GGAACCCCGG CAGGGCGCCT CGTCGGCCTC GCGTGGCATC
CGGTTGTTCA ACACCATAGA GTTTCGCGGG CCGCTGAAAA ACCTGCCCAA GTGGGACAGG
GTGCTGAACG TGGACCGCAA GACGCCGGGG CTGATTCCGG AACGGGCGCT GGGCGGGCGC
AATGCCCTGT GGGGCGAGCT GCGGTCCGAA TGGCGCAACC TGTCCACCAT GGACAAGCTG
CGCAACGTAA ATTCCCTGTT CAACCAGTGG CCCTACCGGC TGGACAGTGA AGTGTGGGGC
GTGGTGGACT ACTGGGCGGC GCCCATCGAG TTCCTGCGCA AGTCGGGTGA TTGTGAAGAC
TATGCCATAA CAAAGTATTT CGCCCTGAAG CAGCTTGGCG TGCCCGTTTC GGACATGCGC
ATCGTCATCC TGCTGGACTC CATTCGCCGG TTGGCCCATG CCATACTGGT GGTATACACG
GGGGGCGACG CCTACGTGCT GGACAACCTG TCCAACGTGG TGTTGTCGCA CCAGCGTTAC
GGGCATTATG TGCCGCAGTA TTCGATCAAC GAAGAATTCC GCTGGGCGCA CATCCCCATC
AAGGGCGGCC CAGGCACCCG AAGAATGCAA TAG
 
Protein sequence
MRKGDKGEGF GAGSAPHHPG PPVAAGGAGG LFGRCLRALP GLALLAVCSL SMAAGVGPWD 
AMRPVAAMAG GSAPKAAPMQ AEDAGRAART GAPAPAQVRG AAAGAASDAG EPAREAAGAD
AGVQAARAVA DTAAMPDVPD AAATPGETRP APKMDGARTA DGKGAPPVAA DTAPAGTKKA
DAETVGLPDA PLVAQAEQAG QAGQAGQAKP AGQTAQPRQS AESEQAAEPR QGASSASRGI
RLFNTIEFRG PLKNLPKWDR VLNVDRKTPG LIPERALGGR NALWGELRSE WRNLSTMDKL
RNVNSLFNQW PYRLDSEVWG VVDYWAAPIE FLRKSGDCED YAITKYFALK QLGVPVSDMR
IVILLDSIRR LAHAILVVYT GGDAYVLDNL SNVVLSHQRY GHYVPQYSIN EEFRWAHIPI
KGGPGTRRMQ