Gene Dvul_1106 details

Gene Information

Locus tag: Dvul_1106
Symbol:
ID: 4662795
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfovibrio vulgaris DP4
Kingdom: Bacteria
Replicon accession: NC_008751
Strand:
Start bp: 1344774
End bp: 1345889
Gene Length: 1116 bp
Protein Length: 371 aa
Translation table: 11
GC content: 69%
IMG OID: 639819335
Product: TPR repeat-containing protein
Protein accession: YP_966553
Protein GI: 120602153
COG category: [R] General function prediction only
COG ID: [COG0457] FOG: TPR repeat
TIGRFAM ID:
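
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the last codon of the CDS is a stop codon. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 1344774
END_BP = 1345889
GENE_LENGTH_BP = 1116
PROTEIN_LENGTH_AA = 371


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length_aa(gene_length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the final codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_length_aa(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the record: the span 1344774 to 1345889 covers 1116 bp, and 1116 bp corresponds to 371 residues plus one stop codon.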


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.178209
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGAGCATCA CAGCCACGCA CGACACCTCT TCGAAGGGCC GCCATGCCCT GCCCGCCCTC 
GCGCTTCTGC TGCTCATGAC ACTCTCGGGC TGCGCTGCGC GTGACATGGG CGACGGCACG
TCTCCCTCGC TCATGGAACG GCATGCCACG GGCAAGAGCC TCGCCCCCGA TGCGCCGGGG
ACACCGCAAC GGCAAGGCAC ACCGGCGGCC GAGGCGGAAG CCCACCTCCA GCGCGGCCTC
GCCTACCTCG CACAGGACAG GCCCGAACTC GCCTTCGAGC ATTTCAGCCG CGCCGCCTCG
CTTGCCCCGG AGATGGTCGA ACCCCGTCTG CAACGGGCGC GCCTGCTCGT CCGGCGCGGC
ATGCCCAACG AAGCCGCCGC CGACATCGAG GCCGTGCTCG CCGCCTCCCC GCAACACGCA
CGCGCATGGG AACTTGCGGG CATGGTGGCT TTCGACAGGG GACTTTTGGA CGAAGCCGAA
GCGGACTTCA CGCGCGCCAT CACCCTCGAC CCCGACCTTG CCTCCTGCTA CGCGCATCTC
GGTGCCGTAC ACGACTACAA GGGTCGCCCC GACCTCGCAC GTGACGTGTA CGCCGCCGCC
CTTGTCCGCT TTCCGCAGTC GGGCGAATTG CACAACAACC TCGGTGTCGC CTTCTCCATG
CTTGGAGACG ACGCCTCGGC CCTGCACCAC TTCCACGAGG CCGTCGTGCT GGGCGCGTCC
TCCGAACGGT CATGGAACAA CATGGGGCTG GCCCTGTGCC GTCTGGGGCG CTTCGACGAG
GCCTTCGAAG CCTTCCGCAA CGCGGGGGGC GAGGCCGCAG CGCATAACAA CCTCGGCTAT
TTCTTCCTCG TCAACGGCGA CGCCTCGCTG GCCGTGCAGC ACCTGCAACG CGCCGTCGAA
CTCGAACCCC GCTACTACGT CCGTGCCGCC GAGAACCTCA AGCGTGCCCG ACTCGCGGCC
AGATTCGCAG CGGGCGGCGT ACCTGTGCCC GCCGCGGGGC CACAGGCAGG AGGCATTGCC
GGTACGCCCG TGAACAAGGC AGGCGTCTTG CCGCCAGCAA CGGGCAAGGG TCCCGGCACA
CGGACGACCG GCGCAGGCGA ACGGGTCATC CAGTAG
 
Protein sequence
MSITATHDTS SKGRHALPAL ALLLLMTLSG CAARDMGDGT SPSLMERHAT GKSLAPDAPG 
TPQRQGTPAA EAEAHLQRGL AYLAQDRPEL AFEHFSRAAS LAPEMVEPRL QRARLLVRRG
MPNEAAADIE AVLAASPQHA RAWELAGMVA FDRGLLDEAE ADFTRAITLD PDLASCYAHL
GAVHDYKGRP DLARDVYAAA LVRFPQSGEL HNNLGVAFSM LGDDASALHH FHEAVVLGAS
SERSWNNMGL ALCRLGRFDE AFEAFRNAGG EAAAHNNLGY FFLVNGDASL AVQHLQRAVE
LEPRYYVRAA ENLKRARLAA RFAAGGVPVP AAGPQAGGIA GTPVNKAGVL PPATGKGPGT
RTTGAGERVI Q
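
The relationship between the gene sequence and the protein sequence above can be sketched in a few lines. This is an illustrative example, not the method used to produce the record. It assumes the codon assignments of translation table 11, which match the standard code for amino acids, and it ignores the alternative start codons of table 11 because this gene begins with ATG. The names `translate` and `gc_percent` are hypothetical helpers.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (first, second, third base).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    # First three 10-base blocks of the gene sequence above.
    print(translate("ATGAGCATCA CAGCCACGCA CGACACCTCT"))
    # Last line of the gene sequence above, ending in the TAG stop codon.
    print(translate("CGGACGACCG GCGCAGGCGA ACGGGTCATC CAGTAG"))
```

The two calls print MSITATHDTS and RTTGAGERVIQ, which match the first and last residues of the protein sequence listed above. Passing the full gene sequence to `gc_percent` gives a value that can be compared with the GC content field.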