Gene Dvul_0954 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDvul_0954 
Symbol 
ID4662705 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio vulgaris DP4 
KingdomBacteria 
Replicon accessionNC_008751 
Strand
Start bp1170533 
End bp1171969 
Gene Length1437 bp 
Protein Length478 aa 
Translation table11 
GC content67% 
IMG OID639819177 
ProductSel1 domain-containing protein 
Protein accessionYP_966402 
Protein GI120602002 
COG category[R] General function prediction only 
COG ID[COG0790] FOG: TPR repeat, SEL1 subfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.683818 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.302016 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCCGCA CCATTGCCGA AACACCCCGT CCCGCCGTAG CGTCCATGTC CAGCGGCTGC 
ATACCGCCCC ACCATCGCCC TGACACGTTG AAGCAGCGAT GGGGCACCCC GACTGCGGTC
TGCCTGCTGG TGACCTGTCT TTTCGCGATG ACGCTGCTCA CCACCGGCTG CAAGCCCCCC
CGCAGGACCC CGCCCACGCT CCCCGACCGG CCCGACATGG CTACGGTGGA ACAGCCCGTC
GTCCCGCCCG CCACTGTGGA CGGCAAGGAC GGCAAGGACC TTGACGCCCC CCAGACACCG
AAAGAACGCC TCGCGCTCGC CTTGCGCCTT CTTGATGACG GCGGCGACGG GGCCGACCCG
ACGCAGGCTG TGCAACTCAT CGAACAGGAT GCCACGACGG GATACGCCCC CGCGCAAGAC
CTGCTCGCCC GTCTTTCGCT GGAAGGGTAC GGCACGGCCA AGGACCCGGC GCGCGCCTTC
GCACTGGCCA TCGAAGCCGC CCGACAGGGA TTGCCCGATG CACAAAGGCT TGCCGGGACC
ATGTACACAC TCGGACTGGG TACCGCCCGC GACCTTGAGC AGGGAATGCG CTGGCTGCGC
GAAGCCGCCG ACGCAGGCGA CGGTGAAGCC GCAGCCATGG TAGCCGACTA TTACCGGCAA
GGCCTCGGGG TGGAACGGAA CGACACGGAA GCCTTCCTCT GGACCCACCG CGCAGCCGAA
CGCGGCGTGA AGCGTGCCGC GCTATGGCTT GGCCTGCACT ACCACTACGG TGTAGGCACT
CCTGCAGACC AGAAACAGGC CTTCGCCCTT CTCAGACCTT TCGCAGACGA AGGCGATGCA
GAGGCGCTGA CCATCATCGC CCTGCTACTG CACAAGGGGG AAGGCGTCGC ACAGGACAGG
AAGGCGGCCC TTCGCTATTT CGAGAAGGGA GCAGCAGCGG GGGACCCGGT GGCACAACTC
GACCTCGGGG TGCTGTACCA TCAGGGTGAC GGCGTCACGC GTGACATGGA CAAGGCGCGC
GGCCTCTTCC GTCAATGCGC GGAAGGCGGC AACGCCCGGT GCATGACCCT GTACGCCAGC
ATGCTCGAAG AAGGGGAAGG CGGACCGTCA GACCCGGCTG AAGCCCTCGC ATGGTACATG
GTCGCGTCCA TGGCCGGAGA TGATGACGCA ACGCCGTTCC TTGATGACCT GCGAAGCCGC
ATCACCGCGG AACAGGCAGC ACAGGGCAAG GTCAGGGCCA CCGCCATCAT CAGGTCGTTG
ACCGCAAGCC CCGCCGCTGA CGAAGCGTCA CGGCAATCAT CCGGGCAGAC TGCGGGCCAA
GCATCGGGCC ACGCTTCGGA CCAGACGCCA GAGAGTACCT CGGCGCGGAC ATCGGCCCCG
GTGCCATCTG CCCTTCCCGG CGAGGCGGCA GGCAAGCGTC CGACCGTACT GCCCTGA
 
Protein sequence
MSRTIAETPR PAVASMSSGC IPPHHRPDTL KQRWGTPTAV CLLVTCLFAM TLLTTGCKPP 
RRTPPTLPDR PDMATVEQPV VPPATVDGKD GKDLDAPQTP KERLALALRL LDDGGDGADP
TQAVQLIEQD ATTGYAPAQD LLARLSLEGY GTAKDPARAF ALAIEAARQG LPDAQRLAGT
MYTLGLGTAR DLEQGMRWLR EAADAGDGEA AAMVADYYRQ GLGVERNDTE AFLWTHRAAE
RGVKRAALWL GLHYHYGVGT PADQKQAFAL LRPFADEGDA EALTIIALLL HKGEGVAQDR
KAALRYFEKG AAAGDPVAQL DLGVLYHQGD GVTRDMDKAR GLFRQCAEGG NARCMTLYAS
MLEEGEGGPS DPAEALAWYM VASMAGDDDA TPFLDDLRSR ITAEQAAQGK VRATAIIRSL
TASPAADEAS RQSSGQTAGQ ASGHASDQTP ESTSARTSAP VPSALPGEAA GKRPTVLP