Gene DvMF_1048 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDvMF_1048 
Symbol 
ID7172944 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio vulgaris str. 'Miyazaki F' 
KingdomBacteria 
Replicon accessionNC_011769 
Strand
Start bp1273441 
End bp1275138 
Gene Length1698 bp 
Protein Length565 aa 
Translation table11 
GC content70% 
IMG OID643539555 
ProductSel1 domain protein repeat-containing protein 
Protein accessionYP_002435471 
Protein GI218886150 
COG category[R] General function prediction only 
COG ID[COG0790] FOG: TPR repeat, SEL1 subfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones84 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACACAG CCCGCCGCAT CGCCGCGCGT TTTCGTCCGT CCCCTGTTCC GCAAACCTCC 
AACCGGCAGC GCTCTCCGCG CCGCGCGGCA CGCCTGGCGC TTGTCGCCCT GCTGGCGGCT
GGCCTGCTGC TGGGCGGATG CAAGCCCCCC AAAAAGGGGA CGGCTGCCCT GCCGGAACGC
CCTACCTACA TCCGCGAGGC CACGGACCAG GAAACTTCCG CGCCCGATGC GGGCACGAGC
GCCACGGGCA CGACCGGGGC CACCACGCCG GATACCGGCG CCGCACCGGC TGATACCGGG
GCCAACACCG CGCAGCCTGC CGCCCCCGAT CTTGTGGTCG CCCCTGACAG CATGGCCAAC
AACCGCCAGC CCGAACCCAC GCCGCAACCG CAGAACCGCG CTGCCGCCCC CGTGGGTGAA
CTGGTGGAGC TTTCCGACGG CACCAGCATG GAGGCCCCCA CTCGCCCGCA GGACCAGTTT
GCCTTTGCCA TGGCCCTGTT GCAGGGGGCC GACGGCCAGC CCGATCCGGC CCGGGCCGCA
ACGTGGCTGG AAAAATCCGC CGATCAGGAC TTTGGCCCGG CGCAAGACGT GCTGGCGCGC
CTGTACCTGG ACGGCACCGG CGTGCGCAAG GACGAGGCCA AGGCCTTTGC GCTGGCCATG
TCCGCCGCCG AGCAGGACAT CATCAACGCG CAGGCCCTGG TGGGCGTGCT GTACACTTAT
GGACGCGGCA CCCGGCGCGA CTTCATGCAG GGCGAGAAAT GGCTCTCGCT GGCGGCGGAG
CGGGGCCACC CGCAAGCGTG CGACCTGCTG GCCGAATACC ACCGCAAGGG GTTGGCCGGG
CCGGAGAACC AGGAAGAAGC CTTCCGCTGG ACGGAACGCG CCGCCGCCCT TGGGGTGGTG
CGCGCACGCT TCTGGCTGGG CGTGCACTAC CGCTACGGCA TGGGCACCCC GCGCGACGAC
GCCAAGGCCC TGCACCTGCT GCGCGAAGCC GCCGACGCGG GCAACCCCGA CGCCATGGGG
CTGGTGGCCG AAATGCTTTA CCGGGGCCAG GGAAGCGAGC CGGACATGGC TGGCTCCGTG
CGCTACTTCC AGATGGGCGC CAAGGCGGGC GACCTGCACT CGCTGCTCAA CCTGGGCATC
CTGCATCACG AAGGCACAGG CGTGCCCAAG GACTACCCGC GCAGCCTGCA ACTGTTCGGC
CAATGCGCCG AGGGCGGCCA CCCGCGCTGC ATGACCTTGC TGGGCAGCAT GCTGGCGGAA
GGGGAAGGAG CCGAGGCGGA CATGGTCACC GCCCATTCCT GGCTGACCCG TGCCGTGCTG
TTCGGCGACG GCGACGCGGC TTCGGTGGCA GCCGAGGTGC AGCAACGCAT GACGCCCGAC
CAGTTGGTGC ATTCCAAGAA CATGGCCGCG CAGTGGATGC AGGCCCACCC GCAGTTCCAG
CCCGGCGTGC CCGCGCAACT CGAACCGGAG GCATCCACGG CGGTTCCGCA GGCCGCGCGA
CAGGAAGCCT CGCAATCCGC ACCGGGAGCA AGCGCCGACA CCGGCACGAC GGACATGACC
GCCACACCGC AGGGTTCCTC CAACGCGACG GCCCCTGTTA CCCAGACCGA CAAGACCGGC
AAGACGGGCA CCACCACTGC GGCCCCGGCG AAGAAGAAAG GCACCAAGGC CGCCAAGGGC
GCCCGCACCA CCAACTGA
 
Protein sequence
MNTARRIAAR FRPSPVPQTS NRQRSPRRAA RLALVALLAA GLLLGGCKPP KKGTAALPER 
PTYIREATDQ ETSAPDAGTS ATGTTGATTP DTGAAPADTG ANTAQPAAPD LVVAPDSMAN
NRQPEPTPQP QNRAAAPVGE LVELSDGTSM EAPTRPQDQF AFAMALLQGA DGQPDPARAA
TWLEKSADQD FGPAQDVLAR LYLDGTGVRK DEAKAFALAM SAAEQDIINA QALVGVLYTY
GRGTRRDFMQ GEKWLSLAAE RGHPQACDLL AEYHRKGLAG PENQEEAFRW TERAAALGVV
RARFWLGVHY RYGMGTPRDD AKALHLLREA ADAGNPDAMG LVAEMLYRGQ GSEPDMAGSV
RYFQMGAKAG DLHSLLNLGI LHHEGTGVPK DYPRSLQLFG QCAEGGHPRC MTLLGSMLAE
GEGAEADMVT AHSWLTRAVL FGDGDAASVA AEVQQRMTPD QLVHSKNMAA QWMQAHPQFQ
PGVPAQLEPE ASTAVPQAAR QEASQSAPGA SADTGTTDMT ATPQGSSNAT APVTQTDKTG
KTGTTTAAPA KKKGTKAAKG ARTTN