Gene Daud_2042 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaud_2042 
Symbol 
ID6025621 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Desulforudis audaxviator MP104C 
KingdomBacteria 
Replicon accessionNC_010424 
Strand
Start bp2150938 
End bp2152038 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content64% 
IMG OID641594863 
ProductVanW family protein 
Protein accessionYP_001718164 
Protein GI169832182 
COG category[V] Defense mechanisms 
COG ID[COG2720] Uncharacterized vancomycin resistance protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGGTGGT TTCCATTAAT GTTTCTGTTG CTCATTACCC TGTTCCTGGC GGGAGTCGGC 
CCGGCCGGGG CGGCGGTCAT GGTGCCGCTC CGTCCCGTCC TCGAAAACCG GGGGATCACG
ATAAGCTGGG ACGGTGAGGC GGCGACCGGC TACTGCTGCG GCCGGCTGCT GCGGGTGGTC
CCCGGGCGTG ACACGGCTGA GATGGGCGGT AACGAGGTAT TGCTGGATGT GCCGGCGGTG
ATGCTCGGTG GGATGACCTT CGTGCCCGAT ACCCTGCTTG CCGGTTTCTT GGACTCCCCG
CCTCCGGTTG TTGGCTTTCA CGCGTTTACC AGGGTGCGGA TCGGGGAACA GCTCTGGCTT
GTCGGGATGT TGGGTGAATC GCTGGAAGCC GAACCGGAGG ACTTCCGAAT ACCGTCGGAC
CTCGTTCCAG AGGAACCCGA CACGATCATT CCCGAAGCGC CGGGTCCCGC CCGGATCTGG
GTGTTGCCCG GCACGGGGGA AGATATGGTC CTGGAAGGAT TGGTTTCGCT GATGGGGGCT
GAACTCAGTT CAGGTGCACT CGGGTTCACT CTGGCGCCCG GCGCAAGCGG CATACTGCGC
GACGCCCTAA CGCAGAAGTT GGATTCGGTC ACCCTGCCAC TCGAACGCCG GATCGGATCT
TGCCTGGTGA ATTTCAATCC GGGCGGTGGC GACTACAACA ACATGTTGAA CGCGGTGCAG
GCCGCCGCCT ACCTGAACGG AATCACCGTA CAGCCCGGGC AGGTGTTTTC TTACAACCAA
ACGGTCGGGC CGCGGACGGC GGAGCGCGGT TTCGTGATCG GTTACGCCAT CAGCGGGGAT
CGGCACGTGC CGGCGCGGGG CGGGGGAGTA TGCCGCACCT CCACGGTGCT GTACGGCGCC
GTGCTCAATG CCGGCCTGCC TGTCATCGAA CGCCACGCGC ACACGAGGCC GGTGGGGTAC
GTGCCCATGG GCCGGGACGC CACCGTTTCC TACGGCACCG CCGACCTCAA ATTCCGCAAT
GATCTTCCCC GCCCGGTCCG GATCAAGGCC GGCGGCACCG TGCGGCAACT GCAGGTGACT
CTCTGGGAAC TTTGGGGGTA G
 
Protein sequence
MRWFPLMFLL LITLFLAGVG PAGAAVMVPL RPVLENRGIT ISWDGEAATG YCCGRLLRVV 
PGRDTAEMGG NEVLLDVPAV MLGGMTFVPD TLLAGFLDSP PPVVGFHAFT RVRIGEQLWL
VGMLGESLEA EPEDFRIPSD LVPEEPDTII PEAPGPARIW VLPGTGEDMV LEGLVSLMGA
ELSSGALGFT LAPGASGILR DALTQKLDSV TLPLERRIGS CLVNFNPGGG DYNNMLNAVQ
AAAYLNGITV QPGQVFSYNQ TVGPRTAERG FVIGYAISGD RHVPARGGGV CRTSTVLYGA
VLNAGLPVIE RHAHTRPVGY VPMGRDATVS YGTADLKFRN DLPRPVRIKA GGTVRQLQVT
LWELWG