Gene Daud_0439 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaud_0439 
Symbol 
ID6026375 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Desulforudis audaxviator MP104C 
KingdomBacteria 
Replicon accessionNC_010424 
Strand
Start bp478741 
End bp479718 
Gene Length978 bp 
Protein Length325 aa 
Translation table11 
GC content68% 
IMG OID641593279 
Producthypothetical protein 
Protein accessionYP_001716617 
Protein GI169830635 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism
[R] General function prediction only 
COG ID[COG0697] Permeases of the drug/metabolite transporter (DMT) superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGCGCCG CGCCAACCGC CACCGGCGAT TCCAACCCGA CCGGAAGGGA AAGCTGGTCC 
GCCTATCTCG ACTTGCTGGT GGTGTACGTC GTCTGGGGCA GCACCTACCT GGCCATCCGG
GTGGCCGTGG GCGAAGGCGG CGGTTTTCCC CCTTTCATCA TGGGGGCCAT GCGCGTGCTG
CTCGCCGGGG TTTTGCTGCT GCTCTGGGCC GGCCTGACCC GGAGCCGCCT CCGGCCGACA
AAGGAAGAAC TGCTCGTCCT CGCGGTCTCG GGCATCCTGC TTTGGACTGG GGGAAACGGG
CTGGTAATGT GGGCCGAACA GCGGGCCGAT TCCGCCTACG CCGCCCTTCT GGTCGGTTCA
ACGCCCATCT GGGTAGCCGT CATGATGGCC TTTCTGGACC GCAGACCGCC CTCCTGGCTC
CTGGCCGGTT CACTGCTGAT CGGCTTCGCC GGCCTCGTCC TGTTGACCGC CCCCGTACTG
GCGACCGGTA CGCGGGCCGA CACCCTGGCC GTCATCGCCC TGCTGGCCGC ACCGGTCAGT
TGGGGGATCG GCTCCCTGAT CCAGAACCGG AGACCGGTGG GCATGAGCGT CATCGCCAGT
TCCGCCTACC AGCACCTCTT CGGCGGAGCC GGTTTTCTTG TGCTCGTCCT GATCTTCAAC
GAGCCGCTGC CGAACCCCAC GCCCGCCGCC TGGTGGGCCT GGGGCTACCT CGTGGTTTTC
GGGTCCCTTC TCGCCTTTAC CTCGTTCGTG CGGGCCCTGC GCAGTTTGCC CACGAACATC
GTCATGACCT ACGCCTACGT GAACCCGGTC ATCGCGATGA TCCTGGGCCG GCTGATTCTG
AACGAGGCGA TCACCGCGTG GACCGTCGGG GGCGCGGCCC TGATCCTGCT CGGGGTCGCC
GGAGTCTTCC GGGACCGCTA CGTCAACGGC ACGGCAACCG CGCCGCGCGC CCTCAAGGCA
ACCGGCCCGG CCCGGTGA
 
Protein sequence
MSAAPTATGD SNPTGRESWS AYLDLLVVYV VWGSTYLAIR VAVGEGGGFP PFIMGAMRVL 
LAGVLLLLWA GLTRSRLRPT KEELLVLAVS GILLWTGGNG LVMWAEQRAD SAYAALLVGS
TPIWVAVMMA FLDRRPPSWL LAGSLLIGFA GLVLLTAPVL ATGTRADTLA VIALLAAPVS
WGIGSLIQNR RPVGMSVIAS SAYQHLFGGA GFLVLVLIFN EPLPNPTPAA WWAWGYLVVF
GSLLAFTSFV RALRSLPTNI VMTYAYVNPV IAMILGRLIL NEAITAWTVG GAALILLGVA
GVFRDRYVNG TATAPRALKA TGPAR