Gene Daud_0074 details

Gene Information

Locus tag: Daud_0074
Symbol:
ID: 6025513
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Desulforudis audaxviator MP104C
Kingdom: Bacteria
Replicon accession: NC_010424
Strand:
Start bp: 78456
End bp: 79919
Gene Length: 1464 bp
Protein Length: 487 aa
Translation table: 11
GC content: 62%
IMG OID: 641592927
Product: MazG family protein
Protein accession: YP_001716274
Protein GI: 169830292
COG category: [R] General function prediction only
COG ID: [COG3956] Protein containing tetrapyrrole methyltransferase domain and MazG-like (predicted pyrophosphatase) domain
TIGRFAM ID: [TIGR00444] MazG family protein
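
The coordinate and length fields above are mutually consistent, which can be checked with a small sketch. It assumes 1-based, inclusive coordinates and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length(78456, 79919)
print(length)                  # 1464
print(protein_length(length))  # 487
```

Both values agree with the Gene Length and Protein Length fields of this record.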


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGGCGACTG GCGTGTTTAT TGCCGGATTG GGGCCGGGGG CCGTCTCGGA TGTTCCGGCG 
GGTCTGGTGG GGGAGTTGGC CCGTTGCGAC CGGGTATTTC TGCGCACCGC CCGGCACCCA
GTGGTGCCCT GGCTTCTGGA GCAGGGTCTC CGGTTTGAAG CCTTTGACCG GTACTACGAG
GACGGTGACA CGTTCGAGGC AGTGTACCGG AGAATCGCCG CGACGGTGAT CGACGCGGCG
CGGCGGGAGA CGGTGGCTTA CGCCGTACCC GGGCATCCTC TGGTGGCTGA GGAGGCGACC
CGCTTGATTC TGGATGGAGC CCGGCGGGCC GGACTTGAAA CCCGGGTGCG GCCGGCCTTG
AGTTTTCTGG ACGCTCTGTT CGCGGCCTTG CGCCTGGACC CGGCCGGGGG GCTTTTGATT
GTGGATGCCT TGCAGCCGGC GCTTTTTGAG CCGGTACAGG CCAAGGGAGT GGTGCTGGTC
CAGGTTCACG ACCGGATGGT GGCCGGTGAG GCCAAAATCC ACCTGATGGC CTACTATGCG
GACGAACAAC CTGTGATGGT CGTCCGGGCG GCCGGCGTCC CCGGACTGGA ACGGATCGAG
GAAGTCCCGC TTTATGCCCT GGACCGCCTG GACTGGGTGG ACCACCTGAC CTCCGTGTAC
ATCCCGCCCG TTGCTCAATT CCGCCGGACC TACGACTTGA CCCCCCTGGT GGGCATCATG
GCCACACTCC GCGGCAAGGA CGGTTGCCCC TGGGACCGGG AACAGACCCA CCACTCCATC
GGGCGTTACC TGATTGAGGA AGCATATGAG GTTCTGGACG CCATAGAACG GGAAAACGTG
CATAGTCTCT GTGAAGAATT GGGAGACTTA TTGTTGCAGA TCGTTTTCCA CGCCCAGATG
GCCAGAGAAG AGGCAAGTTT CGACATGAAC GACGTAGTCC GGACGGTATG CGACAAGATG
GTCCACCGTC ACCCCCACGT GTTCGGCCGG GAAAAAGTGC GGGATGCGGA TGAGGTGCTG
ACCAACTGGG AAAGGCTTAA ACGCAAGGAG AAGCAAGACG CCGGTGAGTC TTGCCTGGCG
GGGGTTCCCC GGAGATTGCC GGCACTGCTC CGGGCCGACC GCGTGCAAGA GAAAGCCTCC
CGCCTGGGGT TTGACTGGCC GGATCACCAC GGGGCGCTGG CCAAGCTGGA AGAAGAACTG
GGCGAGTTTC GGGAAGCGCT GGACGCCGGG GACCGGGACC GGATCATTGA GGAACTTGGG
GACCTTCTGT TTGCTACGGT AAACGTGTCC CGCCTGATTA AAGTCGATGC GGAGGAATGC
CTGCGCCATA CGGTGGACAA GTTCGCCCGC CGCTTTGCGC AAATCGAGGA AAGATGCCGG
GTGTCGGGGC GGAAACCAGA CCAGGTCTCG CTTTCGGAGA TGGATGACTG GTGGGAGGAA
GCAAAAAAGT TGGAGAAATC CTAG
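
The gene sequence is displayed in blocks of 10 bases, so it has to be joined before any computation. The following is a minimal sketch of how the blocks could be cleaned and how a GC fraction such as the one in the GC content field could be computed. The helper names are hypothetical, and the rounding used by the database for its reported percentage is not known from this record.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# Usage with the first two blocks of the gene sequence.
fragment = clean_sequence("ATGGCGACTG GCGTGTTTAT")
print(len(fragment))  # 20
print(f"{gc_fraction(fragment):.0%}")
```

Passing the full gene sequence through `clean_sequence` and then `gc_fraction` gives a value that can be compared with the GC content field.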
 
Protein sequence
MATGVFIAGL GPGAVSDVPA GLVGELARCD RVFLRTARHP VVPWLLEQGL RFEAFDRYYE 
DGDTFEAVYR RIAATVIDAA RRETVAYAVP GHPLVAEEAT RLILDGARRA GLETRVRPAL
SFLDALFAAL RLDPAGGLLI VDALQPALFE PVQAKGVVLV QVHDRMVAGE AKIHLMAYYA
DEQPVMVVRA AGVPGLERIE EVPLYALDRL DWVDHLTSVY IPPVAQFRRT YDLTPLVGIM
ATLRGKDGCP WDREQTHHSI GRYLIEEAYE VLDAIERENV HSLCEELGDL LLQIVFHAQM
AREEASFDMN DVVRTVCDKM VHRHPHVFGR EKVRDADEVL TNWERLKRKE KQDAGESCLA
GVPRRLPALL RADRVQEKAS RLGFDWPDHH GALAKLEEEL GEFREALDAG DRDRIIEELG
DLLFATVNVS RLIKVDAEEC LRHTVDKFAR RFAQIEERCR VSGRKPDQVS LSEMDDWWEE
AKKLEKS
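
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below builds the codon table from the usual NCBI base ordering (TCAG) and amino acid string, which for sense and stop codons is the same in translation table 11 as in the standard table. It is an illustration only: the `translate` helper is hypothetical, it stops at the first stop codon, and it does not handle alternative start codons (this gene starts with ATG, so that case does not arise here).

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons TTT, TTC, TTA, TTG, TCT, ... in TCAG order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # tolerate block formatting
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last codons of the gene sequence in this record.
print(translate("ATGGCGACTG GCGTGTTTAT T"))     # MATGVFI
print(translate("GCAAAAAAGT TGGAGAAATC CTAG"))  # AKKLEKS
```

The two fragments reproduce the first seven and last seven residues of the protein sequence shown above.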