Gene Daud_1039 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaud_1039 
Symbol 
ID6026506 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Desulforudis audaxviator MP104C 
KingdomBacteria 
Replicon accessionNC_010424 
Strand
Start bp1089611 
End bp1090852 
Gene Length1242 bp 
Protein Length413 aa 
Translation table11 
GC content61% 
IMG OID641593851 
Productglycosyl transferase, group 1 
Protein accessionYP_001717183 
Protein GI169831201 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGATGCTTT CATGGGAGTA CCCTCCAAAG ACGATCGGCG GCCTGGCCCA ACACGTGTAC 
GATCTAAATG CAGCCTTGAG TCGGGAGGGC GTAGAAGTTC ACCTGTTAAC CTGCTCGGCT
CCGGGGGCGT CCGACTACGA GATGCAGGGA AACATTCATA TCCACCGCGT GCACCCCTTC
CAGGTTTCGG CGCCGGACTT CGTGACCTGG GTGTTGCAGT TCAACAACGC CATACTGGAA
CGGGCGATCA GCCTGTTCGA AAGGGTGGGC GCCTTCCGGG TGGTCCACGC CCACGACTGG
CTGGTGGCCT TTGCGGCCCG GGCAGTCAAG CACGCCAGGC ATCTCCCGCT GGTGGCCACA
ATTCACGCTA CCGAATTCGG CCGGAACCAG GGACTGCACA ACGCGACCCA AAACTACATC
AGCAACGTGG AGTGGTGGCT GACGTTTGAA GCGTGGAAGG TGATCGTGTG CAGCAGGTAC
ATGGAGAATG AACTCAAGTA CATCTTCCAG CTCCCGGCGG ACAAGATCCG GGTGATTCCC
AACGGGGTGG ATCCGGAGAA CTACAGGCTG CGTTCGGACC GGGTCAAGCG CAGCTTCTAC
GCGGCGCCGG AGGAAAGAAT CGTGTTCTAC GTCGGCCGCC TGGTCCAGGA AAAGGGGGTG
CAGGTGCTCT TGGACGCCGT GCCTCAGATT CTTGCGCGGA TGCCCAACAC CAAGTTTGTC
ATCGGCGGTA AGGGGCCGCA CCTGGAAGAA TTGCGGGCCC AGGTGGACAG AATGGGTATC
GCGCCGCGCA TCTACTTCAC CGGCTACATC GACGACGAGG TCAGGAACGC GCTTTACCAC
TGGGCCGACG TGGCGGTGTT CCCGAGCCTA TATGAACCGT TCGGTATCGT GGCCCTGGAG
GCGATGGCGG CCAAGACGCC GGTGGTGGCC TCCAATACCG GGGGTTTGAG CGAGATCATT
GAGCACGGCC TGGACGGCTT CAAGGTGCCG CCGGGGGACA GTCGGGCATT GGCCGAGCAC
ATTCTCCTGG TGCTTCAAAA CCCGGCCCAG GCGAAAATGC TCCATGAACG CGCTTTCCGG
AAGGTGCGGG AACAGTACGG TTGGAGGAAA GTCGCCCGCG AAACCGCCCG GCTGTACCGG
GAGGTCTGGA GCGAACGCCA GTCCGCGCCG TGGCCGACCC TTGAAGACCG GCCCGGACGG
ATCCTCGGCC GGGTGTATCA GCTCTTCGAA CGCTATTCCT AA
 
Protein sequence
MMLSWEYPPK TIGGLAQHVY DLNAALSREG VEVHLLTCSA PGASDYEMQG NIHIHRVHPF 
QVSAPDFVTW VLQFNNAILE RAISLFERVG AFRVVHAHDW LVAFAARAVK HARHLPLVAT
IHATEFGRNQ GLHNATQNYI SNVEWWLTFE AWKVIVCSRY MENELKYIFQ LPADKIRVIP
NGVDPENYRL RSDRVKRSFY AAPEERIVFY VGRLVQEKGV QVLLDAVPQI LARMPNTKFV
IGGKGPHLEE LRAQVDRMGI APRIYFTGYI DDEVRNALYH WADVAVFPSL YEPFGIVALE
AMAAKTPVVA SNTGGLSEII EHGLDGFKVP PGDSRALAEH ILLVLQNPAQ AKMLHERAFR
KVREQYGWRK VARETARLYR EVWSERQSAP WPTLEDRPGR ILGRVYQLFE RYS