Gene Daud_1036 details

Gene Information

Locus tag: Daud_1036
Symbol:
ID: 6026862
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Desulforudis audaxviator MP104C
Kingdom: Bacteria
Replicon accession: NC_010424
Strand:
Start bp: 1086443
End bp: 1087699
Gene Length: 1257 bp
Protein Length: 418 aa
Translation table: 11
GC content: 65%
IMG OID: 641593848
Product: homoaconitate hydratase family protein
Protein accession: YP_001717180
Protein GI: 169831198
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0065] 3-isopropylmalate dehydratase large subunit
TIGRFAM ID: [TIGR00139] homoaconitase
            [TIGR01343] homoaconitate hydratase family protein
            [TIGR02086] 3-isopropylmalate dehydratase, large subunit
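
The Start bp, End bp, Gene Length, and Protein Length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the coordinates are 1-based and inclusive and that the CDS is complete with a single terminal stop codon. The function names (gene_length, protein_length, gc_percent) are hypothetical helpers written for illustration and are not part of any database API.

```python
# Hypothetical helpers; the two coordinates are copied from the record above.
START_BP = 1086443
END_BP = 1087699


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming a complete CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """GC percentage of a nucleotide string; spaces and newlines are ignored."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotides found")
    return 100.0 * sum(b in "GC" for b in bases) / len(bases)


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # prints "1257 418"
```

Because gc_percent skips whitespace, the gene sequence can be passed to it exactly as it is laid out on this page, in space-separated blocks of ten bases, to compare against the reported GC content.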


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGTCTGA ACATCATTGA GAAAATACTA TCCCGTAAGA GCGGACGCGC AGTGCGTGTC 
GGGGACATTG TGGTGGCCGA AGTTGATTTC GTGATGGGCC AGGACGGTAC TTCGCCCCTG
GCCATCAACG CGTTCCGCGC GATGGGGGGG AAGGCCCTGT TCGACCCGGC GAAGGTGGCG
CTGGTGATCG ACCACAGCGC GCCCAGCCCC TCCGAAGGTG TCTCCAACCT GCATAAATTC
ATGCGGAACT TCGCCCGGGA AACCGGGTGC CACCTGTACG ACGTAGGTGA CGGGGTGTGC
CACCAATTGA TGCTGGAATC GGGAGCGGTC GGGCCGGGTT CCTTAGTCGT TGGCGCCGAC
TCGCATACCT GCACCTACGG AGCGTTGAAC GCCTTCGCCA CCGGGGTGGG GTCGACCGAT
CTCGCCGCCG CCCTGATCTC CGGCCGGATG TGGTTCAAGG TGCCGCCCAC GCTGAAGTTC
GTTTGCCACG GGATACTGCC TCCCGGAGTG TACAGCAAGG ACCTGATCCT ATTTCTGATC
GGACAGGTCA CCGCCGACGG GGCCACCTAC ATGTCGGTCG AGTTCAGCGG AGAGGCGGTC
CGGGCACTGT CGATGGAGGC CCGGTTCACC ATCTCGAACA TGGCCGTGGA GATGGGCGCG
AAGGCGGGGC TGATGGAGGC GGATGAGCGG ACGGCCGAAT GGGTAGCCCG GTACAGCCGG
CGCACGTTCG AGCCGGCCGC ACCCGATCCG GATGCGGTTT ACGAACGAAT ACTGGAGTTT
GACGTGTCCG GGTTGGAGCC GCAAGTCGCC CGGCCCCACC GGGTGGACAA CGTGGCTCCG
CTCAGTGAGG TGGCCGGCTT GCCGGTGGAC GAGGCCGTTC TGGGGACCTG CACGAACGGG
CGTCTGGAGG ACTTGCGGAT CGCCGCGGCT CTCGTCAGGA GGTACCGGGT GTCCCCGAGG
GTGCGTTTCA TTGTGGCGCC CGCCTCGCGG CACGTGTACC TGCAGGCGCT GCGGGACGGC
ACCCTGGAGA CGCTGGTGCA GGCCGGGGCC GCGGTGGTCA CTCCGGGCTG CGGCCCGTGT
GTGGGTACAC ACAACGGCGT GCCCGCCGAT GGCGAACGGG TGATCTCGAC CGCCAACCGG
AACTTCAAAG GCCGCATGGG GAACCGGAAC GCGGAGATTT ACCTGGCTTC ACCAGCGGCG
GTGGCCGCGG CGGCCCTGAC CGGTGAGATC ACCGACCCCA GAGAATTCTT GAAGTAG
 
Protein sequence
MGLNIIEKIL SRKSGRAVRV GDIVVAEVDF VMGQDGTSPL AINAFRAMGG KALFDPAKVA 
LVIDHSAPSP SEGVSNLHKF MRNFARETGC HLYDVGDGVC HQLMLESGAV GPGSLVVGAD
SHTCTYGALN AFATGVGSTD LAAALISGRM WFKVPPTLKF VCHGILPPGV YSKDLILFLI
GQVTADGATY MSVEFSGEAV RALSMEARFT ISNMAVEMGA KAGLMEADER TAEWVARYSR
RTFEPAAPDP DAVYERILEF DVSGLEPQVA RPHRVDNVAP LSEVAGLPVD EAVLGTCTNG
RLEDLRIAAA LVRRYRVSPR VRFIVAPASR HVYLQALRDG TLETLVQAGA AVVTPGCGPC
VGTHNGVPAD GERVISTANR NFKGRMGNRN AEIYLASPAA VAAAALTGEI TDPREFLK