Gene Daud_2020 details


Gene Information

Locus tag: Daud_2020
Symbol:
ID: 6026625
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Desulforudis audaxviator MP104C
Kingdom: Bacteria
Replicon accession: NC_010424
Strand:
Start bp: 2125478
End bp: 2126641
Gene Length: 1164 bp
Protein Length: 387 aa
Translation table: 11
GC content: 67%
IMG OID: 641594842
Product: amidohydrolase
Protein accession: YP_001718143
Protein GI: 169832161
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1228] Imidazolonepropionase and related amidohydrolases
TIGRFAM ID:
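
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the coding sequence ends in a single untranslated stop codon. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2125478
end_bp = 2126641

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon that is not translated,
# so the protein has one residue fewer than the number of codons.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1164 387
```

Under these assumptions the computed values match the listed Gene Length (1164 bp) and Protein Length (387 aa).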


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTGGCCA TTACAAACGG GCGCATTCTC ACCATGGCCG GAAGGGACAT CCCGTCGGGG 
ACGGTGCTGA TCGAGGGCGG GCTGATCAAG GCGGTGGGCG CCGGAATAGG GGTGCCGGAC
GGGGCTGAGG TGCTGGAGGC CGCGGGCAAG CTGGTGCTCC CGGGGTTGAT TGAACCGCAC
TGCCACGTCG GGATTATGGA AGAAATCTTC CGCGAGGAAG GGAACGACGG GAACGAGTAC
TCCGACCCGG TCACGCCCCA GCTGCGGGCC ATCGACGGTG TTTACCCGGA GGATCTCGGG
TTTACCGACG CTCTAGCCGG GGGAGTCACT ACCCTTTGCA CCACGCCGGG CAGCGCGAAC
GTCATCGGTG GCGAGATGGT CATCCTGAAG ACGGCGGGAA AGACGGTGGA ACAGATGCTG
GTCCGCTTCC CGGCCGGCCT GAAAGCGGCC CTGGGTGAAA ACCCCAAACG GGCCTACGGC
AAGGAACGGA AGGCACCGGT GACCAGGATG GCCAGCGCGG CCCTTCTGCG CACCGCCCTG
GTGCAGGGCG CCGAGTACAT CCGCAAGTTG GAGCGGGCCG GAAGGGGCGA CGGCGACCCG
CCGGACAGGG ACCTGAAGTT GGAAGCCTTG GCCCGCGTGC TGCGGCGGGA AATTCCCCTG
CGGGTCCACG CCCACCGGGC CGACGACATC CTGACCGCCG TGCGGATCGC GCGTGAATTC
AACCTCGACC TGGTGATTGA GCACGGCACC GAGGCCGACC GGGTGGCCGA CATGCTGGTG
CAGGAAGACA TCCCCGTGGT CCTGGGGCCG CTCCTGGTGA ACCGGCCGAA GGTGGAAATG
CGGCACAAAT CGCTAGAGAC AGCCGCCCGG TTAGCCGAGG CCGGGGTGCG GTTCGCGGTG
ATGACCGACC ACCCCGCAGT GCCGGTCCAG TACCTGGGGC TTTCGGCGGC CCTGACCGTC
CGGGGCGGAC TCAGCGAGGA GCGCGCCCTG CGGGCGGTGA CCGCGGACGC CGCCGCCGTA
CTGGGTCTCG CAGACCGTTT AGGGACCCTG GAGCCCGGAA AAGAAGCGGA CGTGGTCATA
ATGGACGGGG ACTTCTTCGA CGTCCGCAGC CGGGTGGAAA GGGTGTACAT CAAGGGCCGG
CTCGTCTATA CGGCCGACCG CTAA
 
Protein sequence
MLAITNGRIL TMAGRDIPSG TVLIEGGLIK AVGAGIGVPD GAEVLEAAGK LVLPGLIEPH 
CHVGIMEEIF REEGNDGNEY SDPVTPQLRA IDGVYPEDLG FTDALAGGVT TLCTTPGSAN
VIGGEMVILK TAGKTVEQML VRFPAGLKAA LGENPKRAYG KERKAPVTRM ASAALLRTAL
VQGAEYIRKL ERAGRGDGDP PDRDLKLEAL ARVLRREIPL RVHAHRADDI LTAVRIAREF
NLDLVIEHGT EADRVADMLV QEDIPVVLGP LLVNRPKVEM RHKSLETAAR LAEAGVRFAV
MTDHPAVPVQ YLGLSAALTV RGGLSEERAL RAVTADAAAV LGLADRLGTL EPGKEADVVI
MDGDFFDVRS RVERVYIKGR LVYTADR
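
Both sequences are displayed in space-separated blocks of 10 characters, 60 per line, so the spaces and line breaks need to be removed before the sequences can be used. The following is a minimal sketch of that cleanup, together with a simple GC fraction calculation. The helper names `join_blocks` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def join_blocks(lines):
    """Concatenate sequence lines, dropping the whitespace between blocks."""
    return "".join("".join(line.split()) for line in lines)


def gc_fraction(dna):
    """Return the fraction of G and C bases in an uppercase DNA string."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence, copied from this record.
first_dna_line = "ATGTTGGCCA TTACAAACGG GCGCATTCTC ACCATGGCCG GAAGGGACAT CCCGTCGGGG"

cleaned = join_blocks([first_dna_line])
print(len(cleaned))            # prints: 60 (six blocks of ten bases)
print(cleaned.startswith("ATG"))  # prints: True (start codon)
print(gc_fraction(cleaned))
```

Passing all lines of the gene sequence to `join_blocks` should give a string whose length matches the Gene Length field, and `gc_fraction` on that string can be compared with the listed GC content.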