Gene Daud_0918 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaud_0918 
Symbol 
ID6027243 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Desulforudis audaxviator MP104C 
KingdomBacteria 
Replicon accessionNC_010424 
Strand
Start bp979709 
End bp980746 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content64% 
IMG OID641593730 
Productaminodeoxychorismate lyase 
Protein accessionYP_001717063 
Protein GI169831081 
COG category[R] General function prediction only 
COG ID[COG1559] Predicted periplasmic solute-binding protein 
TIGRFAM ID[TIGR00247] conserved hypothetical protein, YceG family 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.174013 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGACCTTCC GGCAGACAGT CATCGACCGT TCCCTGTACA TTCTTTTGGG GCTCCTGGGA 
TCGGCCGCTT TTCTGATCGG CGCCGCCTGG ATCACCGCCA ACGCGATGCT GGCGCCGGTT
TGCGACCGGG AAACGGATCC GGTGCTGGTG GAGATCCCGG CCAGGGCAAG CACCGGGCAG
ATCGGCGCCA TCTTGGCCGA CAAGGGCCTA ATCCGGAACG CCACCGCTTT CCGGTTGTAT
GCCCGGTTTC GGCGACTGGA TGCCGTCCTG AAAGCGGGAG AGTACGAGCT TTCCCCGTCT
CTGTCCACCC CGGAGATCAT TGAGATTCTG GCCCAGGGCC GGGCCAGGCT CGTGGCGTTC
ACCATCCCCG AGGGGCTGAC CTTGAAGCAG ACCGCCGTTT TGCTGGCAGA CCGCGGGTTC
GTGGACGCCG ATGTCTTTAC GCGGCTCCTG GACGAGAAGG CGGCGTCTCA TCCGCTGTTG
TCCGGCCTGC CGGAGGAGCA ACGCTCGCTG GAAGGCTACC TTTTCCCGGA CACCTATATG
ATTTCCATCG GGACCAGCGA AGAACAGATC ATCCGGCTTC TGCTCGCCCG TTTCGAAGAG
GAAACTGCCC GCCTGGATCT GGAGCGCCGG GCCGCGGCAC ACGGTCTTAA TCTGCACGAA
GCGGTGACCC TTGCCTCCCT GATCGAGCGT GAGGCACGCG TGGCTGAAGA GCGCCGGGTG
ATTTCCGGGG TGCTCCACAA CCGGCTCAAG CGGAATATGC TCCTGCAGGT TGACGCCACC
ATCATCTACG CGCTGGGCGA CTTCGACCGC CAGGTGGTGC TGTACCGCGA CCTGGAGGTT
GACTCCCCCT ACAACACCTA CCGGTATTCC GGCCTCCCCC CGGGTCCCAT CGCCAGCCCG
GGCCGGGACT CCCTGATTGC CGCGGTGGAC CCCGACCAAC ACGACTACCT CTACTACGTC
GCCAAACCCG ACGGCACCCA CGCCTTTTCC CGCACCCTGG CCGAGCACAA CGCCAACAAG
CGGCGGTACC TGCCCTAG
 
Protein sequence
MTFRQTVIDR SLYILLGLLG SAAFLIGAAW ITANAMLAPV CDRETDPVLV EIPARASTGQ 
IGAILADKGL IRNATAFRLY ARFRRLDAVL KAGEYELSPS LSTPEIIEIL AQGRARLVAF
TIPEGLTLKQ TAVLLADRGF VDADVFTRLL DEKAASHPLL SGLPEEQRSL EGYLFPDTYM
ISIGTSEEQI IRLLLARFEE ETARLDLERR AAAHGLNLHE AVTLASLIER EARVAEERRV
ISGVLHNRLK RNMLLQVDAT IIYALGDFDR QVVLYRDLEV DSPYNTYRYS GLPPGPIASP
GRDSLIAAVD PDQHDYLYYV AKPDGTHAFS RTLAEHNANK RRYLP