Gene Daud_2049 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaud_2049 
Symbol 
ID6026131 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Desulforudis audaxviator MP104C 
KingdomBacteria 
Replicon accessionNC_010424 
Strand
Start bp2160643 
End bp2161860 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content63% 
IMG OID641594870 
Productputative stage IV sporulation YqfD 
Protein accessionYP_001718171 
Protein GI169832189 
COG category 
COG ID 
TIGRFAM ID[TIGR02876] sporulation protein YqfD 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTTTGTAT TGAAGATTCT GGCTTACCTG TTCGGCTACG TAACCATGGT GGTCAACGGG 
CGGACCTTGG AGCGGTTTAT CAACCTGGCG ACCAGAAGGG GCATCTATTT CTGGGATATC
CGGCGGCTGG ACGAGGACCG GATCCTGATT AAGGCCCGGT TGAGCGGCGT CAAACCTCTG
CGTCACATCG CCCGGGACAC GGGCAGCCGG TTCAGAATCG AGAGGCGGTT CGGTCTGCCC
TTCTTCCTGG GCCGGGCCCG GCGCCGGAAG ACGCTGGCCC TGGGCGTGGT CTTTTTCCTG
ACCGCGCTGT ATGTTCTGTC TTCTTTTGTG TGGTTCGTCG AGGTTGAGGG GAATGTTAAG
GTCGGGGAGC ACGAAATTCT CCAAGCCGCC CGGGAAGCCG GTTTTTACCG CGGCGCACCC
AAGTGGCGTT TCGAGGTGTT CGCGGTGGAA GAGAAGATCA AGGAGCGGCT CCCCGCGGTG
GCCTGGGTCG GCCTGGAAAT CAGGGGTACG CGGGCGCTGA TCACGGTGGC CGAAAAGAAA
TTCCTGCCGC CCGACCACGA CGGGCCCTCC GATATCCTGG CGGCCAAGGC CGGCCTGGTG
CGGGATGTAC TGGTCTTAAG CGGCCAGGCG GTCGTCCGGG AGGGGGATAC GGTTTTCCCC
GGACAGCTCC TGATCTCGGG GGAAATCTGG CCCCCGGAGG CGCTTGATCC CCAGGGGGCG
ATTCTTAACG TTGAGCCTCG CCTCGTAAGG GCTCGGGGGA TCGTCAACGC CCGGGTCTGG
TACGAGGCCT ACGGGGAATC GGCCCTTGTG GAGCATGCGC AAAGACCCAC CGGCCGGGAA
GAACGGCGCG TGGCCGTACG GCTCATGGGA CGGCAGGTGG TGGTCTCCGG ACCCCGCAAA
GACCCTTTTC CGAGCTTTGA AGCGTCGGAA ACGGTATGGA CGGGGCCGGG CTGGAGGAAT
TACACCCTGC CTGTCGAAGT AATCACCACG GTTTTCCGGG AAACGAGGCT CCACCTGGTG
CGCCACAGGC GGGAGGAAGC GCTGCGGCTG GCCACCGAAC AGGCCCGCGA GATACTGAAG
GAACGGGTGC CGCCGGAGGC GGAGGTGCTG CAACAGCGGG TCGACCTGGT GGACACCGGG
ACCGCCGAGG AGTTGATTAG GGTCCGGATC ATCGTGGAAA CCCTGGAGGA CATCGGCGTG
GAAAAACCGC GCTCGTAG
 
Protein sequence
MFVLKILAYL FGYVTMVVNG RTLERFINLA TRRGIYFWDI RRLDEDRILI KARLSGVKPL 
RHIARDTGSR FRIERRFGLP FFLGRARRRK TLALGVVFFL TALYVLSSFV WFVEVEGNVK
VGEHEILQAA REAGFYRGAP KWRFEVFAVE EKIKERLPAV AWVGLEIRGT RALITVAEKK
FLPPDHDGPS DILAAKAGLV RDVLVLSGQA VVREGDTVFP GQLLISGEIW PPEALDPQGA
ILNVEPRLVR ARGIVNARVW YEAYGESALV EHAQRPTGRE ERRVAVRLMG RQVVVSGPRK
DPFPSFEASE TVWTGPGWRN YTLPVEVITT VFRETRLHLV RHRREEALRL ATEQAREILK
ERVPPEAEVL QQRVDLVDTG TAEELIRVRI IVETLEDIGV EKPRS