Gene Daud_2227 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaud_2227 
Symbol 
ID6025784 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Desulforudis audaxviator MP104C 
KingdomBacteria 
Replicon accessionNC_010424 
Strand
Start bp2337654 
End bp2338880 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content67% 
IMG OID641595047 
Producthypothetical protein 
Protein accessionYP_001718346 
Protein GI169832364 
COG category[S] Function unknown 
COG ID[COG1641] Uncharacterized conserved protein 
TIGRFAM ID[TIGR00299] conserved hypothetical protein TIGR00299 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTAACG TGCGCCTCCT CTACTGGGAT TGTTTCGCCG GGATCAGCGG CGACATGGCC 
CTGGGTTCCC TGATTGACGC CGGAGCCTCG CCCGACGATA TTCGGGCTGT TTTGTCCGGA
CTCCCTCTCG GAGGCTGGAC ACTGGAGGTA CGCGAAGAAA AAAGCGGCGG CCTGCGGGGC
ACCGGGGTCA AGGTGCACGT CCGCCGGGAA CAGCCACACC GGCGCCTGCC CGACATTCTG
GCGATCATCC AGGCGGGAGG GCTGCCCGAA CCGGTGGCCA GGAATTCCGC CCGGGTGTTT
ACCCGCCTGG CGCAAGCCGA GGCCCGAGTC CACGGTGTCT CCCCGGACCA GGTGCACTTT
CACGAAACCG GCGGCGTGGA CGCCATCATC GAAATCATCG GCACGGCCGC CGCCCTGCAC
CTGCTGGAGG TGGAACAGGT ACGGCTCTCG CCCCTGCCGC TTTCCAGGGG CTTTGTGCAC
TGCGCCCACG GCACGCTTCC GGTGCCGGCC CCCGCCGTGC TCGAACTCAT CCGCGGCTTT
CCCACCCGGC CCGCCGAGGT CGAGGGCGAA CTGGTCACGC CGACGGGCGC GGCCCTAGCG
GTCACCCTGG CTTCCACGGC CGGCGCCTAC CCGGGTCTGT TCATCGAAGG GATTGGTTAC
GGGGCCGGAA CCTCGGCCTT CCCTTTCCCG AACCTGCTGC GGGCGGTGCT GGGCCGGGTT
GAGTCACCGG CAGGGGAACA CCCCCCGGCA CGGGAGCAAA TCACCGTCCT GGAAACGGTG
CTGGACGACG TTAATCCCGA ACACCACCCC TATGTCCTGG AGCAGCTCCT GGCCGCCGGA
GCCCGGGACG CTTACCTGAC CCCCCTGATC ATGAAGAAGG GCCGCCCGGG GGTAAAGCTG
ACGGTAATAG CCGGTGCGGA TGAATGGAAT TCGCTGGTAA TGGTTATACT GAAGGAAACA
GGCACACTGG GTATCCGCGT GCGCCGCGAG GACCGGGTTG TTCTGGACCG CCGGGTGCTG
GTGGTGGATA CGGACTATGG CCCGGTGCAG GTGAAGGCAG CCCTGCTGGA CGGCCGCATG
CTGCGAGTGA AACCGGAATT CGAGGATTGC CGCACCCTGG CGCTAAAGCA CCGCGTCCCG
GTGCGCATCG TCACGGCCGC GGCCGAACGG GCGGCGGAGG CCCTGTTCGG AGAAGAGGGC
TTGCCCGTTG AGCAGCAGCA AGAATAG
 
Protein sequence
MTNVRLLYWD CFAGISGDMA LGSLIDAGAS PDDIRAVLSG LPLGGWTLEV REEKSGGLRG 
TGVKVHVRRE QPHRRLPDIL AIIQAGGLPE PVARNSARVF TRLAQAEARV HGVSPDQVHF
HETGGVDAII EIIGTAAALH LLEVEQVRLS PLPLSRGFVH CAHGTLPVPA PAVLELIRGF
PTRPAEVEGE LVTPTGAALA VTLASTAGAY PGLFIEGIGY GAGTSAFPFP NLLRAVLGRV
ESPAGEHPPA REQITVLETV LDDVNPEHHP YVLEQLLAAG ARDAYLTPLI MKKGRPGVKL
TVIAGADEWN SLVMVILKET GTLGIRVRRE DRVVLDRRVL VVDTDYGPVQ VKAALLDGRM
LRVKPEFEDC RTLALKHRVP VRIVTAAAER AAEALFGEEG LPVEQQQE