Gene Daud_2012 details


Gene Information

Locus tag: Daud_2012
Symbol:
ID: 6026620
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Desulforudis audaxviator MP104C
Kingdom: Bacteria
Replicon accession: NC_010424
Strand:
Start bp: 2117565
End bp: 2118581
Gene Length: 1017 bp
Protein Length: 338 aa
Translation table: 11
GC content: 67%
IMG OID: 641594834
Product: metalloendopeptidase glycoprotease family
Protein accession: YP_001718135
Protein GI: 169832153
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0533] Metal-dependent proteases with possible chaperone activity
TIGRFAM ID: [TIGR00329] metallohydrolase, glycoprotease/Kae1 family
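
The coordinate fields in this record can be cross-checked against the reported lengths. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2117565, 2118581)
print(length, protein_length(length))  # prints: 1017 338
```

Under these assumptions the result agrees with the Gene Length (1017 bp) and Protein Length (338 aa) fields.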


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000000649848
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCGTGA TCTTGCTGGG CGTGGAAACC TCCTGCGACG AGACTGCGGC GGCGGTGGTG 
GAGAACGGTC ATCTGGTCCG TTCCAACACC ATCGCTTCGC AGTTTGACCT GCACGGCAAG
TTCGGCGGTG TCGTCCCCGA GGTGGCCTCC CGCCGCCACC TGGAGAGCAT AAACCCCGTC
ATCCGGCAGG CGCTGGAGGA AGCCGATGTC TCTTTCCGGG ATCTCGACGG GGTCGCGGTC
ACCTACGGCC CGGGTCTGGC CGGGTCGCTT TTGGTGGGGT TGATGGCGGC CAAGACCATC
GCCTATGCAC TGGACATCCC CCTGTTCGGC ATCAACCACC TGGAGGCCCA CATTTACGCG
AACTTTCTGG TAGCACCCGA CCTGCCCTTT CCCCTGCTCT GCCTGATCGT ATCCGGCGGG
CACACCGATC TTGTGCTCAT CACGCGGCTC GGCGAGTACC GTCTTCTGGG TCGTACCCGT
GATGACGCCG CGGGCGAGGC CTTCGACAAG GTGGCCCGGG TCCTGGAACT GGGTTATCCG
GGCGGCCCGC TCATCGAAAA GCTGGCCCGG GAGGGGGACC CGGAGGCGGT GCCTTTTCCC
CGCGCCTACC TGGAGGAGGG CACCCTGGAC TTCAGCTTCA GCGGCCTGAA AACGGCGGTG
ATCAACTACC TGGACCGGGC GCGGCGCGAG GGCCGGAAAG TGGCCGAGGC CGACGTGGCG
GCCGGTTTTC AGGAGGCGGT GGTGGGCGTG CTGGTGGACA AGGTACTGGC CGCGGCGCGA
GTCCACCGCC CGGCCCGCAT CCTGCTGGCC GGCGGGGTGG CCGCCAACCG CGTGCTGGCC
CGGGAGTTGG AGCGGCGGGC GGCGGCAGAG GGTTTCGGCG TCACCGTGCC GCCGCCCGTA
TTCTGCACCG ACAACGCGGC CATGGTCGCC TGCGCCGGTT ACTACCGCTA CCTGCACGGG
GATTCGTCCC CCCTGACTTT AAACGCCCTG GCCGGGCTTG GTCTGGGCTG TGAATGA
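
The gene sequence is laid out in space-separated blocks of 10 bases, so the whitespace has to be stripped before any computation. As a rough sketch (the helper names are hypothetical), GC content can be computed as follows and, when run on the full gene sequence, compared against the GC content field reported for this gene. The sample below uses only the first three blocks of the sequence, so its value is not expected to match the whole-gene figure.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First three 10-base blocks of the gene sequence, used only as a sample.
sample = "ATGAGCGTGA TCTTGCTGGG CGTGGAAACC"
print(len(clean_sequence(sample)), round(gc_percent(sample), 1))
```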
 
Protein sequence
MSVILLGVET SCDETAAAVV ENGHLVRSNT IASQFDLHGK FGGVVPEVAS RRHLESINPV 
IRQALEEADV SFRDLDGVAV TYGPGLAGSL LVGLMAAKTI AYALDIPLFG INHLEAHIYA
NFLVAPDLPF PLLCLIVSGG HTDLVLITRL GEYRLLGRTR DDAAGEAFDK VARVLELGYP
GGPLIEKLAR EGDPEAVPFP RAYLEEGTLD FSFSGLKTAV INYLDRARRE GRKVAEADVA
AGFQEAVVGV LVDKVLAAAR VHRPARILLA GGVAANRVLA RELERRAAAE GFGVTVPPPV
FCTDNAAMVA CAGYYRYLHG DSSPLTLNAL AGLGLGCE
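
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is an illustration rather than the method the database used. It assumes the standard codon assignments, which translation table 11 shares with the standard genetic code (table 11 differs in the start codons it permits). The `translate` function and the table construction are written for this example only.

```python
# Build the codon table in the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(raw: str) -> str:
    """Translate a block-formatted DNA sequence, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence and the last seven bases plus two
# preceding bases forming whole codons (TGT GAA TGA).
print(translate("ATGAGCGTGA TCTTGCTGGG CGTGGAAACC"))  # prints: MSVILLGVET
print(translate("TGTGAATGA"))                         # prints: CE
```

The first call reproduces the opening block of the protein sequence, and the second shows the gene ending in the stop codon TGA after the final residues C and E.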