Gene Daud_1880 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaud_1880 
Symbol 
ID6027150 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Desulforudis audaxviator MP104C 
KingdomBacteria 
Replicon accessionNC_010424 
Strand
Start bp1979881 
End bp1981041 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content62% 
IMG OID641594698 
Productpeptidase S1 and S6, chymotrypsin/Hap 
Protein accessionYP_001718005 
Protein GI169832023 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0947444 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTTGGA GACGGAAGTT AGTGTTCACC ACCCTTTTGG CTCTGGTAGC CGGGATAATG 
TTCGCCGCCG GCAATATGGC GACCAGGGAT TTTTGGCCAC AGGAGCCCCT CGTCAACACC
AACAACCGCA CGGCGGTGGC CGCTCCCGAA TTGCCCGGCG TAAACCCGGG TACGATCGTC
GAAATCGTTG GCCGGGCCGG GCCGGCGGTA GTGAAAATCG ACACAGTGGT ACCAACCTCC
GGCAGGGAAT GGAGCCCGTT CTTTGACGAT CCCTTCTTCC GTGACTTTTT CGGCCTTCCC
GACATTTCAC CCCCGCAGTC GAGCCCCAGC CGCCGGGGTA TGGGGTCCGG GTTTCTGTTT
TCCGAGGACG GTTACATCCT GACTAATGAA CACGTAATCC GCGGTGCCGA GGAGATCTGG
GTGACCCTGA CCGGGTTCGA AACACCGCTG GCCGCAAAGG TGGTGGGTTC GGACTACGAC
CTCGACCTGG CGGTGCTCCG GGTGAACGCC CCGCGCAAGC TGCCCCACCT GAAACTTGGT
GATTCGGACA ATGTCCGGGT TGGTGAGTGG GTGATCGCTA TCGGCAACCC CTACGGCCTG
GACCATACGG TCACCGTCGG GGTGATCAGC GCCAAGGGCC GCCCGGTGAC CATCGAGGAC
CGGTACTACG ACAATCTCCT GCAGACTGAC GCCTCCATCA ACCCAGGGAA CAGCGGCGGC
CCGCTGTTGA ATCTCCGGGG CGAGGTCGTG GCGATCAACA CGGCTGTCAA CGCCCAGGCC
CAGGGGATCG GCTTCGCCAT CCCGACCAGC ACCATCCGTC CCGTCCTGGA TGAGTTGATC
AGAACCGGCG GCATCAGTCA CGCTTGGCTG GGTGTGCAGT TGGACACTGT TTCTCCGGAA
CTGGCCCGGT ACCTGAAACT TCGGGGCACT ACCGGGGCGC TGGTAATCGG GATTGTGGCC
GACAGCCCGG CGGCCAGAGC CGGTTTCCGG CCGGGTGACG TGATCCTGGA GTTAAACGGG
GCTCCGGTGA ACAACCCGGA AAAAGTGATC CGGGCGATCA GGACCCATAA GGCGGGCGAA
ACCCTAAAGG TGAAGATATT CCGGGACGGC AGCGTCCGCG AGCTGGAAGT GAAACTGGGC
GAGAAACCGG TCCGCCGTTA A
 
Protein sequence
MSWRRKLVFT TLLALVAGIM FAAGNMATRD FWPQEPLVNT NNRTAVAAPE LPGVNPGTIV 
EIVGRAGPAV VKIDTVVPTS GREWSPFFDD PFFRDFFGLP DISPPQSSPS RRGMGSGFLF
SEDGYILTNE HVIRGAEEIW VTLTGFETPL AAKVVGSDYD LDLAVLRVNA PRKLPHLKLG
DSDNVRVGEW VIAIGNPYGL DHTVTVGVIS AKGRPVTIED RYYDNLLQTD ASINPGNSGG
PLLNLRGEVV AINTAVNAQA QGIGFAIPTS TIRPVLDELI RTGGISHAWL GVQLDTVSPE
LARYLKLRGT TGALVIGIVA DSPAARAGFR PGDVILELNG APVNNPEKVI RAIRTHKAGE
TLKVKIFRDG SVRELEVKLG EKPVRR