Gene Dred_0641 details


Gene Information

Locus tag: Dred_0641
Symbol:
ID: 4955927
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfotomaculum reducens MI-1
Kingdom: Bacteria
Replicon accession: NC_009253
Strand:
Start bp: 690363
End bp: 691508
Gene Length: 1146 bp
Protein Length: 381 aa
Translation table: 11
GC content: 42%
IMG OID: 640179816
Product: peptidase S1 and S6, chymotrypsin/Hap
Protein accession: YP_001112006
Protein GI: 134298510
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID: [TIGR02037] periplasmic serine protease, Do/DeqQ family
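
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes that the start and end positions are 1-based and inclusive, and that the protein length excludes a single terminal stop codon; neither convention is stated in the record. The function names are hypothetical and used only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Number of residues encoded by a CDS, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Each codon is 3 bp; the final codon is the stop and encodes no residue.
    return gene_length // 3 - 1


length = gene_length_bp(690363, 691508)
print(length)                     # 1146
print(protein_length_aa(length))  # 381
```

Both values match the Gene Length and Protein Length fields of this record.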


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCCAAAT GGAGGAATAT AGCTGTCTGT ACCTTAATTA TTGCATTCCT GTCTGGAATC 
ATGTTTGCTG CGGGTTGTAC CTTAATTAAG GATATTAGCC CAAAGGAAAA TATAGCACAG
CAGCAGGAAG GAGTAGCTAA CGCCGGTATG CCGGGAGTGG GACCGAATGC CATTGCAGAC
ATAGTAGAAA AGGTTAGTCC GGCAGTTGTT AAAATTACGA CGGTGGTGGC GGTGAAAGGC
TACATAGATA ACAACCCATT TCTAAATGAT CCGTTTTTTA GACAATTCTT TGGAGAAAAT
GCGCAACCCA AGTATCAGAG TGGTTTAGGT TCTGGATTTA TTATTTCTAA GGATGGTTAT
ATTTTGACCA ATGATCATGT TGTTGAAGGT GCTCAAAAAA TTAGTGTGCT AGTGAAAGGA
TATAAAAAGC CCTATGAAGC AAAACTCATA GGAGCAGATC CTTCAATGGA TTTGGCAGTA
TTAAAAATTG AGGGAAAGGA GTTCCCAACC TTACCACTGG GGGATTCAAA AAAAATACGG
GTAGGCAATT GGGTGATAGC CATTGGGAGC CCCTTTGGGT TGGAAGACAC TGTTACCATT
GGTGTCATTA GTGCTAAGGA ACGTCCTTTA GAAATCGACA ACCGTACCTT TGAACACCTA
CTACAAACGG ATGCATCCAT CAACCCAGGT AACAGTGGGG GTCCCTTGCT CAATTTAAAT
GGGGAAGTAA TTGGTATAAA TACTGCCATC AATGCCCAAG CCCAGGGTAT TGGTTTTGCG
ATACCCACCA GCACCGTGAA AGAAATAATT GATGATTTGA TTCAGCAGGG AAAAGTGAAG
AGACCATGGC TAGGTGTACA AATCCAGCCG GTAACCCAGG ATATTGCCAA TTTTCTTGGT
TATGATGGTA CAACAGGAGC TGTCATCTAT GGAGTTGTAC CTGATGGACC AGCAGCTAAA
GCCGGTATAC AAGAGGGTGA CATTGTCTTA AGCATTGATG ATACAAAGAT TGATGACCCG
GATACATTAA TAAAAACCAT GCAGAAGAAG AAGGTAGGTA CCAAGGTGTC AATGAAAGTG
TTCCGCAAGG GAAAGACCAT CCAAATCACT GTTCTTACAG ATGAAAGACC AGCAAATGTA
AAATAA
 
Protein sequence
MSKWRNIAVC TLIIAFLSGI MFAAGCTLIK DISPKENIAQ QQEGVANAGM PGVGPNAIAD 
IVEKVSPAVV KITTVVAVKG YIDNNPFLND PFFRQFFGEN AQPKYQSGLG SGFIISKDGY
ILTNDHVVEG AQKISVLVKG YKKPYEAKLI GADPSMDLAV LKIEGKEFPT LPLGDSKKIR
VGNWVIAIGS PFGLEDTVTI GVISAKERPL EIDNRTFEHL LQTDASINPG NSGGPLLNLN
GEVIGINTAI NAQAQGIGFA IPTSTVKEII DDLIQQGKVK RPWLGVQIQP VTQDIANFLG
YDGTTGAVIY GVVPDGPAAK AGIQEGDIVL SIDDTKIDDP DTLIKTMQKK KVGTKVSMKV
FRKGKTIQIT VLTDERPANV K
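
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the amino-acid assignments of NCBI translation table 11, which are the same as those of the standard code (table 11 differs in the start codons it permits, which this sketch ignores). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, and the code is not the method used by the source database.

```python
from itertools import product

# Codon table built in the conventional TCAG order. The amino-acid
# assignments are those of the standard code, which table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to display the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence from its first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence above, as displayed.
first_line = "ATGTCCAAAT GGAGGAATAT AGCTGTCTGT ACCTTAATTA TTGCATTCCT GTCTGGAATC"
print(translate(first_line))  # MSKWRNIAVCTLIIAFLSGI
```

The output corresponds to the first 20 residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field of the record.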