Gene Dtox_3963 details

Gene Information

Locus tag: Dtox_3963
Symbol:
ID: 8430978
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfotomaculum acetoxidans DSM 771
Kingdom: Bacteria
Replicon accession: NC_013216
Strand:
Start bp: 4146896
End bp: 4148047
Gene Length: 1152 bp
Protein Length: 383 aa
Translation table: 11
GC content: 46%
IMG OID: 645036181
Product: carboxyl-terminal protease
Protein accession: YP_003193279
Protein GI: 258517057
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0793] Periplasmic protease
TIGRFAM ID: [TIGR00225] C-terminal peptidase (prc)
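
The coordinates and lengths listed in this record can be cross-checked with a little arithmetic. The Python sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes one stop codon. Both are assumptions about this database's conventions, although they agree with the values listed here. The function names are illustrative only.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_from_cds(cds_length_bp: int) -> int:
    """Amino-acid count for a CDS whose length includes one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record above.
gene_length = span_length(4146896, 4148047)
protein_length = protein_length_from_cds(gene_length)
print(gene_length, protein_length)  # 1152 383
```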


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000000206146
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00000000432275
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGAGGCGGA GATTTGGTAC CGGGCCAAGA TGGTCAATAG TTGTTGTTGG CCTGGCGGTG 
TTGTTATTTG CCGGTGTTGT ATTTGCCGGT GGGATAATAG CGGTAAACTA TAAACATATG
GGAAACCTGG TGAGGGTAAT ATCACTGGTG CGTTCTCAAT ATTTACACCC GGTTGAAACG
TCTGATTTAA TTGACGGCGC GATTAAGGGT TTGGTTGATT CCTTGCATGA TGAGTATTCA
GTCTACTTGG AACCTAAGAC CTATGCGCAG CTCCAGGCGC AAATCAGGGG TTCCTTCGGA
GGTTTAGGTA TTTTAGTCGG TGTTAAGGAT GATTATTTGA CAGTAGTGCG GGTTTATGAC
AACACTCCCG CGGCCAAAAA AGGGATTAAA GCCGGAGATA AGATTGTAAA AATCGGTGAT
CAGGACGCGC AAGGAATACA TTTGGATAGT GCGGTGGAAT TAATGCGAGG GGCGGTTGGT
TCGAAAATTA AATTGACAGT AAAAAGAGAG CATGAGCCTG AATTGCTGGA AATTAATCTG
GTCAGGGAAG AAATCAGTGT TCCTACTGTG GAAGGCAAGG TTATAGAGGG TACCGATATA
GGTTATATGG TGCTTAGCCA GTTTTCTGAG AAAACTCCTG ATGAGTTGGA TAAGGTATTG
TCTGATTTAG AGAGAGAGGA TATCAAGGGA ATTATTTTGG ACCTGCGGGA CAACCCGGGT
GGCGAATTGG TTTCGGCTAC CAAGGTGGCT AATTATTTTT TGCCGGCCGG TCCCATTGTT
TATGTAGACT ACCGGGTGGG CAAGGATCAG ACTTTTACTG CGGACGGGCA TAGAGTGAAA
CTTCCGCTGG TGGTACTGGT GAATGGCAAC AGCGCCAGCG CAGCGGAAAT TTTGTCAGGG
GCAATAAAGG ATACCGGCGC GGGAACTCTT GTCGGAGAAA AGACCTTCGG TAAAGGTATT
GTGCAGACGG TATTTCCCCT GGATAACGAG GCGGGCCTGA AGCTGACCAC GGCCAGGTAT
CTGACTCCTA AAAAGAAGGA TATTCATAAA AAAGGAATCG AGCCTGATGT AGAGGTTAAG
CAGAAACCAA ATGCCCAGCC TGATTTGCAG TTTGAAAAAG CTATAGAAAT TATGAAGCAG
AAGATATCAT AA
 
Protein sequence
MRRRFGTGPR WSIVVVGLAV LLFAGVVFAG GIIAVNYKHM GNLVRVISLV RSQYLHPVET 
SDLIDGAIKG LVDSLHDEYS VYLEPKTYAQ LQAQIRGSFG GLGILVGVKD DYLTVVRVYD
NTPAAKKGIK AGDKIVKIGD QDAQGIHLDS AVELMRGAVG SKIKLTVKRE HEPELLEINL
VREEISVPTV EGKVIEGTDI GYMVLSQFSE KTPDELDKVL SDLEREDIKG IILDLRDNPG
GELVSATKVA NYFLPAGPIV YVDYRVGKDQ TFTADGHRVK LPLVVLVNGN SASAAEILSG
AIKDTGAGTL VGEKTFGKGI VQTVFPLDNE AGLKLTTARY LTPKKKDIHK KGIEPDVEVK
QKPNAQPDLQ FEKAIEIMKQ KIS
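
The sequences above are printed in blocks of ten characters, so the spacing has to be stripped before any analysis. The Python sketch below shows one way to do that and to compute a GC fraction. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first printed line of the gene sequence is used as sample input, so the resulting fraction is not the record's overall 46% figure.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First printed line of the gene sequence, copied from the record.
first_line = "GTGAGGCGGA GATTTGGTAC CGGGCCAAGA TGGTCAATAG TTGTTGTTGG CCTGGCGGTG"
seq = clean_sequence(first_line)
print(len(seq))   # 60
print(seq[:3])    # GTG
print(round(gc_fraction(seq), 2))
```

The first codon is GTG rather than ATG. Translation table 11 permits GTG as a start codon, which is consistent with the protein sequence beginning with M.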