Gene Dhaf_4739 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDhaf_4739 
Symbol 
ID7261768 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfitobacterium hafniense DCB-2 
KingdomBacteria 
Replicon accessionNC_011830 
Strand
Start bp5063460 
End bp5064647 
Gene Length1188 bp 
Protein Length395 aa 
Translation table11 
GC content47% 
IMG OID643564650 
Productcarboxyl-terminal protease 
Protein accessionYP_002461170 
Protein GI219670735 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0793] Periplasmic protease 
TIGRFAM ID[TIGR00225] C-terminal peptidase (prc) 


Plasmid Coverage information

Num covering plasmid clones39 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATTTGC AGGAGAGTCG TTGGAAAGAG TATTTAAAAA ACCTGGGATG GGTTCTTGCC 
ATAGGGAGTT TGATTTTTAC GGTGGTTGTC GGGGGGTTCA TTGTAACCAA CCTGGATCAT
TTAGGCCGCT TGGCAAGAGT AGTCAAGCTT GTTGAAAGTG ATTATTTGGA GGAAGTGTCT
GTAGACACCC TGATCGAAGG TGCCACCAAA GGAATTGTGG ATTCCTTGGG CGATCCTTAT
TCAAGTTATA TGAATGCTCA AGAGAATGAA GAGCTTATGC AACAAATCGA AGGGAAATTC
GGCGGCGTGG GGATTATTTT AAGTCTGAAA GATCCTCAGA AACTTGTGGT CCTAAGACCC
ATTAAAAACA CTCCGGCCGC TAAAGCCGGA CTACAGCCTG GAGATGTGAT TATTAAGATC
GATGATGTGG ACGCCACCAC CATCGATCAG GAAAAAGCCG TCTCCCTGAT GCGCGGGAAC
CCGGGGACTA ACGTGACTCT GGTGGTCTAT CGGGAAAGCA TTAAGCAGAA TGTGACCGTT
CCTTTAACCC GGGAAAATAT CGCAGTACCC ACGGTGGAGG GACTGGCTCT GCCAGGGAAT
TCGGATATAG CTTATATCGG AATTTCCCAG TTCTCCTCCC ATACAGCTCT TGAACTCAAT
GAAGTGCTGC GCAATATGGA TATCAGCAAA TACAAAGGGA TGATCTTGGA TTTACGCTAT
AATCATGGCG GGGAATTAGA ATCTGCTGTA GGAGTAGCCA GTTATTTTGT TCAGCCCGGC
CCCATTGTCT ATATTGTGGA TAAAGGAGGC AATGCTGTAA CCAAGGCTTC GGAAGGCAAT
TATTTAGGCA TTCCCTTTGT GGTTTTGGTC AATGAGGAAA GCGCTTCCGC AGCTGAAATC
GTTTCCGGGG CCATCAAAGA TCGGGGAACG GGCACCCTTG TGGGTACCAA GACCTTCGGT
AAAGGGATTG TGCAGACGAT TTATCAACTG GATAGGGGGA CCAGTGTGAA GCTGACCACC
GCCAAGTATT TGACCCCTAA TAAGATCGAT ATTCATAAAA AGGGCATCGA GCCTGATGTG
GAAGTGAAGC TGAAGGATGG AGAGGAAGCA ACTCTTTCTC CTACTACGAA AGCCTTTGAT
ACTCAGCTCA CGGAAGCTCT TAAGGTGCTT CGCCAACAGA TGAAATAA
 
Protein sequence
MDLQESRWKE YLKNLGWVLA IGSLIFTVVV GGFIVTNLDH LGRLARVVKL VESDYLEEVS 
VDTLIEGATK GIVDSLGDPY SSYMNAQENE ELMQQIEGKF GGVGIILSLK DPQKLVVLRP
IKNTPAAKAG LQPGDVIIKI DDVDATTIDQ EKAVSLMRGN PGTNVTLVVY RESIKQNVTV
PLTRENIAVP TVEGLALPGN SDIAYIGISQ FSSHTALELN EVLRNMDISK YKGMILDLRY
NHGGELESAV GVASYFVQPG PIVYIVDKGG NAVTKASEGN YLGIPFVVLV NEESASAAEI
VSGAIKDRGT GTLVGTKTFG KGIVQTIYQL DRGTSVKLTT AKYLTPNKID IHKKGIEPDV
EVKLKDGEEA TLSPTTKAFD TQLTEALKVL RQQMK