Gene Dde_3095 details

Gene Information

Locus tag: Dde_3095
Symbol:
ID: 3758089
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfovibrio desulfuricans subsp. desulfuricans str. G20
Kingdom: Bacteria
Replicon accession: NC_007519
Strand:
Start bp: 3083368
End bp: 3084717
Gene Length: 1350 bp
Protein Length: 449 aa
Translation table: 11
GC content: 64%
IMG OID: 637784005
Product: U32 family peptidase
Protein accession: YP_389584
Protein GI: 78358135
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0826] Collagenase and related proteases
TIGRFAM ID:
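
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes that Start bp and End bp are 1-based, inclusive coordinates and that the final codon of the CDS is a stop codon, which is consistent with the gene sequence ending in TAG. The function names gene_length and protein_length are hypothetical.

```python
# Values copied from the record above.
START_BP = 3083368
END_BP = 3084717
GENE_LENGTH_BP = 1350
PROTEIN_LENGTH_AA = 449


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming its final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for the values listed in this record.
    print(gene_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 3084717 - 3083368 + 1 gives 1350 bp, and 1350 / 3 - 1 gives 449 aa, matching the Gene Length and Protein Length fields.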


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCCCTGA CAACATTTTC TCCGGAACTG CTGGCCCCTG CCGGTGACAT GTCCAGACTG 
GACGCCGTAC TGCGCTACGG TGCCGACGCC GTCTATCTGG GCGGCACGGA AATGAATCTG
CGCGCCGGTG CCGCGGGGTT CACCCCCGAA GCTCTGGGCA CTGCACTGGC CAAAGCCCGC
CGCTGCGGCG CCAGAGTATA CTGCTGTGTC AACGCCCTGC CCTACGAGCA TCAGCTGGAA
ACGGTGCGCG CCACACTGGA AAGACTGGCG CAGCCTTTCA GCTGCAACCA GCAGACTCTG
GAACCGCACT GGCGCGACGA CACGCTGCCC GACACCACGG AAAACAGCAT TGACGGGCTG
ATCATAGCCG ACCCCGGCGT GCTGCGCATG GCGCGCCGCA TAGCGCCGCA CATCCCCGTG
CACATGAGCA CACAGGCCAA TACGGCCAAC AGCGAATCCG CAGCATTCTG GCGCGATATG
GGGGCAAGCA GGCTAAATCT GGCACGCGAG CTGGGCGCGG CGGACATACG TGCCATCATG
CGGGCCGTAC CCGACGTGGA ATATGAAACA TTTGTCCACG GGGCCATGTG TCTTGCCGTT
TCCGGCCGCT GCCTGCTGAG CGCATGGATG AACGACCGCC CCGCCAATCT GGGGCAGTGC
ACCCACCCGT GCCGGTTCGA CTACCGCACG GCGGGGCTTG CCGCCAGTGA TGCGGAACCG
GACCATGCAG GCATCGAACT GGACGTCGAA GAACGCACCC GCGCCGGCGC ACCGGCATGG
ACGGTGACGC AGGACAGTGG CTGGTCACAC ATCTGGAGTC CGCATGACCT GTGTCTGGTG
CGGTATCTGC GCTGGTTTGC CGTGCAGGGC GTGGCTGCAC TGAAAATTGA AGGCCGCATG
AAAACCGCAG GCTATGCCGC GCAGGTTGTT GATGTTTACC GTACTGCCGT AGATGACCTC
GCCGCCGGGC GTTTTCGCCC TGCGCTGTAC ATGCGTGAGC TGTGCAACAC GGCCACCCGT
CCGCTTTCGT CAGGTTTTTT CCTGCCCCGC GGACGCAGGC GCACCTGGCA GGCGGCCTCT
TCCGGCCATC GTCTGCCGCT GGTGGCCAGA ACAGGCCGCC GTCTTTCCGC AGGCAGCTGG
GAAATGGCTG TACTGGCACC GTGGCAGTGC GACAGACCGG TCGAGATTCT CGTTCCGGGA
CTGAAAAGGC CCCTGCTGCA ACCGCAGCAC TGCCGCGTGG AAAACCACAG AGGCGAAACC
GCGCGGCAGG TGCACCCCGG CACTTCCGCG ATACTGCACT GCGACCACCC CGACCTTGCT
CCGGGGCTGT TTCTGCGGGC CTGCACATAG
 
Protein sequence
MPLTTFSPEL LAPAGDMSRL DAVLRYGADA VYLGGTEMNL RAGAAGFTPE ALGTALAKAR 
RCGARVYCCV NALPYEHQLE TVRATLERLA QPFSCNQQTL EPHWRDDTLP DTTENSIDGL
IIADPGVLRM ARRIAPHIPV HMSTQANTAN SESAAFWRDM GASRLNLARE LGAADIRAIM
RAVPDVEYET FVHGAMCLAV SGRCLLSAWM NDRPANLGQC THPCRFDYRT AGLAASDAEP
DHAGIELDVE ERTRAGAPAW TVTQDSGWSH IWSPHDLCLV RYLRWFAVQG VAALKIEGRM
KTAGYAAQVV DVYRTAVDDL AAGRFRPALY MRELCNTATR PLSSGFFLPR GRRRTWQAAS
SGHRLPLVAR TGRRLSAGSW EMAVLAPWQC DRPVEILVPG LKRPLLQPQH CRVENHRGET
ARQVHPGTSA ILHCDHPDLA PGLFLRACT