Gene Information |
Locus tag | Dde_3095 |
Symbol | |
ID | 3758089 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio desulfuricans subsp. desulfuricans str. G20 |
Kingdom | Bacteria |
Replicon accession | NC_007519 |
Strand | + |
Start bp | 3083368 |
End bp | 3084717 |
Gene Length | 1350 bp |
Protein Length | 449 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637784005 |
Product | U32 family peptidase |
Protein accession | YP_389584 |
Protein GI | 78358135 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0826] Collagenase and related proteases |
TIGRFAM ID | |
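
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3083368
end_bp = 3084717
gene_length_bp = 1350
protein_length_aa = 449

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as an amino acid.
codons = span // 3

print(span == gene_length_bp)           # reported gene length matches the coordinates
print(span % 3 == 0)                    # an unspliced CDS should be a whole number of codons
print(codons - 1 == protein_length_aa)  # reported protein length matches the codon count
```

Under these assumptions, all three checks print `True` for this record.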
| |
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCCTGA CAACATTTTC TCCGGAACTG CTGGCCCCTG CCGGTGACAT GTCCAGACTG GACGCCGTAC TGCGCTACGG TGCCGACGCC GTCTATCTGG GCGGCACGGA AATGAATCTG CGCGCCGGTG CCGCGGGGTT CACCCCCGAA GCTCTGGGCA CTGCACTGGC CAAAGCCCGC CGCTGCGGCG CCAGAGTATA CTGCTGTGTC AACGCCCTGC CCTACGAGCA TCAGCTGGAA ACGGTGCGCG CCACACTGGA AAGACTGGCG CAGCCTTTCA GCTGCAACCA GCAGACTCTG GAACCGCACT GGCGCGACGA CACGCTGCCC GACACCACGG AAAACAGCAT TGACGGGCTG ATCATAGCCG ACCCCGGCGT GCTGCGCATG GCGCGCCGCA TAGCGCCGCA CATCCCCGTG CACATGAGCA CACAGGCCAA TACGGCCAAC AGCGAATCCG CAGCATTCTG GCGCGATATG GGGGCAAGCA GGCTAAATCT GGCACGCGAG CTGGGCGCGG CGGACATACG TGCCATCATG CGGGCCGTAC CCGACGTGGA ATATGAAACA TTTGTCCACG GGGCCATGTG TCTTGCCGTT TCCGGCCGCT GCCTGCTGAG CGCATGGATG AACGACCGCC CCGCCAATCT GGGGCAGTGC ACCCACCCGT GCCGGTTCGA CTACCGCACG GCGGGGCTTG CCGCCAGTGA TGCGGAACCG GACCATGCAG GCATCGAACT GGACGTCGAA GAACGCACCC GCGCCGGCGC ACCGGCATGG ACGGTGACGC AGGACAGTGG CTGGTCACAC ATCTGGAGTC CGCATGACCT GTGTCTGGTG CGGTATCTGC GCTGGTTTGC CGTGCAGGGC GTGGCTGCAC TGAAAATTGA AGGCCGCATG AAAACCGCAG GCTATGCCGC GCAGGTTGTT GATGTTTACC GTACTGCCGT AGATGACCTC GCCGCCGGGC GTTTTCGCCC TGCGCTGTAC ATGCGTGAGC TGTGCAACAC GGCCACCCGT CCGCTTTCGT CAGGTTTTTT CCTGCCCCGC GGACGCAGGC GCACCTGGCA GGCGGCCTCT TCCGGCCATC GTCTGCCGCT GGTGGCCAGA ACAGGCCGCC GTCTTTCCGC AGGCAGCTGG GAAATGGCTG TACTGGCACC GTGGCAGTGC GACAGACCGG TCGAGATTCT CGTTCCGGGA CTGAAAAGGC CCCTGCTGCA ACCGCAGCAC TGCCGCGTGG AAAACCACAG AGGCGAAACC GCGCGGCAGG TGCACCCCGG CACTTCCGCG ATACTGCACT GCGACCACCC CGACCTTGCT CCGGGGCTGT TTCTGCGGGC CTGCACATAG
Protein sequence | MPLTTFSPEL LAPAGDMSRL DAVLRYGADA VYLGGTEMNL RAGAAGFTPE ALGTALAKAR RCGARVYCCV NALPYEHQLE TVRATLERLA QPFSCNQQTL EPHWRDDTLP DTTENSIDGL IIADPGVLRM ARRIAPHIPV HMSTQANTAN SESAAFWRDM GASRLNLARE LGAADIRAIM RAVPDVEYET FVHGAMCLAV SGRCLLSAWM NDRPANLGQC THPCRFDYRT AGLAASDAEP DHAGIELDVE ERTRAGAPAW TVTQDSGWSH IWSPHDLCLV RYLRWFAVQG VAALKIEGRM KTAGYAAQVV DVYRTAVDDL AAGRFRPALY MRELCNTATR PLSSGFFLPR GRRRTWQAAS SGHRLPLVAR TGRRLSAGSW EMAVLAPWQC DRPVEILVPG LKRPLLQPQH CRVENHRGET ARQVHPGTSA ILHCDHPDLA PGLFLRACT
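
The gene and protein sequences above are printed in space-separated blocks of ten characters. The sketch below shows one possible way to strip that spacing, compute a GC fraction, and translate the coding sequence. It assumes the sequence contains only A, C, G, and T, and that the reading frame starts at the first base. Translation table 11 assigns the same amino acids to codons as the standard code and differs in the start codons it allows. Because this gene starts with ATG, a plain standard-code lookup is used here. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
# Standard codon table, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence above.
first_blocks = "ATGCCCCTGA CAACATTTTC TCCGGAACTG"
print(translate(first_blocks))  # MPLTTFSPEL
```

The printed peptide matches the first block of the protein sequence in the record. Passing the full gene sequence to `gc_fraction` and `translate` would allow the reported GC content and protein sequence to be compared in the same way.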