Gene Information |
Locus tag | Dtox_4299 |
Symbol | |
ID | 8431313 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfotomaculum acetoxidans DSM 771 |
Kingdom | Bacteria |
Replicon accession | NC_013216 |
Strand | + |
Start bp | 4466962 |
End bp | 4468251 |
Gene Length | 1290 bp |
Protein Length | 429 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 645036491 |
Product | proteinase inhibitor I4 serpin |
Protein accession | YP_003193589 |
Protein GI | 258517367 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG4826] Serine protease inhibitor |
TIGRFAM ID | |
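
The coordinate and length fields above are mutually consistent, and the arithmetic can be checked in a few lines. This is a minimal sketch that assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminating stop codon; the variable names are illustrative and are not part of the record.

```python
# Values copied from the record; the variable names are illustrative.
start_bp = 4466962
end_bp = 4468251

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as a residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1290 429
```

Both results match the Gene Length (1290 bp) and Protein Length (429 aa) rows.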
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0655436 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACGCC TTATAGTCAT GCTTTTAGCA ACCGTAATGC TGTTCGGCCT GACAGCGTGC AGCCAACCAG TGAAGGTGAC GGGAACCAAC CTTGTTGCAG CCCCTGTTTA TCCCAAAGGA ATTGACTTTG GGGACTCGGA CAAGCAGCGA GAGCTCCGGG AAAACAACCC GGTAAATAAA GATACTGTAA ACGCTGTCAA CCAATTCTCT TATGACACCG CGGCTCAGCT ATTGAAAGGA AGCGATACAA ATGGCTGTTA TTCGCCATTG AGCCTGTATT ATGCCCTGGC GCTGGCTGCA GCCGGAGCCG AAGACTCCAC CAGGGACGAG CTGCTAACCC TTTTGGGCTT TGAAGATGCG GACAGCCTGT CGAAACAATG CGGGAACCTC TACCGCCTGC TCTATACCGA CAATAAGGTT TCCAGGCTGA AGATCGCCAA TTCCCTGTGG CTGGCCGATG AAACTGACGG ACAGCAAATC TCCTTTAAGG ACAGCTATAT CAAAAACGCC ACGGAGCATT TTTATACATC CATCTTTACC GCCGATTTTG CCGACGAGAA TACCGGCAAG GCAATGGGCC GCTGGATCTC GGAGAACACT AACGGCACCC TTGCCCCAGA ATTCAAGACA AACACCGAGC AAATCATGAG TATTCTCAAC ACGGTTTACT TTTACGATCA ATGGACTGAC CGCTTCAATG CAGAAAAGAC CAAAGAGGAC ACCTTCTATC TTCAAAGCGG TCCGGAAGTT GTCTGTGATT TTATGAATAT GAATTATTGG TCACACGGTT TCAGCAAGGG CAACGGGTAT ACCCGTTCTT CGCTAGGCCT AAAAACAAGC GGCAGCATGA TATTTATCCT GCCTGATGAA GGCGTTGCCG TCGCAGACCT GCTGTCTTCC CCGCAAAAGC TGGAGAAAAT ATTTACGCAG GGCGAAGACA AAAACGGAAA GGTTGTCTGG AGCGTCCCTA AATTCAAATA CGGATCCAGC TTCGACCTGG TTGATACGTT GAAAGCATTA GGTATCACCT CAGCTTTTTC TTTGGACAGC GCAGACTTCT CCGCTCTGAC CAATGCCCCC GCGTTTATCT CCGGGGTCAA ACAGGAAACT CATATTTCTA TTGACGAAAA CGGCGTGGAG GCTTCCGCGT TTACCAAGAT CGACTATATG GGTGCCGCAC AGCCCAAGGA TAAAGCGGAA ATGATACTCA ACCGCCCTTT CATCTACGGT ATTACGGCCG CCAACGGAGC ATTGCTGTTC GTGGGCATAT GTATGAACCC TGCTTCCTGA
|
Protein sequence | MKRLIVMLLA TVMLFGLTAC SQPVKVTGTN LVAAPVYPKG IDFGDSDKQR ELRENNPVNK DTVNAVNQFS YDTAAQLLKG SDTNGCYSPL SLYYALALAA AGAEDSTRDE LLTLLGFEDA DSLSKQCGNL YRLLYTDNKV SRLKIANSLW LADETDGQQI SFKDSYIKNA TEHFYTSIFT ADFADENTGK AMGRWISENT NGTLAPEFKT NTEQIMSILN TVYFYDQWTD RFNAEKTKED TFYLQSGPEV VCDFMNMNYW SHGFSKGNGY TRSSLGLKTS GSMIFILPDE GVAVADLLSS PQKLEKIFTQ GEDKNGKVVW SVPKFKYGSS FDLVDTLKAL GITSAFSLDS ADFSALTNAP AFISGVKQET HISIDENGVE ASAFTKIDYM GAAQPKDKAE MILNRPFIYG ITAANGALLF VGICMNPAS
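
The relationship between the two sequence rows can be sketched as follows. The gene sequence is printed in space-separated blocks of ten bases, so the whitespace has to be removed before codons are read. Translation table 11 assigns the same amino acids to codons as the standard genetic code (it differs in the permitted start codons), so a standard codon table is used here. The helper names `clean`, `gc_content` and `translate` are hypothetical and written for this illustration only.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 shares these assignments.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block-separating whitespace and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
prefix = "ATGAAACGCC TTATAGTCAT GCTTTTAGCA"
print(translate(prefix))  # MKRLIVMLLA
```

The output matches the first block of the protein sequence. Passing the full gene sequence to `gc_content` gives a fraction that can be compared with the GC content row.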