Gene Dtox_4299 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_4299 
Symbol 
ID8431313 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp4466962 
End bp4468251 
Gene Length1290 bp 
Protein Length429 aa 
Translation table11 
GC content50% 
IMG OID645036491 
Productproteinase inhibitor I4 serpin 
Protein accessionYP_003193589 
Protein GI258517367 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG4826] Serine protease inhibitor 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0655436 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACGCC TTATAGTCAT GCTTTTAGCA ACCGTAATGC TGTTCGGCCT GACAGCGTGC 
AGCCAACCAG TGAAGGTGAC GGGAACCAAC CTTGTTGCAG CCCCTGTTTA TCCCAAAGGA
ATTGACTTTG GGGACTCGGA CAAGCAGCGA GAGCTCCGGG AAAACAACCC GGTAAATAAA
GATACTGTAA ACGCTGTCAA CCAATTCTCT TATGACACCG CGGCTCAGCT ATTGAAAGGA
AGCGATACAA ATGGCTGTTA TTCGCCATTG AGCCTGTATT ATGCCCTGGC GCTGGCTGCA
GCCGGAGCCG AAGACTCCAC CAGGGACGAG CTGCTAACCC TTTTGGGCTT TGAAGATGCG
GACAGCCTGT CGAAACAATG CGGGAACCTC TACCGCCTGC TCTATACCGA CAATAAGGTT
TCCAGGCTGA AGATCGCCAA TTCCCTGTGG CTGGCCGATG AAACTGACGG ACAGCAAATC
TCCTTTAAGG ACAGCTATAT CAAAAACGCC ACGGAGCATT TTTATACATC CATCTTTACC
GCCGATTTTG CCGACGAGAA TACCGGCAAG GCAATGGGCC GCTGGATCTC GGAGAACACT
AACGGCACCC TTGCCCCAGA ATTCAAGACA AACACCGAGC AAATCATGAG TATTCTCAAC
ACGGTTTACT TTTACGATCA ATGGACTGAC CGCTTCAATG CAGAAAAGAC CAAAGAGGAC
ACCTTCTATC TTCAAAGCGG TCCGGAAGTT GTCTGTGATT TTATGAATAT GAATTATTGG
TCACACGGTT TCAGCAAGGG CAACGGGTAT ACCCGTTCTT CGCTAGGCCT AAAAACAAGC
GGCAGCATGA TATTTATCCT GCCTGATGAA GGCGTTGCCG TCGCAGACCT GCTGTCTTCC
CCGCAAAAGC TGGAGAAAAT ATTTACGCAG GGCGAAGACA AAAACGGAAA GGTTGTCTGG
AGCGTCCCTA AATTCAAATA CGGATCCAGC TTCGACCTGG TTGATACGTT GAAAGCATTA
GGTATCACCT CAGCTTTTTC TTTGGACAGC GCAGACTTCT CCGCTCTGAC CAATGCCCCC
GCGTTTATCT CCGGGGTCAA ACAGGAAACT CATATTTCTA TTGACGAAAA CGGCGTGGAG
GCTTCCGCGT TTACCAAGAT CGACTATATG GGTGCCGCAC AGCCCAAGGA TAAAGCGGAA
ATGATACTCA ACCGCCCTTT CATCTACGGT ATTACGGCCG CCAACGGAGC ATTGCTGTTC
GTGGGCATAT GTATGAACCC TGCTTCCTGA
 
Protein sequence
MKRLIVMLLA TVMLFGLTAC SQPVKVTGTN LVAAPVYPKG IDFGDSDKQR ELRENNPVNK 
DTVNAVNQFS YDTAAQLLKG SDTNGCYSPL SLYYALALAA AGAEDSTRDE LLTLLGFEDA
DSLSKQCGNL YRLLYTDNKV SRLKIANSLW LADETDGQQI SFKDSYIKNA TEHFYTSIFT
ADFADENTGK AMGRWISENT NGTLAPEFKT NTEQIMSILN TVYFYDQWTD RFNAEKTKED
TFYLQSGPEV VCDFMNMNYW SHGFSKGNGY TRSSLGLKTS GSMIFILPDE GVAVADLLSS
PQKLEKIFTQ GEDKNGKVVW SVPKFKYGSS FDLVDTLKAL GITSAFSLDS ADFSALTNAP
AFISGVKQET HISIDENGVE ASAFTKIDYM GAAQPKDKAE MILNRPFIYG ITAANGALLF
VGICMNPAS