Gene Dtox_4357 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_4357 
Symbol 
ID8431376 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp4533227 
End bp4534624 
Gene Length1398 bp 
Protein Length465 aa 
Translation table11 
GC content38% 
IMG OID645036550 
Productputative aminopeptidase 1 
Protein accessionYP_003193643 
Protein GI258517421 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1362] Aspartyl aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAAATA TTAATGGACT ATTCTATGAA CGCAAAAATG TATGGGAAAG AATAGATGAT 
GATGAAAAAG AAGCGGTATT GTCCTTCTGT GAAGACTATA AAATATTTTT AAACAAGGTG
AAAACAGAAA GGGAAGCGGT ACAATTTTGT TCAGAATTTG TGCAAGCAAA AGGTTTTAAA
GATATTTCAA CTATAGAGAA ATTAAAGCCC GGAGATAGTG TCTTTGTGGA AAAAAATCAG
AAAGCCTTAG TAATGGCGGT CATAGGCAGT AGACCAATAG TGGAGGGACT TAATTTGATT
GGCGCGCATA TTGATAGTCC ACGTTTAGAT TTGAAACCAC AAACTCTCTT TGAAAAGGAG
AATCTCGGTT TATTAAAAAC CCATTACTAC GGAGGCATAA AAAAATACCA GTGGACATCA
ATTCCGTTAT CCTTGCATGG AGTAATTATA AAAAGTGATG GCAGTAAGTT TGATTTTATT
ATCGGCGAAA AAGAGGATGA ATCTGTTTTC ACTATTACAG ATCTATTGCC CCACTTGGCT
AAAGAACAAA TGGAAAAGAA AATGTCCGAA GCGATTCCCG GAGAATCATT GAATATTCTT
TGCGGCAGCA ACCCAGTAAC AGACAAAAAT ATAAAAGAAA AAGTAAAGGC ATATATATTG
GAATACTTGT ATTCTAATTA TGGTATAATT GAGGAGGATT TTATAAGCGC TGAAATTGAG
GCAGTACCGG CTTGGGGGGC CAGGGATATA GGCTTTGACC GGAGTTTAAT AGGTTCTTAT
GGGCAAGATG ACAGGGTTTG TGCTTTTACC TCAATGAAAG CTTTAGTTGA TGCAGGAATT
CCTGAGACAA CTTCTTTGGT AATATTGGCT GATAAAGAGG AGATAGGGAG TAATGGTAAT
ACAGGTATGA TGAGCACATT GCTCGAAAAT TCTGTAGCTG AAATAGCGGC AAAATTAAAC
CCGGGTGATT GCAGCGAATT ACTATTAAGG CGAGTCCTTG CTAAATCAAA AGCTCTCTCG
GCAGACGTGA ATGCGGGATT GGATCCTAAT TATGAAGATG TCATGGAAAA AATGAATGCC
GCTAAACTGG GCTACGGTTT AGTGATAACA AAATACACGG GCTCAAGAGG AAAAAGTTCT
TCAAATGATG CAAATGCAGA ATTTATAGCC TACATAAGAA ATCTCTTAAA TAAAGAAAAT
ATTATCTGGC AAACAGGAGA ATTGGGCAAG ATTGATCAGG GAGGCGGTGG GACTATAGCC
TATCTTATGG CTGCATCGGG TATGGACGTG GTTGATTGTG GAGTAGCGCT TCTGGGTATG
CACTCCACTT TTGAGGTGGC TGCAAAGACA GATATTTATA TGGCTTATAA AGGATATAAG
GCATTTTTTA ATGCATAA
 
Protein sequence
MANINGLFYE RKNVWERIDD DEKEAVLSFC EDYKIFLNKV KTEREAVQFC SEFVQAKGFK 
DISTIEKLKP GDSVFVEKNQ KALVMAVIGS RPIVEGLNLI GAHIDSPRLD LKPQTLFEKE
NLGLLKTHYY GGIKKYQWTS IPLSLHGVII KSDGSKFDFI IGEKEDESVF TITDLLPHLA
KEQMEKKMSE AIPGESLNIL CGSNPVTDKN IKEKVKAYIL EYLYSNYGII EEDFISAEIE
AVPAWGARDI GFDRSLIGSY GQDDRVCAFT SMKALVDAGI PETTSLVILA DKEEIGSNGN
TGMMSTLLEN SVAEIAAKLN PGDCSELLLR RVLAKSKALS ADVNAGLDPN YEDVMEKMNA
AKLGYGLVIT KYTGSRGKSS SNDANAEFIA YIRNLLNKEN IIWQTGELGK IDQGGGGTIA
YLMAASGMDV VDCGVALLGM HSTFEVAAKT DIYMAYKGYK AFFNA