Gene Dtox_3636 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_3636 
Symbol 
ID8430644 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp3828591 
End bp3829706 
Gene Length1116 bp 
Protein Length371 aa 
Translation table11 
GC content39% 
IMG OID645035866 
Productpeptidase M29 aminopeptidase II 
Protein accessionYP_003192971 
Protein GI258516749 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2309] Leucyl aminopeptidase (aminopeptidase T) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGTAGACC CAAGAATAAT TACACTGGCA AGAAACCTTG TTAACTATTC CTGCGACCTA 
CAGCCCGGAG AAAAAATCCT TATAGAAGCA ATAGGACTTG AATTACCGTT TGTTAAAGAA
TTGATAAAAG AAGTTTGCCG AGTCGGAGGA GTACCCTTTG TTACCATAAA GGACAACTCT
GTTAACAGAT CCCTGTTAAT AAATGCAAGT GAGGAACAAT TAAAAATGAT GGCTAAATAC
GAAGCAGCCA GAATGGAAGA CATGGACGCT TATATAGGCA TCCGGTCAGG CAACAACTCG
GCAGAATTAT CTGATGTACC ACAAGATAAG CTTGAGTTGT ACAATAAATA TTTTATTAAT
GAAGTTCATA TGAAAATCAG AGTTCCCAAG ACTAAATGGG TGGTATTAAG ATATCCCTCC
CCTTCCATGT CACAGCTGGC CAATACCAGT ATAGAAAACT TCGAGAATTT TTACTTTAAT
GTTTGTAACT TAGACTATGA AAAGATGTCG CAAGCAATGA CTCCTCTGGT AGAATTAATG
AACAAAACAG ATAAAGTGAG GATTGTGGGC ACAGGAACCG ACTTGACCTT TTCTATAAAA
TCTCTCCCGG CAATTAAGTG CGCCGGGAAA CGAAACATAC CGGATGGGGA GGTTTTTTCC
GCCCCGGTAA AAGACTCCGT AAACGGCTAT ATTACCTACA ACACACCTGC TCAATACCAG
GGTTTTACTT ATGAAAACAT CAGGCTTGAA TTCAAGGATG GGAAAATTGT TCAAGCTACA
GCTAATGATA CGGAAAAAAT AAATAAAGTC TTCGATACTG ACAAAGGGGC AAGATATGTA
GGTGAATTTG CTCTGGGTGT AAATCCTTAT ATCACAAAAC CAATGAAAGA TACCCTGTTT
GATGAGAAAA TTGCAGGCTC TATTCATTTT ACACCAGGAA GCGCTTATGA TGAATGTTTT
AACGGAAATA AATCAGCTAT CCACTGGGAT CTGGTATATA TTCAAACGCT GGAATACGGT
GGTGGGGAAA TATATTTTGA TGATGTTTTA ATAAGAAAGG ATGGAAGGTT TGTAATCCCT
CAATTGGAGG GGTTAAACCC AGAGAATTTA AAGTAA
 
Protein sequence
MVDPRIITLA RNLVNYSCDL QPGEKILIEA IGLELPFVKE LIKEVCRVGG VPFVTIKDNS 
VNRSLLINAS EEQLKMMAKY EAARMEDMDA YIGIRSGNNS AELSDVPQDK LELYNKYFIN
EVHMKIRVPK TKWVVLRYPS PSMSQLANTS IENFENFYFN VCNLDYEKMS QAMTPLVELM
NKTDKVRIVG TGTDLTFSIK SLPAIKCAGK RNIPDGEVFS APVKDSVNGY ITYNTPAQYQ
GFTYENIRLE FKDGKIVQAT ANDTEKINKV FDTDKGARYV GEFALGVNPY ITKPMKDTLF
DEKIAGSIHF TPGSAYDECF NGNKSAIHWD LVYIQTLEYG GGEIYFDDVL IRKDGRFVIP
QLEGLNPENL K