Gene Dtox_1398 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_1398 
Symbol 
ID8428349 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp1431373 
End bp1432455 
Gene Length1083 bp 
Protein Length360 aa 
Translation table11 
GC content39% 
IMG OID645033735 
Productputative polyglutamate synthase CapA 
Protein accessionYP_003190897 
Protein GI258514675 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG2843] Putative enzyme of poly-gamma-glutamate biosynthesis (capsule formation) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTAATGT TATATTTGCA ATTTTTTTTA GCTATGACAG TATCATCAAT AGACGGTTCA 
GGGACATTGG TCTTATCGAA TGCAATTTTA GAAAACACAC TTCAGAATGC ACAGATAAGC
GATACTGAGA AAATGGCCAA TAATAAGGCA ACTGAAATAA GCAATACTGA AATAACTATT
ACTTCCATTG GTGATAACAC AATAGGCTAT GATACTGCAT TCGGTTACTC CGGTTCATTT
ATTCAAGAAG TTGATTATAA CGGTATAGAC TACCCTTATA AAAATGTAGC AGGCCTATTT
CTAAATGATG ACCTGACTAT TGCCAACTTG GAAACAACCT TAACTGATTC TAAGAACAGG
GCAGTTAAAA AATTTAAATT CGCGGGCAAG CCCGAATACC GCAATATGCT TGTAAACAAC
GGAATAGATG CCGTTAACCT GGCTAATAAT CATATCATGG ACTACCTGGA ACAAGGTTAT
CAGGATACAA TGGCGAATTT GAATCAAGCT GGTATTGGGT TCTTTGGAGA GGATATTAGG
CTTGTAAAAG ATATTAAAGG CGTGAAAGTT GGCATGCTTG GATACCAGGG TTGGTCAAAC
AATGCCGCCT TCAAAACCAG GGTCAGCAAG GATATAGCGG GGCTTAAAGA ACAAGCAAAA
ATAGTTGTCG TTAGTTTTCA CTGGGGTAAT GAAGGGGTAA ATTATCCTAA CAATGCACAG
ATGGATTTGG GGCGTTTTTC TATAGATTGT GGAGCTGATT TAGTGTTAGG CCACCACCCG
CATGTGATAC AGGGAATTGA AAACTATAAC GGCAAAAATA TTGTTTACAG TCTGGGCAAC
TTTGTTTTTG GCGGTAACAA AAATCCTGCT GATAAGGATA CCTTTATCTT TCAACAAAAA
TTTGTTGTCA GTCCTGAGGG TGAATTAACC CTAGCTGCAC CAAACATTAT CCCGGCTTCT
CTTTCTTCAG TCAACTGGCG GAATAATTAC CAACCTGTGC TTCTACAGGC CGAAGAGGCT
GAGAGAGTGC TGAACAGGCT GCGAATATAC AGCAGTGCTT TGGAATATGG TTTGAAGTTT
TAA
 
Protein sequence
MLMLYLQFFL AMTVSSIDGS GTLVLSNAIL ENTLQNAQIS DTEKMANNKA TEISNTEITI 
TSIGDNTIGY DTAFGYSGSF IQEVDYNGID YPYKNVAGLF LNDDLTIANL ETTLTDSKNR
AVKKFKFAGK PEYRNMLVNN GIDAVNLANN HIMDYLEQGY QDTMANLNQA GIGFFGEDIR
LVKDIKGVKV GMLGYQGWSN NAAFKTRVSK DIAGLKEQAK IVVVSFHWGN EGVNYPNNAQ
MDLGRFSIDC GADLVLGHHP HVIQGIENYN GKNIVYSLGN FVFGGNKNPA DKDTFIFQQK
FVVSPEGELT LAAPNIIPAS LSSVNWRNNY QPVLLQAEEA ERVLNRLRIY SSALEYGLKF