Gene Tmz1t_3548 details


Gene Information

Locus tag: Tmz1t_3548
Symbol:
ID: 7873054
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thauera sp. MZ1T
Kingdom: Bacteria
Replicon accession: NC_011662
Strand:
Start bp: 3888506
End bp: 3889537
Gene Length: 1032 bp
Protein Length: 343 aa
Translation table: 11
GC content: 73%
IMG OID: 643700489
Product: oligopeptide/dipeptide ABC transporter, ATPase subunit
Protein accession: YP_002890519
Protein GI: 237654205
COG category: [E] Amino acid transport and metabolism
              [P] Inorganic ion transport and metabolism
COG ID: [COG0444] ABC-type dipeptide/oligopeptide/nickel transport system, ATPase component
TIGRFAM ID: [TIGR01727] oligopeptide/dipeptide ABC transporter, ATP-binding protein, C-terminal domain
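
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the last codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 3888506
end_bp = 3889537

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is unspliced and ends with one stop codon,
# which does not contribute a residue to the protein.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)     # 1032
print(protein_length)  # 343
```

Both results agree with the Gene Length (1032 bp) and Protein Length (343 aa) fields.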


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.927154
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACGTGC CGCACATCGA CCGGATGAAC GCCGGTGGCG ACCTGCCGCT GCTGTCGGTA 
CGCAACCTGC GCGTGGAATT TCCCCGCCGG CGCGGGACGC TGGTGGCGCT CGACGACGTC
TCCTTCGACA TCGCGCGCGG CGAGGTGCTC GGCGTGGTGG GCGAGTCGGG TGCGGGCAAG
TCGCTCACCG GCGCGGCCAT CATCGGCCTG CTCGAGCCCC CCGGGCGTAT CGCCGGCGGC
GACATCCTGC TCGATGGCGA GGCCATCCAC GCGCTGCGCG GCGAGGCCAT GCGGCGCCTG
CGCGGACGCC GCATCGCGAT GATCTTCCAG GATCCGCTCA CCAGCCTGAA CCCGCTATAT
ACGGTGGGCG AGCAGCTGGT CGAGACCATG CTGACCCACC TCGACCTGAC GCCCGCCGCC
GCGCGCGAGC GCGCGCTCGC GCTCCTCGAC GAGGTCGGCA TCCCGGCGCC GGCGCAGCGC
ATCGACCATT ACCCCCACCA GTTCTCCGGC GGCATGCGCC AGCGCGTGGT GATCGCGCTC
GCCTTGTGCG CCGAGCCCGA GCTGATCATC GCCGACGAGC CCACCACCGC GCTCGACGTC
TCGGTGCAGG CGCAGATCAT CGCGCTGCTG CGTCGCCTGT GCCGCCAGCA CCGCACCGCG
GTGATGCTGA TCACCCACGA CATGGGCGTG ATCGCCGAGA CCGCCGACCG CGTCGCGGTG
ATGTACGCCG GGCGGGTGGT GGAGATCGGG CCGGTGGCCG AGGTGGTGCG CGCTCCGGCG
CATCCCTACA CCCGCGGCCT GATGGGCGCC ATTCCCGTGC TCGGCGCCGA GGTCGAGCGC
CTGGTGCAGA TCGACGGCGC GATGCCGCGC CTGGATGCGA TCCCGTCCGG CTGCGCCTTC
CATCCGCGCT GCACCGAGGC CAGCGCGCGC TGCCGCGTCG AGCGCCCGGA GTTGCTGCCG
GCCGGCGCCA CGCGCGCGGC GTGCTGGTTG TACGCGCCGG CGCGCGAGAA CACCGACGGA
GGCCCGCAGT GA
 
Protein sequence
MNVPHIDRMN AGGDLPLLSV RNLRVEFPRR RGTLVALDDV SFDIARGEVL GVVGESGAGK 
SLTGAAIIGL LEPPGRIAGG DILLDGEAIH ALRGEAMRRL RGRRIAMIFQ DPLTSLNPLY
TVGEQLVETM LTHLDLTPAA ARERALALLD EVGIPAPAQR IDHYPHQFSG GMRQRVVIAL
ALCAEPELII ADEPTTALDV SVQAQIIALL RRLCRQHRTA VMLITHDMGV IAETADRVAV
MYAGRVVEIG PVAEVVRAPA HPYTRGLMGA IPVLGAEVER LVQIDGAMPR LDAIPSGCAF
HPRCTEASAR CRVERPELLP AGATRAACWL YAPARENTDG GPQ
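
The gene and protein sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that layout, translate a nucleotide block, and compute a GC fraction. It assumes the amino acid assignments of translation table 11, which match the standard genetic code (table 11 differs in its permitted start codons). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical and not part of the record.

```python
from itertools import product

# Codon table built in the conventional T, C, A, G order.
# Assumption: table 11 amino acid assignments equal the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last rows of the gene sequence, copied from the record.
first_row = "ATGAACGTGC CGCACATCGA CCGGATGAAC GCCGGTGGCG ACCTGCCGCT GCTGTCGGTA"
last_row = "GGCCCGCAGT GA"

print(translate(first_row))  # MNVPHIDRMNAGGDLPLLSV
print(translate(last_row))   # GPQ
```

The two translated fragments match the beginning and end of the listed protein sequence. Passing the full gene sequence to `gc_fraction` would give a value comparable to the GC content field.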