Gene Tmz1t_0431 details


Gene Information

Locus tag: Tmz1t_0431
Symbol:
ID: 7084941
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thauera sp. MZ1T
Kingdom: Bacteria
Replicon accession: NC_011662
Strand:
Start bp: 493185
End bp: 494135
Gene Length: 951 bp
Protein Length: 316 aa
Translation table: 11
GC content: 67%
IMG OID: 643697463
Product: TRAP transporter solute receptor, TAXI family
Protein accession: YP_002354106
Protein GI: 217968872
COG category: [R] General function prediction only
COG ID: [COG2358] TRAP-type uncharacterized transport system, periplasmic component
TIGRFAM ID: [TIGR02122] TRAP transporter solute receptor, TAXI family
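
The coordinate and length fields in this record can be cross-checked against each other. The sketch below is a minimal consistency check, not part of the record itself; it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that contributes no residue. The variable names are illustrative.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 493185
end_bp = 494135
gene_length_bp = 951
protein_length_aa = 316

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A CDS is a whole number of codons; the last codon is assumed to be the stop
# codon, which does not add a residue to the protein.
codons, remainder = divmod(span_bp, 3)
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # True
print(remainder)                                 # 0
print(expected_protein_aa == protein_length_aa)  # True
```

Under those assumptions the three fields agree: 951 bp is 317 codons, of which 316 encode residues.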


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGAAGTCCC TGATCCGCAA GTGCGCCCTC GGCACCGTGT TCGTCGCCAT GACCACCGCG 
GCCTCGGCCG CCACCTTCGT GAACGTGCTG ACCGGCGGCA CCAGCGGGGT GTACTACCCG
CTCGGCGTCA CCCTGTCGCA GCTCTACGGC GAGATCATTC CCGACAGCAA GGTCCAGGTG
CAGGCCACCA AGGCCTCGGC GGAGAACCTC AACCTGCTGC AGGCGGGCCG TGGCGAGATC
GGCTTCTCGC TTGCCGACTC GGTGTCCGAC GCCTGGAAGG GCAACGCCGA TGCGGGCTTC
GCCAAGCCGC TCGACAAGCT GCGTGCGATC GCCTCGGTCT ACCCCAACTA CATCCAGATC
GTCGCCCTGG CCGACGCCGA CGTGAAGACG CTCGCCGACC TCAAGGGCAA GCGCATCTCG
GTCGGCGCGC CGCGCTCGGG AACGGAGATC AACGCGCGTG CCATCCTGAA GGCCGCGGGC
CTCTCCTACG CGGACTTCGC CAAGGTCGAA TACCTGCCCT TCGGCGAATC GGTCGAGCTG
ATGAAGAACC GTCAGATCGA CGTCACCCTG CAGTCGGCCG GCCTCGGCGT GGCGGCGCTG
CGCGACCTGT CGGCGGCGGT GAAGGTCAAC TTCGTGCCGG TGCCGGCCGA GGTGGTGGCC
AAGGTGGGCG ACCCCGCCTA CCGTGCGGCC GCGGTCCCGG CCAACACCTA CGAGGGCCAG
GCCGCCGAGG TGCCGACGGT GGCGATCAAC AACCTGCTCG TGACCAACGA CAAGGTCTCG
AACGAGGTCG CCTACCAGAT GACCAAGGGC CTCTTCGACA ACCTCGAGCG GCTGGGCAAC
TCGCACTCCG CCGGCCGCCA GATCAAGCTC GAGAAGGCGG TCGAAGGCCT GCCGATCCCG
CTCCATCCGG GTGCGGAGAA GTTCTATCGC GAGAAGGGCC TGATCCAGTA A
 
Protein sequence
MKSLIRKCAL GTVFVAMTTA ASAATFVNVL TGGTSGVYYP LGVTLSQLYG EIIPDSKVQV 
QATKASAENL NLLQAGRGEI GFSLADSVSD AWKGNADAGF AKPLDKLRAI ASVYPNYIQI
VALADADVKT LADLKGKRIS VGAPRSGTEI NARAILKAAG LSYADFAKVE YLPFGESVEL
MKNRQIDVTL QSAGLGVAAL RDLSAAVKVN FVPVPAEVVA KVGDPAYRAA AVPANTYEGQ
AAEVPTVAIN NLLVTNDKVS NEVAYQMTKG LFDNLERLGN SHSAGRQIKL EKAVEGLPIP
LHPGAEKFYR EKGLIQ
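
The sequences above are printed in blocks of ten characters, so they need the whitespace stripped before any analysis. The sketch below is one possible way to do that in plain Python, then compute GC content and translate the DNA. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical and not taken from any library. The codon assignments are those of the standard genetic code, which translation table 11 shares; table 11 differs in the start codons it permits, which does not matter here because the gene begins with ATG.

```python
from itertools import product

# Codon table in NCBI order (first base varies slowest, bases ordered T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_block):
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(seq_block.split()).upper()


def gc_fraction(dna):
    """Fraction of G and C bases in a DNA sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence in this record. The last line
# starts at position 901, so it is still in reading frame 1.
first_line = "ATGAAGTCCC TGATCCGCAA GTGCGCCCTC GGCACCGTGT TCGTCGCCAT GACCACCGCG"
last_line = "CTCCATCCGG GTGCGGAGAA GTTCTATCGC GAGAAGGGCC TGATCCAGTA A"

print(translate(first_line))  # MKSLIRKCALGTVFVAMTTA
print(translate(last_line))   # LHPGAEKFYREKGLIQ
```

Both translated fragments match the corresponding ends of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.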