Gene Tmz1t_2004 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_2004 
Symbol 
ID7083759 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp2264818 
End bp2266557 
Gene Length1740 bp 
Protein Length579 aa 
Translation table11 
GC content69% 
IMG OID643699029 
ProductTrkA-N domain protein 
Protein accessionYP_002355651 
Protein GI217970417 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0569] K+ transport systems, NAD-binding component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.88767 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCAGCAGG CCCTCGCCCG CCACCACAGC ATCTTCCTGC TGATCCTGCG TCGCCTGCGC 
GCACCGCTGA TCCTGCTGAT CGTGCTGTTC GCGATCGCGG TGCTCGGCCT CACCCTGGTG
CCCGGCCCGG TCGTCGACGG CGAGACGAGC TATCTCAGCT TCTTCCACGC CTTCTACTTC
ATCAGCTACA CCGCGACCAC GATCGGTTTC GGCGAGATTC CCTACACCTT CTCCGACCAG
CAGCGGCTGT GGGTGATCGT CAGCATCTAC CTGTCGGTGA TCGGCTGGGC CTATACGCTG
GGCTCGGTGT TCAGCCTGCT CGCCGACCGC AGCCTGCGCC AGGCGATCGC GATGCAGGGT
TTCGTGCGCG CGGTGCGGCG GCTGCGCGAG CCCTTCTACC TGGTGTGCGG CTACGGCGAG
ACCGGCCGCC TGATCTGCGA CGCGCTCGAC CGCATGGGCC TGCGCGTCGT GGTGATCGAG
GTCGACGAGA CCAAGCTCGG CGAGCTCGAT CTGCACAGCT ATTCGGCCGA CGTGCCCGCG
CTGTGCGCGG ACGCCGCAAA CCCCGAGACG CTCCAGTTCG GCGGCCTCAC CCACGCGAGC
TGCATCGGCG TCATCGCGCT CACCAACGAC GACGCCACCA ACCTGGCGAT CGCGATCGCC
GCACGCCTGC TGGCGCCGAA GGTGCCCGCA CTGTGCCGCG CCGAGCACAC CGCGACCTCG
GCCAACATGA CCTCCTTCGG CACCCGCCAC ATCCTCAACC CCTTCGAGCG CTTCAGCGAA
ACGCTCGCAC TCTCGCTGCA CGCGCCCAAG GCCTCGCAGC TCTTCGACTG GCTCACCGGC
CTGCCCGGCA GCCACGTCGA GCAGCGCCGC GATCCGCCGC GCGGCAACTG GATCGTCTGC
GGCCACGGGC GCTTCGGTCG CCTGCTGGTC GATGCGATGG ACTCCGAGGC AGTGCCGGTG
ACCATCATCG ACATCGACCC CAAGCCCGAC GGCATTCACC GCTGGGTGCA GGGCGACGGC
ACCGGTGCCG CATCCCTGCT CGAGGCCGGC GTGCGCGAGG CGACCGGGAT CGTGTGCGGC
ACCAGCTCGG ACGTGGACAA CCTTTCGATC GCGGTGACGG CGCGCGAGCT CAACCAGGAG
CTCTTCGTGA TCCTGCGCCA GAACCACGAA TCCAACCGCG CGCTCTTCGA GGCCTTCGAA
TCCGACATCA CCGTGGTCCC GAGCCGGGTG ATCGCGCACG AATGCATCGC GATCCTGAGC
ACACCGCTGC TCGCGCCCTT CCTGGCCGAA ATCCGCCGCC GTGACGAGGA GTGGTGCGGC
GCACTGCTGC ATCGCCTGAC CCGCCACCTC GGCTGGAGGG TTCCGCGGAT CCGCAGCCAG
CGCGTCAACC TGTCGAGCGC GCCGGCGCTG TATCGCCGCC TGATGCGCGG CGAGACGATC
ACGCTCGAAC GCCTGCTGCG CTCGCCCGCC GACCGCTCGA TGGCGCTCGA CTGCGCGGTG
CTCTACCTCG AGCGCGACGA CGACGACCAC CGGATGACGC CCGCCGCCGA CGAGAAGCTC
CGCCCCGGCG ACGAATTGCT CTTCGCCGGC ACCCGCCGCG CGCTCGAGGA TGTCGCCCTG
ATCTTCGCCA ACGAGCACAC CCTCGAATAC ATCCTCACCG GCCGCGACCT GCCCGGCGGA
CGGGTCTGGG AAATGCTCGC GCAGCGCAAG CACGGGAAAC GCTCGCCGCA GCTGCCCTGA
 
Protein sequence
MQQALARHHS IFLLILRRLR APLILLIVLF AIAVLGLTLV PGPVVDGETS YLSFFHAFYF 
ISYTATTIGF GEIPYTFSDQ QRLWVIVSIY LSVIGWAYTL GSVFSLLADR SLRQAIAMQG
FVRAVRRLRE PFYLVCGYGE TGRLICDALD RMGLRVVVIE VDETKLGELD LHSYSADVPA
LCADAANPET LQFGGLTHAS CIGVIALTND DATNLAIAIA ARLLAPKVPA LCRAEHTATS
ANMTSFGTRH ILNPFERFSE TLALSLHAPK ASQLFDWLTG LPGSHVEQRR DPPRGNWIVC
GHGRFGRLLV DAMDSEAVPV TIIDIDPKPD GIHRWVQGDG TGAASLLEAG VREATGIVCG
TSSDVDNLSI AVTARELNQE LFVILRQNHE SNRALFEAFE SDITVVPSRV IAHECIAILS
TPLLAPFLAE IRRRDEEWCG ALLHRLTRHL GWRVPRIRSQ RVNLSSAPAL YRRLMRGETI
TLERLLRSPA DRSMALDCAV LYLERDDDDH RMTPAADEKL RPGDELLFAG TRRALEDVAL
IFANEHTLEY ILTGRDLPGG RVWEMLAQRK HGKRSPQLP