Gene Tmz1t_1866 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_1866 
Symbol 
ID7084289 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp2104897 
End bp2106789 
Gene Length1893 bp 
Protein Length630 aa 
Translation table11 
GC content72% 
IMG OID643698889 
Producthypothetical protein 
Protein accessionYP_002355514 
Protein GI217970280 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0316432 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGACA TCCTCAAGCG CTTCACCACC GACGCGAGCG CATTCACCGA ACTCAAGTTC 
GTGGACGCGA AGGGCACGGC AGCCGCGGTC GATCTCGCCG ACGCGACCCG GGGCGAGGCC
ACGATCTCCG CCGGCATGCT GGCCGCCCTC GCCGACCTGC AGCCGAAGGC CGCGCCCGAC
GAGCCGCTCA CCGAGCCGGT CTTCAGCAAG GCAACGAGCA CAGGCAGGGG AATGCTGGCG
CATGTCGAGC GGGTGCACGA CAGCTTCCTC AACGTGCGCC GCGACCTCCT CGAGCTGCGC
GCCAGGGCCA ACGCTCGCCA GAATCAGGCG AAATCGCTCA TCGACCGCGT GTGCGACTTC
CTCGACGACT ATCTCAAGCC GGAAGAGCTG CGCCGGCTCG GCGTCATCGA CGACGAGGCC
CAGGCACGTC CCTTCCGCCT GCTCGTCGCG CCCGATCTGC ACAACGCCGA GGGCGTGCTG
CTGCGCCCCC GACTCGACCT CGCCCTTCCC CGCCCGGGCG AGGACGACAT CGAAGGCGTC
AAGCACAACG ACGAATCGGC GCTCTTCCAT GCCGGTAAGG TCGTTGCCGA CGACATCCGC
CAGGTCGAGG CGCTGCGCGG CGCCCTGACC ACGCTGCGCC GCCGCCATCA GGAGGCGCTC
GAGGACCTGC GCACCCGCCT GGTCGCACTC GAGGCCGAGC TACCCGGCGA GCTTCGCCGG
CTCGACGTCC TGGAGCGCGA ACGCACCGAG ACCCTGGACG ACTACGCGGT TGCACAACGC
CTGCTGGCCG AACACTGGCG CGAGGTCGAA GCCGCCCACG CCGAGCGCCG GCGAATCATC
GAGGCTCATC AAGGCCTCTT CTACGTGAAG GTGCGCGAGA CCCCGCTCGG CCGCAGCCTG
CCCGACCCGC TCGAGCTGCG CCCGAGCAGC CCCGACGAGC TCGTGCCCGG CTGCGCAGGA
CGCGACACCG CGCTGCCGGC TGCGCTCGCG CCTTTCATCG AGGCGGTGTA CGACATCCCG
GCGGCCGATT GGGCACACCT GCGGCCACTC GGCCACCTGC TGCCGGGGCG CACGATCCTC
GCCGGCCTGG TCGAGATGCG CCGCCAGAAG CTCGCACTGC GCCTGAACCG CCCCCCGGAC
GCCGGCCTCG CCTTGCTGTC CGGCCTGGTG CAGCAGAACA GGGCCCTGGT GCGCGACATC
GCCGCGCGCC CCTTCAGCGC CGGCGCACTC GGCGAGTTGC AGCGCCAGGC AGGCGCGATC
CTCGCCCTCG ACGACCTGCT CGCGATCCCC TCGCCACAGC TGCGCGACCC GGCGCGTGCC
CTGCACCAGC GCCTGGATAC CGCCGCCGGC TGCCTGCTCG AGCGTCTGCG CACGATCTCT
CCCTCGATCC GGCTGAACTG GGCGAGCGCG GCCGAGGCTG ACCGGCTCGC CGTCGAGGCC
CCCGAGCGCT GGCCCGGCCT CGCCGAGGCG GAGGAGCGCG ACTTCAACGG CGTGCGCACG
CTGGTCGAGC TCGTCGCCTG GTGGTTCCGC CAGCTCGATG CCGACGCCTC CGCCGCCGCG
CACGGCGCGA TGCGCAACCT GGTGCGCGCC TGCCTGCTGC TCGCGGCCAG CGACGACCCG
CAACAGCTCG TGCAGGGGCG CCTGCAGAGC ATCCCGGGGC GCTTCCGCCT CGGCGAGGCG
CTGCGCCTGA AGCTCGACCG CGAGGCCGCG CCCGGCACCC TGCTCCAGCT CTTCGACGAC
GACCAGCGCG TGATCGCGAC CCTGCGTGTC GACGACCATG ACGACCAGGG CACCGTCGCC
TCGGTCGCCT CCATCCTCGA CCCCGAGCTC GAACGCAACC CCGGCGCCGT GCTCGCCACG
GGACTGCACA TCAGCGGAGT CGATCGAGGC TGA
 
Protein sequence
MSDILKRFTT DASAFTELKF VDAKGTAAAV DLADATRGEA TISAGMLAAL ADLQPKAAPD 
EPLTEPVFSK ATSTGRGMLA HVERVHDSFL NVRRDLLELR ARANARQNQA KSLIDRVCDF
LDDYLKPEEL RRLGVIDDEA QARPFRLLVA PDLHNAEGVL LRPRLDLALP RPGEDDIEGV
KHNDESALFH AGKVVADDIR QVEALRGALT TLRRRHQEAL EDLRTRLVAL EAELPGELRR
LDVLERERTE TLDDYAVAQR LLAEHWREVE AAHAERRRII EAHQGLFYVK VRETPLGRSL
PDPLELRPSS PDELVPGCAG RDTALPAALA PFIEAVYDIP AADWAHLRPL GHLLPGRTIL
AGLVEMRRQK LALRLNRPPD AGLALLSGLV QQNRALVRDI AARPFSAGAL GELQRQAGAI
LALDDLLAIP SPQLRDPARA LHQRLDTAAG CLLERLRTIS PSIRLNWASA AEADRLAVEA
PERWPGLAEA EERDFNGVRT LVELVAWWFR QLDADASAAA HGAMRNLVRA CLLLAASDDP
QQLVQGRLQS IPGRFRLGEA LRLKLDREAA PGTLLQLFDD DQRVIATLRV DDHDDQGTVA
SVASILDPEL ERNPGAVLAT GLHISGVDRG