Gene Tmz1t_4020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_4020 
Symbol 
ID7873666 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp4418138 
End bp4419466 
Gene Length1329 bp 
Protein Length442 aa 
Translation table11 
GC content68% 
IMG OID643700957 
Producthypothetical protein 
Protein accessionYP_002890980 
Protein GI237654666 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.811457 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTACAT CGACATGGAG TGCTTCCGGC GCCGCAGCAT TCGCCGCTTG TGTGGCTGCT 
GCAGCGACAC CGGCCTTCGC CGCGGCTCCC GCTGACTGGG CAGCCATCCC GGTGCAGACG
GTGACGCTGT TCTATCCCGG GCAGGTCGAT TACGGCTGGC TGCGCAGCGC GGAGCACAAG
CGTGCCAATG CCAAGGTCAG GGAGGGCGAA GCCTGTCTGT CCTGCCACGA GGGCGAGGAG
GCCGAACTCG GCGCCACCCT CGTCAAGGGC GGGCGGCACG AACCGGTCCC GATCGCCGGC
AAGCTCGGCG CGGTCGCGCT CCAGGTCCAG GCCGCCCATG ACGACGCCAA CCTGTACCTG
CGCTTCCAGT GGAAGACGCA GATGGCGCGG GCCGGGCAGA TGCACGACTA CATGATGTAC
GACGGCGAGA AGTGGGCCTT CATCGGCGGG CCGCGCTCGA AGGAAGCCGT GCGCAGCGGC
GCCCAGCCGC CGCTCTACGA GGATCGCCTG TCGGTGATGA TCGACGACGG CAAGGTGCCG
ATGTTCGCCA ACCAGGGCTG CTGGCTGACC TGCCACACCG GCATGCGCGA CATGCCCGGA
GAGCCGACCA AGGAACAGGT CCAGGCCCAT CCGCTGATCG GTCAGACGCA CAAGGAAAGC
GACGTGCGCA AGTATCTGCC GGCCACGCGC ACGGACGAGG CGGCGAGCTG GGACAAGACC
CGCGCGCCGG AGGAGATCGC CCGCCTCAAG GAGGCGGGCG CCTTCGTCGA GCTGATGCAG
TGGCGCGGTC ATCGCAGCAA TCCGGTGGGC ATGGCCGACG ATGGCTACGT GCTCGACTAT
CGCCTCGTCG ACGCCGGCAA GGGCCCGTTC GGCTGGAACG TCGACCGCAA GACCATGACG
CCGAAGTTCA TGTTCGACCC TGCGAAAGTC GGTGTGAAGG CGCTTGCGCT CGCGGATGTC
GGCAACGCGT CGAAGCCGCA CGCGCTGATC CGGGAAGACA ACGCCGTGGC CTACGATCCG
GCCGCGGGCT GGAAGAAGGG CGACGTCCTT CCCGGGCGCC TGCTCTCACG CGCCGACGCA
AGCGGTTCGG CGGCGGACAA TGCCGACGTC CGCGGCGAGT GGGCGGATGG CCAGTGGACG
GTGCTGTGGA CGCGCAAGCT CGACACCGGG CATGCCGACG ACGACAAGGC CCTGAAGCCG
GGCGGCGTCG TCAACGTGGG CTTTGCCGTT CATGACGACA ACGTCACAAC GCGTTTCCAT
CATGTGTCCT TCCCGCTGAC CCTGGGGATC GGCACGAAAG CCACCATCTC CTCGGTCGCG
CTCGAGTGA
 
Protein sequence
MTTSTWSASG AAAFAACVAA AATPAFAAAP ADWAAIPVQT VTLFYPGQVD YGWLRSAEHK 
RANAKVREGE ACLSCHEGEE AELGATLVKG GRHEPVPIAG KLGAVALQVQ AAHDDANLYL
RFQWKTQMAR AGQMHDYMMY DGEKWAFIGG PRSKEAVRSG AQPPLYEDRL SVMIDDGKVP
MFANQGCWLT CHTGMRDMPG EPTKEQVQAH PLIGQTHKES DVRKYLPATR TDEAASWDKT
RAPEEIARLK EAGAFVELMQ WRGHRSNPVG MADDGYVLDY RLVDAGKGPF GWNVDRKTMT
PKFMFDPAKV GVKALALADV GNASKPHALI REDNAVAYDP AAGWKKGDVL PGRLLSRADA
SGSAADNADV RGEWADGQWT VLWTRKLDTG HADDDKALKP GGVVNVGFAV HDDNVTTRFH
HVSFPLTLGI GTKATISSVA LE