Gene Tmz1t_2129 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_2129 
Symbol 
ID7085399 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp2403353 
End bp2405854 
Gene Length2502 bp 
Protein Length833 aa 
Translation table11 
GC content68% 
IMG OID643699148 
Productvon Willebrand factor type A 
Protein accessionYP_002355765 
Protein GI217970531 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATACAAC AACAATCGGC ACGACTCGAG ACGAGGGCGG GTGAGGCACT GACGCTGCAG 
GGCGTGCGGT TCACCGGCAC CTTGCGCGGC ACGCTCTTCG AAGCGGATCT CGAACAGCGC
TTTGCCAACC CCTTCGAGCG CCATGTCGAG CTCGTCTACA GCTTCCCGCT GCCCTGGGCG
GCGGTGCTCC TCGGGGTGGA GGTGCGGATC GGCGAGCGCC GCCTGTCCGG TGCGGTCATC
GAGAAGAGGC AAGCCGAGCA GGGCTACGAG GACGCGCTGG CCGAGGGCAA CACCACCATC
CTGCTCGAGC AGAACGTCGA TGGCAGCTAC ACGCTGAACC TGGGCAACCT CGCACCCGGG
GAGACCTGTG TGGTACGGCT GCGTTACGCC CAGGTGCTGC AGTTCGAGCA GCATGGCCTG
CGCCTGGTGG TGCCCACGGT GATCGCCCCG CGCTACGGCG ACCCGGTGGC CGATGCCGGC
CTGAAGCCAC ATCAGCTGGT CGAGCACGAC CTGATGGCCG TCTATCCGTT CGAGCTCACC
TTGCGCATCG AAGGCGAGCT CGCGCGGGCG CGCATCGGCT CGCCGAGCCA TCCGTTGTCG
ACGCGGCTCG AAGGAGAGGG GGAGACGGCG GCGATGCTCG TATCGCTCGG TCGCGGCGGG
GCACTCGATC GCGACTTCAT CCTGGTGCTC GACGAGGTTG CGCAGGATTC GCTTGCCGTG
TGCGCGCACG ACACCCTCGA TGAGGGCGCG GTGAATGTGC TGGCGAGCTT CTGCCCGCGT
GTGCCGGCGG CGGCCCATCC GCTCGCGGTG AAGATCCTGG TGGACTGCTC GGGCTCGATG
CAGGGCGACA GTATCGCCGC TGCGCGACGT GCGCTGCAGG CCATCGTCGC CGGCCTGCGC
GAGGGCGAGC GGTTCTCGCT GTCACGCTTC GGCAGCACTG TCGAGCATCG CTCGCGCGCG
CTGTGGCGCA CGAGCCCCGC TACCCGGCTG GCCGGGCAGC GCTGGGCGGC ACAGTTGCAA
GCCGACCTGG GCGGCACGGA GATGGAGAAG GCGCTGGACT CCACGCTGGC CCTGGCCGGA
GACGCATCGG TAAGCCCCGG TGCTGGGGAA GGTGCCGCGC CGGTCGATCT GCTCCTCATC
ACCGATGGCC AGATCCACGC CATCGACCGA ACGGTGGCGA AGGCGCGCGC GCTGGGTCAC
CGGGTGTTCG TCGTCGGTAT CGGCAGCGCC CCTGCCGAGG GTGTGCTGCG CCGCCTGGCC
GAGGAGAGCG GCGGGGCCTG TGACTTCGTC GCCCCCGGAG AGACTGTGGA GCCTGCCGTG
CTGCGCATGT TTGCGCGCCT GCGCTCGCAG CGCATGGCGT CCCTGGCGCT TGCGTGGCCG
GCTGGTGCGA AGCCCCTGTG GATGAGCGCG CTACCCGGCT CGGTGTTCGA TGGTGACGCG
GTGACCGTGT GGGCACGCTT CGCGCAGGTG CCGGATGGGA CGGTGCGTCT GATCGGCCGG
CGCACGCACG CCTCGGTGCC CGAGTCGCTG GGCGAAGCCT GCCTGACGGC TGCGGAGCAT
GATTCGGCGC TCAGCCGGAT GGCCGTGGCC GCGCAGATCG AAACCCTGCT CGCGACCGAG
GGCGCCCAAT CGCGTCAGGC GCTGGAACTG GCCGTCGCGT ATCAGCTGGT CAGCCCGCTG
ACGCATTTCC TGCTCGTCGA GACGCGTGCC GAAGCCGACA AGCCGGCGGA CATGCCCGAT
CTCGTGAAGG TGCCCTCGAT GCTGCCCGCC GGCTTTGGCG GGCTCGGCAG TCTCGACTTC
TGCATCGACC CCTGCATGGC GCCGCTCACT GTTAATGAGG CACCGGAGCA CTACGGCTCG
CCGGCTGCGG CAAGCGGTGC GTATCTCGAC TTCGACGATC CCTTCGATGC GCCCGTGGTG
CTGCGCTCGG GTCGACGTGC GGATCACGGC GATACGCCCA ACAAGGCTGG GACCTATGAC
ATCCCGGCTT TCCTGCGCCG AAGCTCGAAC CAGGACGCCG GGCAGCCCCC CCGGGACGAT
CCGCGTTACT GGTGTGCCGA ACCGCATTAC ACAGGTCTCA CTCCCCTCGG TCTGACACAG
TGGCTCCGCA GTCACCCGCA GGCCGAATGG CCGCAACGTT ATGCCGAGTT GCGCCGACTC
GGTGTCGGTA CGGCAGTGCT CGACTGGCTC GAGTTCGTGT TGGCTGAAGG GGAGGGTGAG
TCGCTGGTGG TCGCCTGCTT CGTTCAGGTG ATGGCGCAGC GCGACCTGTA CGAAGCGCTG
CTGTCGGACA CCGGCGCGCT TGGTAGGCTT AAGGCCTTGG CACAACGCGT GGCACCGGGC
GCGGCGCTGA AGGTCAGCCA GGATGACCCT GCGGCTGCGT CGATCCTTGC TCGCCTGCAG
GTGTTTGTAA GTACGCTGCG GGCAGAGCGC TGGCCCGACT GCGTCTTTGC GCTCCAGGAC
GGGGCGTCGG CGCTCGAACA GTCGGGTGTC GGCGTTGGAT AG
 
Protein sequence
MIQQQSARLE TRAGEALTLQ GVRFTGTLRG TLFEADLEQR FANPFERHVE LVYSFPLPWA 
AVLLGVEVRI GERRLSGAVI EKRQAEQGYE DALAEGNTTI LLEQNVDGSY TLNLGNLAPG
ETCVVRLRYA QVLQFEQHGL RLVVPTVIAP RYGDPVADAG LKPHQLVEHD LMAVYPFELT
LRIEGELARA RIGSPSHPLS TRLEGEGETA AMLVSLGRGG ALDRDFILVL DEVAQDSLAV
CAHDTLDEGA VNVLASFCPR VPAAAHPLAV KILVDCSGSM QGDSIAAARR ALQAIVAGLR
EGERFSLSRF GSTVEHRSRA LWRTSPATRL AGQRWAAQLQ ADLGGTEMEK ALDSTLALAG
DASVSPGAGE GAAPVDLLLI TDGQIHAIDR TVAKARALGH RVFVVGIGSA PAEGVLRRLA
EESGGACDFV APGETVEPAV LRMFARLRSQ RMASLALAWP AGAKPLWMSA LPGSVFDGDA
VTVWARFAQV PDGTVRLIGR RTHASVPESL GEACLTAAEH DSALSRMAVA AQIETLLATE
GAQSRQALEL AVAYQLVSPL THFLLVETRA EADKPADMPD LVKVPSMLPA GFGGLGSLDF
CIDPCMAPLT VNEAPEHYGS PAAASGAYLD FDDPFDAPVV LRSGRRADHG DTPNKAGTYD
IPAFLRRSSN QDAGQPPRDD PRYWCAEPHY TGLTPLGLTQ WLRSHPQAEW PQRYAELRRL
GVGTAVLDWL EFVLAEGEGE SLVVACFVQV MAQRDLYEAL LSDTGALGRL KALAQRVAPG
AALKVSQDDP AAASILARLQ VFVSTLRAER WPDCVFALQD GASALEQSGV GVG