Gene Tmz1t_1124 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_1124 
Symbol 
ID7084653 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp1232219 
End bp1234069 
Gene Length1851 bp 
Protein Length616 aa 
Translation table11 
GC content68% 
IMG OID643698139 
ProductHeparinase II/III family protein 
Protein accessionYP_002354779 
Protein GI217969545 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.781044 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTGTCG CAATCAAGGC GCGGACGGCG CTGGCGCTCG GTGTGCCGAA CCTGGTGCGG 
GCGCTGTCGT ATCGCCTGGG GGTGAAGACG GGGCTCAACC CGGTGCGCCG CCTGCGCGGG
GCTGCGCCGG CGGGACCGTT CTTTGCCATG CCCGCGTGCG CTGCACGCCC TCCGAAGCCC
GAGCGCTTGC TCGTGGCGTC GCAGGCGTGG CAGACGAGCG CGCGGCTGTT CAGCCATTGG
CCGATCGCGG TGACGGACGA GCCGCCCGAC TGGCACAGCA ATCTGCTTAC AGGTAAGCAG
ATTGCCGAAC CCGATCGGGA GTGGTGGCGG ATCCCGGACT TCGATCCGGC AGTGGGCGAC
ATCAAGCAGA TCTGGGAGGC GTCGCGTTTC GACTGGGTGC TGGCCTTCGC GCAGCGTGCG
CGCAAGGGCG ACGCGGCCTC TTACGAGCGT CTTGAGCGCT GGCTTGCCGA CTGGAGCGCA
CGCAATCCGC CCTACTGCGG GCCGAACTGG AAGTGCGGGC AGGAAGCGTC GATGCGGGTG
ATGCATCTCG CCATGGCGGC GGTGATGCTG GGGCAGGTCG ATACGGCCAG CCCGGCGCTG
CTGGAGCTGG TGCGCCTGCA CCTGCAGAGG ATCGCGCCGA CGCTGCGCTA TGCGATGGCG
CAGGACAACA ACCACGGCAC GTCCGAGGCG GCGGCGCTGT TCATCGGCGG GAGCTGGCTG
TCGCGCAAGG GCGTGGCGGG TGGTGCGCGC TGGGCGGCGC TGGGGCGCAA ATGGCTGGAG
AACCGGGCTG CACGGCTAAT TTCGCTGGAT GGCACCTTCA GCCAGTATTC GGTGACCTAC
CACCGGGTGA TGCTCGACAC CTATTGCATG GTGGAGGTGT GGCGCAGGCG CCTGGCCCTG
ACGGCGTTTT CGGGGCAGAT GGCGGTGTTC ATGCAGGCGG CGACGCGCTG GCTGTATGTG
ATGACCCGCC CCGAGAACGG CGATGCGCCC AACCTGGGGG CCAACGACGG TGCCCGCCTG
CTGCAACTGA CGGATACCGA CTACCGCGAT TTTCGGCCGA GCGTGCAGCT TGCTGCGGCG
CTGTTCTGGG GCATGCGCGC CTACGCCGAG GACGGCGCCT GGAACCAGCC GCTGCGCTGG
CTCGGGGTGG ACGTGCCGGA GCGCGTCTTT ACGCCCGGCT GGCTGGAGCT TTTCGACGAC
GGCGGTTTCG GAGTGTTGCG CCGTGGCGAT GTGCTCGCGG TGCTGCGTTA TCCGCTGTTC
CGCTTCCGGC CCAGCCAGGC GGATGCGCTT CACCTCGACC TGTGGGTGGG TGGGCGCAAC
CTTCTGCGCG ACGGTGGCAG CTACAGCTAC AACACCGAGC CGAAGTGGTT GAACTACTTC
GGCGGGACAG AAAGCCACAA CACGGTGCAG TTTGACGATC GCGACCAGAT GCCGCGCTTG
AGCCGCTTCT TGTTCGGCGA CTGGCTGCGG ACGATCGAGC ACGAAGGGCG CAGGGAGGCT
GCCGATGCGC TGACCTATGG CGCGGCTTAC CGCGACCGCC AGGGCGCCTA TCACCGTCGC
CGGGTCGCGT TGGCAGGCGA TGCCTTGCAT GTGCAGGACC GCATCGAAGG CTTTGCGCGC
AAGGCCGTGC TGCGCTGGCG GCTTGAGCCG GGCGAGTGGC AGGTGGAGGG CCATTGCGTG
CGCAACGGCG CGCATACGCT GCGCGTGCAG GCCGACGTGC CGCTGGTCCG CTTCGAGCTG
ACGACCGGCT GGGAGTCGCG CTACTACCTC GAGAAAACCG AACTACCGGT GCTGGAGATC
GAAGTCGATC AGCCCGGTAC ATTGATGAGC GAATATCGCT GGGCACCATG A
 
Protein sequence
MSVAIKARTA LALGVPNLVR ALSYRLGVKT GLNPVRRLRG AAPAGPFFAM PACAARPPKP 
ERLLVASQAW QTSARLFSHW PIAVTDEPPD WHSNLLTGKQ IAEPDREWWR IPDFDPAVGD
IKQIWEASRF DWVLAFAQRA RKGDAASYER LERWLADWSA RNPPYCGPNW KCGQEASMRV
MHLAMAAVML GQVDTASPAL LELVRLHLQR IAPTLRYAMA QDNNHGTSEA AALFIGGSWL
SRKGVAGGAR WAALGRKWLE NRAARLISLD GTFSQYSVTY HRVMLDTYCM VEVWRRRLAL
TAFSGQMAVF MQAATRWLYV MTRPENGDAP NLGANDGARL LQLTDTDYRD FRPSVQLAAA
LFWGMRAYAE DGAWNQPLRW LGVDVPERVF TPGWLELFDD GGFGVLRRGD VLAVLRYPLF
RFRPSQADAL HLDLWVGGRN LLRDGGSYSY NTEPKWLNYF GGTESHNTVQ FDDRDQMPRL
SRFLFGDWLR TIEHEGRREA ADALTYGAAY RDRQGAYHRR RVALAGDALH VQDRIEGFAR
KAVLRWRLEP GEWQVEGHCV RNGAHTLRVQ ADVPLVRFEL TTGWESRYYL EKTELPVLEI
EVDQPGTLMS EYRWAP