Gene Tfu_1867 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTfu_1867 
Symbol 
ID3579546 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermobifida fusca YX 
KingdomBacteria 
Replicon accessionNC_007333 
Strand
Start bp2180004 
End bp2190893 
Gene Length10890 bp 
Protein Length3629 aa 
Translation table11 
GC content71% 
IMG OID637685559 
Productnon-ribosomal peptide synthase:amino acid adenylation 
Protein accessionYP_289923 
Protein GI72162266 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1020] Non-ribosomal peptide synthetase modules and related proteins 
TIGRFAM ID[TIGR01720] non-ribosomal peptide synthase domain TIGR01720
[TIGR01733] amino acid adenylation domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGGACA CCTCCACCAC TCGGCTGCCT CTCACCGACG CACAGGCCGG AGTCTGGTAC 
GCCCAGCAGA TCGTGCCGGA CAGCCCCGTG TTCAACGTCG GCCAATACAC CGACATCCCC
GCCGACCTCG ACGTCGACCG GTTCATCCGT GCCGTGGAGA GCGTCGTCCG GGACAGCGAA
ACGCTGCGGT CCCGCCCGGT GGCCCGGGGC GACACCGCCG TGCAGGAGAT CCGTGCCGAC
GGTGCGGGAG TGGTGGAGGT CCACGACCTC ACCGCGGAAG CCGACCCGCG GCACGCCGCC
GCCGCGTGGA TGCGCCGCGA CATGGCGCAC CCGGTCCGGT TCGACACGGA CGAGCCGCTC
GTCCGCTACG CCCTCCTGCG GGTAGGGGAA CGCCGCTGGT ACTGGTACCA GCGCTACCAC
CACATCCTCG TCGACGCCTA CGCGGTCACC CTGCTCGCCC GGCGGGTCGC CGACGTCTAC
ACCGCCCTGG GACGCGGCCA GGAACCGCCG CCGTCCCGGT TCGGCAGCCT CGCCGACATC
GTCGCCGACG AAACCGCCTA CGCCGCCTCC GAACAGTGCG CGGCCGACCG CGCCTACTGG
ACCGGACTGC TCGGCGACGG CTACCCCACA GCCCTGCTGT CCTCCCGCCC GCAGGCGCCC
TACGCCGGGG TGCTGCGGGC ACGGGCCGAC GTGGCCGCCG ACGTTCTCAC CGGGCTCACC
GAACTGGGGG AGCGCACCGG GGCCACCTGG GCGGACACCG TGATCGCCAG CTCTGCCGCC
TACCTGTCCC GGATGACCGG GCACCGCGAC GTAGTCCTCG GCATCCCGGT CATGGGGCGG
CTCGGCCGGG CCGCGCTGCG CACCCCCGCC ATGGTGGTCA ACGTGCTGCC GCTGCGGGTG
CACGTCCGCC CCGGGGACAC GGTGGAACAG GTTGTCGCCG CCACCGCGGC CGCGCTCCGC
GACCTGCGCG CCCACCAGCG CTACCGCGCT GAATGGCTGC GCCGCGACCT CGCCCTGGTC
GGCACGGACC GCCCGCTGTT CGGCCCTGAG ATCAACATCA AACTGTTCGA CTACGACCTG
TCCTTCGACG GGGTTTCCGC TACCACGGTG ACCCTGTCCG AGGGGCCCGT CGACGATCTG
GCTCTCTCGG TCTACCGCGC CCCGCACGGC GGCCTCACCC TGGAAGCCAA CGCCAACGAC
CGCCGCTACG ATCCCGCAGA CGTCCAAGCT CGACTCGCCG AGATCGTCCG CCTGCTCGAC
GCGGCCGCTG CGGCCCCGGC CCACACTCCT GTGGCACGCC TCGACTACAC CGGCGCCACC
GCGGCCGGGC CCGCTGCACC ACCGGTCGAC GAGCACCCCC CGCTCATCCC CAGCCTGCTC
GACCAGCTCG CTGCAGACGA CCCGGACGCT GTCGCCGTGG TTGCCGACGG CCGCAGTGTG
ACCCGGGCCG AGTTCCTCGA CCGCGTCGAC CGGCTGGCCC GGCTCCTGCG CGCACACGGG
GTCGGCCCGG AGCGGATCGT GGCGCTCGCC CTGCCCCGCA CCCTGGACGT GCTCGTCGCC
TTGTTCGCGG TGCTGCGCGC CGGCGGCGCC TACGTGTACC TGGACCCGGC CCACCCCGTG
GAGCGGCTCG CCGCCATCGT CGCCGACACC CGTCCGGTGG TGGCGGTGAC CGCACCGGAC
TTCGGCGCGC CCCTCCCGGA CTTCGGGGAC GCGCACCGCA TCGACCTCGC CGACCCCCAG
GTGCGCACCC GCCTCGCGGA AACCCCCACG ACCAGCGAAC CGCTGCCCCT GCCGCACCCC
GACAACGCGG CCTACCTCAT CTACACCTCC GGAACCACGG GGAAGCCTAA GGGCGTCGTC
GTCCCTCACC GGGCCCTCGC CAACCTGGTC GCCGCCCACC GCCACGTCCT GTTCGACGGT
ACGGCCGCGC AGCGCCTGCG GGTCGGCCAC ACCGGGTCGT TCGGTTTCGA CGCGTCCTGG
GACCAGTTGC TCGGCCTGCT CTACGGCCAC GAACTCCACC TGCTCGGCGA CGACTACATC
TACGACTACG CCCGCCTCGG CGCCTACATC TCCGCCCACC GCATCGACTA CCTCGACTTC
ACCCCCACCT ACCTGCGCGG CCTGCTCGAC TCCGGGCAGG TGTGGCACCT GCCGCACCTC
CTCAGCTTCG GTGGGGAAGC CTGCCCCGAG GACCTGTGGC GCCGCCTGCG CTCCCTCCCC
GCCACCCGCG CCGTCAACTG CTACGGGCCC ACCGAGAACA CGGTGGACGC GCTCGTGGCG
TCCGTGGCAG ACAGCGACAC CCCTACCGTG GGCCGACCCG TCCCCGGCGT GGCCGTCCGC
ATCCTCGACG ACGCGCTGCA GCCGGTGCCG GTCGGGGTGG CCGGCGAACT CTACCTGGCC
GGCGTCCAGG TCGCGCGAGG CTACCTGGGC CGCCCCGACC AGACCGCGGA CCGGTTCGTT
GCCGATCCGT ACGGTCCGCC CGGCAGCCGC ATGTACCGCA CCGGGGACCG GGTGCGGCAG
CGCGCTGACG GCCAGCTCGA ATACCTCGGC CGCGTCGACA CCCAGCTCCA GGTCCGCGGA
TTCCGCGTCG AAGTAGAGGA AATCGAAGCA GTCGCGGAAA CCCATCCCGC GGTGGCCCGC
TGCGCGGTCG CCGCCCACAC CGCTGCCTCC GGATCGGTGC GGCTCAGCGC CCACGTGGTG
CTCCACCAGG GGGTCACCCT CACCCCCGAC CAGCTCCGCG CCCACCTCGC CGAACACCTG
CCCGACGCTA TGGTGCCCGC GGCGGTCGTG TTCACCAGCG ACCTTCCGGT GACCCCCAAC
GGGAAACTGG ACCGGGCCGC CCTGCCCGAC CCGGGAGTGG AGAGCGCCGG CAGCCACGAT
GCGCCCGCCA CCCCCCGGCA GCAGACCTTG GCGGACATCT TCGCCTACGT CCTCGGCGTG
CCGACCGTGG GAGTCCACGA CGACTTCTTC CGGCTCGGCG GGGACAGCAT CACCGCGATC
CAACTGGTCA ACCGGGCCCG GGCCGCCGGA CTGGCGCTGC GGGTGCGCGA CGTCTTCGAC
CGGCCCACCG TGGCCCGGCT CGCCGCGGCC GCCACCCCCC TCACCGGGCA TTCTGCCGCT
GAGCCGGACG ACGATCCTGT CGGCGACCTG CAGCCCACCC CTCTCATGGC TGACCTGCTC
GACCAGGGCG TGCCCCCCGC GCGGTTCGCC CAGTCGCAGG TCCTGTGCAC CCCGCCCGGC
CTCAGCGAAG AGGTCCTCGC CGCCGCCCTG GGCGACCTCG TCCGCCACCA CGACGCGCTG
CGGCTGCACG CCACCGGCAC CGCCCTGCAG GTCGCCCCGC CTGACACGGT CCCCTCCGGT
CTGCTGCGCC GGGTGGATGC CGCCGAGTGG ACCGACACCG ACCTGGCCGC AGCAGTGGCC
CGGGAGAACG CGCGCGCCGC TGACCGAATC GACCTGGCCC AAGGCCGCAC CCTGGCCGCG
GTCTGGTTCG ACCGGGGTGC GCACCGCACC GGGCGGCTGC TGCTGCGCAT CCACCATTTC
GTGGTCGACG GGGTCTCCTG GCGGATCCTC GGCCCCGACC TGCGTGCCGC TGTGACAGCC
CGAGCCCAAG GACACCCCCC GGCCCTCGCC CCGGCCGGAA CCTCGCTGCG CCGTTGGACC
CGCCTGCTCG CCGCGGAAGC GCGCCGCGCC GCACGCGTCG CGGAACTCGA CTACTGGCAG
CGGACCGTCG ACCCCGCCAC CCACCAGCCG CTCGGCATGA GACCCCTCAA CGCAGGCGAC
ACGGTAGCTA CGCGCCGCAG CATGGAGTGG ACTGTCGACC CCGACCTGTC GGCCGCTGTG
CTCACCGACA CCGGTCCCGC CCTGGGCATG GGCGTTGACG AACTGCTGCT CACCGCCCTG
GCCGTGGCTG CCGCCCGGCT GCGGGCACGG CACGGCCACG CCGACAGCGG CGTGCTCGTC
GACGTCGAAG GACACGGACG CTACGAGCTG GCCGAACCCG CCGACACCAC CCGCACCGTC
GGCTGGTTCA CCAGCGCCCA CCCGGTCCGG TTGGCGCTCA CCCGCGACCA GGCCGCCGAC
GTCGGAGCGG CCCTCGCCCG CGTCAAGGAG ACCCTGCGCG CCGTACCCGG CGACGGCCTC
GGCTACGGCC TGCTGCGCCG CCTCAACCCC GAGACCGCGC CTGCTCTCGC GCATGCGGCC
CGCTCCGACA TCCTCTTCAA CTACCTGGGC CGGTTCGGCG CCTCCCACGA CGAGCCCTGG
CAGACCGCCC CAGAGGTGGC GGACCTGCTG CTCGACGAAG ACCCCGACCA GCCGCTCACC
TGCGGTTTGG AAGTCGGTAT CGCCGCCCGC GACACTGCCC GCGGCCCCCA GCTGGCGGTC
ACCTGGCGGT GGGCCGAAGG CGTCCACGAC ACCGCCGACG TGGCTTTCCT CGCCGAAGAG
TTCACCGCGG CGCTGCGCAG CCTCACCACC TACGCCGCCA CCCCCGGCGT GTGCACCCTC
ACCCCCTCCG ACGTGCCGCT CGTCGCCGTC GACCAGGAGC GCATCACCCG GATCGCCCGC
GACTGGGCGG AGCACGCCGA CGTCGCCAAC CCGCGGATCG TCGACCTGTG GCCGCCCACA
CCGCTCCAAG CCGGGCTAGT CTTCCACAGC CTCTACAGCG GCGGCCGCGA CGCCTACACC
ACCCAGTCGT GCACGGACAT TACCGGGCTG TTGGACGCGG ACCGGCTGCG GGATGCCGCT
GCTGCCCTCC TCGACCGGCA CCCGTCGCTG CGCGTCGGCT TCTGGAGCGA CGGCACCGAC
ACTGTCCAGT TCGTCCCCGC TGAGGTGGCG CTCCGCTGGC GCATCGTCGA CCTGTCCGGA
CAGGACGCGG CAGCCCAGGA GGCCTGCTGC GCGCAACTAC GCGCCGAAGA ACGGGACACC
CCTTTCGACC TCGCCCGTCC GCCCCTGATC CGCTTCGTCC TCATCCGCCT GGCCCCGGAC
CACCACCGGC TCGTGGTCAC CGACCACCAC ACCCTGCTCG ACGGCTGGTC CACCCCGCTC
TTCCTGCGGG AGCTGTTCAC CCTGTACGCC AACCCCACCA GCCCGCCGCC CACGGCCACC
TTCCGCGACT ACCTGGTGTG GCTGGCCGAC CGGGACCTGG CAGCCGCGGA CCGCGCCTGG
CGGGCGGAAC TCGCCGACCT GCCCGGACCG TCCCTGCTCG CCCCCGACGC CGACCCCTAC
GGCGGTGACG GAGCCCAGCA GGAACTCTTC GCTGAACTCA GTGAGGAGGA GACCGCCCGA
CTCACCGAGA CCGCCCGTTC CCTCGGCGTG ACCGTCGGAG TCCTCGTCCA AACAGCATGG
GGCCTGCTGC TGGCCGGGCT CACCGGGCGC GACGACGTCG TCTTCGGGGT CACCGTCTCC
GGGCGGCCCG CCGAACTCGA CGGGGTCGAC GACATCCTCG GCCTGTTCAT CAACACCATC
GCGGTACGGC TCCGCGCCCA CCCGGCCCGG AGCATCGCCG ACCTCCTCAT CGACCTGCAG
CGTCGACAGG CCGCGCTCGC CGAACATCAC CATGTCGGCC TCACCCGGCT GCAGGAGCTC
ACCGGTACCG CACCGCTCTT CGACACCCTC CTCGTCTTCG AGAACTTCCC GTACCGGGAC
GCGGTGGCCG ACGAAGAATA CGCGGGCGTG CGGCTGCGCG ACGTCGACGT CACCGACACC
ACCCACTATC CGGTCTCCGT CAACGTGTTT CCCGGACCGC GCCTGCAACT GCGGTTGTGC
CACCGCCCCG ACGCGGTCGA CTCCGCGCAG GCCCACGCGC TCCTGGACCG CTTCCGCGAC
CTCCTCACCC GCATCGCCAC GGATCCGGGG GCCCGCGTCG GCACCGTGGG CGTAGCCGCG
GAGGCAGAAC AGTCCCGCAT GCTCGGGGAG TGGAATGCGA CGGGGCGGCC GGTGACGGCG
GTGCGTCCGG ATCGGGCGGT GGCTGAGTGG GCGCGGCGTG TGCCGGGGGC GGTGGCGGTG
CGCTGTGGTG GGGCCGTGTG GTCGTATGCC CGGTTGGATG CTGAGGTGGA GCGACTGGCT
GGTCTGCTGG TTGCGGGTGG GGTGCGGCCT GGTCAGGTGG TGGCGGTGTT GCTGCCGCGG
GTTCCGGAGC TGGTGGCTGC GCTGTTGGCT GTGCAGCGTG TTGGTGCGGT CTATGTGCCG
TTGGATCCGG ATTTCCCGGC GGAGCGTCTC GCCTTCATGC TCACCGACTC GGGCGCCGTC
ACCGTGGTCA CCACTCGCGG ACTCGAGGCG GCAGTACCCG CGGGAGTGGG CCGAATCCTC
CTTGACGATA CAGCACCCGC TGCCCCCGAC ACTCCGGCAC CGGCCTGGGA CGGCCCGGAT
GGGGCGGCGT ACATCTTGTA CACGTCTGGG TCGACGGGGC GGCCTAAGGG TGTGGTGGTG
TCGCACCGTA ATCTGGCGAA CTTCCTCACC GATATGGCTG AGCGGGTGCC TATGGGGCCG
CAGGATTCTT GGCTTGCGGT GACCACGGTG AGTTTCGACA TTTCCGCGTT GGAGCTGTAC
CTGCCGCTCC TCGCCGGAGC GACGATCACC CTCGTGGACG CCGCCACGGT GCGCGATCCG
CGGGAACTCG CCGCGGTCAT GCGCGCCAGC CAGCCCACGA TCATGCAGGC CACCCCCACG
CTGTGGCAGA TGCTCGCCGA CGAGGATCCG GACGTGCTCA ACGGGTTGCG GATCTTCGTG
GGAGGCGAGG CGCTGCCCGT ACCGCTGGCG GACGTGCTCG CCAGCCGGGC AGCCGTCGTC
CACAACGTGT ATGGGCCGAC GGAGACGACG ATCTGGTCGA CCGCCGACCG GGTACGCAGC
GGCGCCCCCG TCACCATTGG CGTGCCGATG GCGAATACGC GGGTGTATGT GCTGGATGCG
GGGTTGCGTC TGGTGCCGCC AGGTGTGGCG GGGGAGCTGT ATATCGCTGG TGAGGGGGTG
GCGTGGGGCT ACCACGGCCG GTTCGATTTG ACTGCGCAGC GGTTTGTTGC TGATCCGTAT
GGTCCGCCCG GGTCTCGCAT GTATCGCACG GGGGATGTGG TGCGGTGGCG ATCGGATGGG
CGGTTGGATT TCCTGGGTCG TGCCGACTTT CAGGTCAAGA TCCGCGGTTT TCGCGTCGAA
CTCGGGGAGA TCGAAACCGC GCTCGCCCGT ATCGACGGGA TCTCCCAAGC AGTGGTGGTG
GCCCGTAACG ACAGCGGAAA CCACCAGCGG CTCGTCGCCT ACCTCGTCCC AGCCGGGGCA
GCGGCGCCCG GCACGGCCGA GATCCGGGAG AAGCTCGCCG CGGTTCTGCC CGCCTATATG
GTGCCGTCGG CGTTCGTGGT GGTGGACGAG TTCCCGCTGA CCGCCAACGG CAAGATCGAC
CGCAAAGCCC TCCCCGACCC CGCGCCCGCA GCAGGGGACG GCCCGGCCGG ACGGGACCCG
GTCACCGCCT ACGAGGAGAT CGTCTGCCAG GTCTTCGCGG CTGTCCTCGA CCGGTCCGAC
GTCACCGCGG ACGCTGACTT CTTCGCGCTC GGCGGCCACT CCCTGCTCTC ACTGCGTGTC
GTCGCCCGGC TCCGCGCCCT CCTCGGCGTC GACGTCGGGG TCCGCGACCT GTTCGAAGCA
CCTACCCCGG CCGCGCTTGC CGCCCGCCTC ACCACACAGA CCGGACGGCG GCCCGCCGTG
ACCCGCCGCG GCCCAGACGC CCCACCTGTC CTCTCCCACT TCCAGCGGCG GCTCTGGCTC
ATCGAGCAGG TCTACCAGAC CCGCGGCGCC TACAACGTGC CGCTCGCCGT CCACGTCTCC
GACCGGCTCG ACCTCGATGT TCTCCGCGCC GCAGTCCGCG ACCTCGTGGC CCGCCACGAG
GTGCTGCGCA CCCTCGTACG CAGCAGCGAC GACGGCCCCG ACCCGGTTCT CCTCGCCCCC
GAGGACGCCG CGGTCGACGT CGCGGAAGTT CAGGCCGCAG GGCCGGTGGC GGACCTGCTC
GCCGAACTCA CCGCCCAGCC CTTCGACCTC GCCACCCAGA TCCCCCTGCG AGTCCGCATG
ATCACCGGGG AACAGGTCGA CGGCTGTGTG CTGCTCCTCG TCTGCCACCA CATTGCCGCA
GACGAGTGGT CTTTCGCGCC GCTGCTGCGC GACCTGGACA CCGCCTACCG GGCGCGGGCC
GCGGGACGCG CCCCCGACTG GGAGCCGCTG CCCGCCCAGT ACAGCGACTA CGCGGCCACG
CTGCACGACT GGTTGGGCGA GGCCACCGAC CCCGCCAGTC CGCTGCGCCG CCAACTCGAC
TACTGGCAGC ACGCCCTGCA GGACCTGCCC GACGAACTGG ACCTGCCCAC CGACCGGCCC
CGGCCGGCGA CAGCAAGCCA CCGCGGCGGC CTAGCGCGTG CCGAACTCCC CCCTGAACTG
GTCGAAGCTG TGCGCCGCCT CGCCGCCCAG CACGGCGTCA CCGTCTTCAT GGTGGTCCAG
GCCGCTGTCG CCGTGCTCTT GCACCGTCTG GGCGCCGGCG ACGACATCCC GTTGGGCAGC
CCGGTCGCCG ACCGCGCCGA CGAAGCCGTC CACGACACTG TCGGCTTCTT CCTCAACACG
CTGGTGTTGC GCGTGAACCT GTCCGGCAAC CCTACTTTCG CCGACCTGCT GGACCGGGTC
CGTGCCGTCG ACCTGGAAGC GTTCGCCCGC GCCGACGCGC CGTTCGACGC GGTAGTCGAC
ACGGTGAAAC CGCCCCGGGC GGTCAGCCGC CACCCGCTGT TCCAGACGAT GGTCTCCTAC
CAGCGTCGCC CTTCGGATGT GGACCGCCTG TTCGGCGCGG CCACCCGGCT CGTTGAAGTC
CCGCTGGACA CCGCCAAATT CGACTTGGAG TTCGCGTTTA TCGAGGACGG GCACGGCGGA
GCCCACATCG CCCTCAACTA TGCGGCCGAC CTGTTCGACC ATGACAGCGC GGAACAGCTG
GTGGCGCGCC TCCGCACCGT GTTGGAGCAC GCCTGCGCGG ACCCGTGTCG TCCGGTTGCT
GGGGTGGAGG TGGTGTCGGG GGCAGAACGT CGCCGGTTGG TGTCTGAGTG GAATGCGACG
GGGCGGCCGG TGACGGCGGT GCGTCCGGAT CGGGCGGTGG CTGAGTGGGC GCGGCGTGTG
CCGGGGGCGG TGGCGGTGCG CTGTGGTGGG GCCGTGTGGT CGTATGCCCG GTTGGATGCT
GAGGTGGAGC GACTGGCTGG TCTGCTGGTT GCGGGTGGGG TGCGGCCTGG TCAGGTGGTG
GCGGTGTTGC TGCCGCGGGT TCCGGAGCTG GTGGCTGCGC TGTTGGCTGT GCAGCGTGTT
GGTGCGGTCT ATGTGCCGTT GGATCCGGAT TTCCCGGCGG AGCGTCTCGC CTTCATGCTC
ACCGACTCGG GCGCCGTCAC CGTGGTCACC ACAGCAACGC TGGAGCCGAC CCTGCCGCAG
GACACCGCAC GGATCTGCGT CGACGACCCG GACCTGCGTC CCGAACCCGG CACAGCAGTG
CCGTCGGCCA CCGTGGATGG GGCGGCGTAC ATCTTGTACA CGTCTGGGTC GACGGGGCGG
CCTAAGGGTG TGGTGGTGTC GCACCGTAAT CTGGCGAACT TCCTCACCGA TATGGCTGAG
CGGGTGCCTA TGGGGCCCGA GGATTCTTGG CTTGCGGTGA CCACGGTGAG TTTCGACATT
TCCGCGTTGG AGCTGTACCT GCCGCTCCTC GCCGGAGCCA CCGTGGTGCT CGCCGCTCCC
GACACTGTCC GCGACCCGGC CGCGCTAGCC GACCTCATCG CGGCCGAGCG TCCCACCGTC
ATGCAGGCCA CCCCCACGCT GTGGCAGATG CTCGCCGACA CCGCACCCCA CGCGCTGCAC
GGGCTGCGTG TCCTCGTCGG TGGGGAAGCG CTCCCCGCCA CCCTCGCCGA GACCCTGGCG
GAACGGGCGG TGGAGGTCAC CAACGTGTAT GGGCCGACGG AGACGACGAT CTGGTCGACC
GCCGACCGGG TACGCAGCGG CGCCCCCGTC ACCATTGGCG TGCCGATGGC GAATACGCGG
GTGTATGTGC TGGATGCGGG GTTGCGTCTG GTGCCGCCAG GTGTGGCGGG GGAGCTGTAT
ATCGCTGGTG AGGGGGTGGC GTGGGGCTAC CACGGCCGGT TCGATTTGAC TGCGCAGCGG
TTTGTTGCTG ATCCGTATGG TCCGCCCGGG TCTCGCATGT ATCGCACGGG GGATGTGGTG
CGGTGGCGAT CGGATGGGCG GTTGGATTTC CTGGGTCGTG CCGACTTTCA GGTCAAGATT
CGCGGTTTTC GCGTCGAACT CGGGGAGATC GAAACCGCGC TCGCCCGTAT CGACGGGATC
TCCCAAGCAG TGGTGGTGGC CCGTAACGAC AGCGGAAACC ACCAGCGGCT CGTCGGTTAT
GTGGTAGCCG AACGCCTGGT CACCCCCAGG GAGTTGCGGA CCGCACTCGC TGAGACCCTG
CCCGCCTACA TGGTGCCGTC GGCGTTCGTG GTGGTGGACG AGTTCCCGCT GACCGCCAAC
GGCAAGATCG ACCGCAAAGC CCTCCCCGAC CCCACGCCCA CGGCCGACAA CGCCGCGCCG
GTCCGCGAAC CCGCAACCGA GGCGGAGGCC GCGCTGTGCG CCGTCTACGC CGAAGTCCTC
GGCTTGGACA AGGTGGGAGC CGACGCCGAC TTCTTCGCGC TCGGCGGCGA CAGCGTCCTC
ACCCTGCGCC TCGTCCACAG GGCCCGCAGT GCCGGATGGG AGATCAGCGC CCGCCACGTC
TTCCGCCACC CCGTGGTCGC TGACCTCGCC GCGGTCGCCC AGCCCGTCAC CGAGGAGGGA
CCTGCCGCCC CCGCACCGGA GGAACCGCTG GTCTCCCTCG ACGCCAACCA GCTCGCCCAG
CTCGAATCAC TGTGGAGGAA ACGGCGTTGA
 
Protein sequence
MSDTSTTRLP LTDAQAGVWY AQQIVPDSPV FNVGQYTDIP ADLDVDRFIR AVESVVRDSE 
TLRSRPVARG DTAVQEIRAD GAGVVEVHDL TAEADPRHAA AAWMRRDMAH PVRFDTDEPL
VRYALLRVGE RRWYWYQRYH HILVDAYAVT LLARRVADVY TALGRGQEPP PSRFGSLADI
VADETAYAAS EQCAADRAYW TGLLGDGYPT ALLSSRPQAP YAGVLRARAD VAADVLTGLT
ELGERTGATW ADTVIASSAA YLSRMTGHRD VVLGIPVMGR LGRAALRTPA MVVNVLPLRV
HVRPGDTVEQ VVAATAAALR DLRAHQRYRA EWLRRDLALV GTDRPLFGPE INIKLFDYDL
SFDGVSATTV TLSEGPVDDL ALSVYRAPHG GLTLEANAND RRYDPADVQA RLAEIVRLLD
AAAAAPAHTP VARLDYTGAT AAGPAAPPVD EHPPLIPSLL DQLAADDPDA VAVVADGRSV
TRAEFLDRVD RLARLLRAHG VGPERIVALA LPRTLDVLVA LFAVLRAGGA YVYLDPAHPV
ERLAAIVADT RPVVAVTAPD FGAPLPDFGD AHRIDLADPQ VRTRLAETPT TSEPLPLPHP
DNAAYLIYTS GTTGKPKGVV VPHRALANLV AAHRHVLFDG TAAQRLRVGH TGSFGFDASW
DQLLGLLYGH ELHLLGDDYI YDYARLGAYI SAHRIDYLDF TPTYLRGLLD SGQVWHLPHL
LSFGGEACPE DLWRRLRSLP ATRAVNCYGP TENTVDALVA SVADSDTPTV GRPVPGVAVR
ILDDALQPVP VGVAGELYLA GVQVARGYLG RPDQTADRFV ADPYGPPGSR MYRTGDRVRQ
RADGQLEYLG RVDTQLQVRG FRVEVEEIEA VAETHPAVAR CAVAAHTAAS GSVRLSAHVV
LHQGVTLTPD QLRAHLAEHL PDAMVPAAVV FTSDLPVTPN GKLDRAALPD PGVESAGSHD
APATPRQQTL ADIFAYVLGV PTVGVHDDFF RLGGDSITAI QLVNRARAAG LALRVRDVFD
RPTVARLAAA ATPLTGHSAA EPDDDPVGDL QPTPLMADLL DQGVPPARFA QSQVLCTPPG
LSEEVLAAAL GDLVRHHDAL RLHATGTALQ VAPPDTVPSG LLRRVDAAEW TDTDLAAAVA
RENARAADRI DLAQGRTLAA VWFDRGAHRT GRLLLRIHHF VVDGVSWRIL GPDLRAAVTA
RAQGHPPALA PAGTSLRRWT RLLAAEARRA ARVAELDYWQ RTVDPATHQP LGMRPLNAGD
TVATRRSMEW TVDPDLSAAV LTDTGPALGM GVDELLLTAL AVAAARLRAR HGHADSGVLV
DVEGHGRYEL AEPADTTRTV GWFTSAHPVR LALTRDQAAD VGAALARVKE TLRAVPGDGL
GYGLLRRLNP ETAPALAHAA RSDILFNYLG RFGASHDEPW QTAPEVADLL LDEDPDQPLT
CGLEVGIAAR DTARGPQLAV TWRWAEGVHD TADVAFLAEE FTAALRSLTT YAATPGVCTL
TPSDVPLVAV DQERITRIAR DWAEHADVAN PRIVDLWPPT PLQAGLVFHS LYSGGRDAYT
TQSCTDITGL LDADRLRDAA AALLDRHPSL RVGFWSDGTD TVQFVPAEVA LRWRIVDLSG
QDAAAQEACC AQLRAEERDT PFDLARPPLI RFVLIRLAPD HHRLVVTDHH TLLDGWSTPL
FLRELFTLYA NPTSPPPTAT FRDYLVWLAD RDLAAADRAW RAELADLPGP SLLAPDADPY
GGDGAQQELF AELSEEETAR LTETARSLGV TVGVLVQTAW GLLLAGLTGR DDVVFGVTVS
GRPAELDGVD DILGLFINTI AVRLRAHPAR SIADLLIDLQ RRQAALAEHH HVGLTRLQEL
TGTAPLFDTL LVFENFPYRD AVADEEYAGV RLRDVDVTDT THYPVSVNVF PGPRLQLRLC
HRPDAVDSAQ AHALLDRFRD LLTRIATDPG ARVGTVGVAA EAEQSRMLGE WNATGRPVTA
VRPDRAVAEW ARRVPGAVAV RCGGAVWSYA RLDAEVERLA GLLVAGGVRP GQVVAVLLPR
VPELVAALLA VQRVGAVYVP LDPDFPAERL AFMLTDSGAV TVVTTRGLEA AVPAGVGRIL
LDDTAPAAPD TPAPAWDGPD GAAYILYTSG STGRPKGVVV SHRNLANFLT DMAERVPMGP
QDSWLAVTTV SFDISALELY LPLLAGATIT LVDAATVRDP RELAAVMRAS QPTIMQATPT
LWQMLADEDP DVLNGLRIFV GGEALPVPLA DVLASRAAVV HNVYGPTETT IWSTADRVRS
GAPVTIGVPM ANTRVYVLDA GLRLVPPGVA GELYIAGEGV AWGYHGRFDL TAQRFVADPY
GPPGSRMYRT GDVVRWRSDG RLDFLGRADF QVKIRGFRVE LGEIETALAR IDGISQAVVV
ARNDSGNHQR LVAYLVPAGA AAPGTAEIRE KLAAVLPAYM VPSAFVVVDE FPLTANGKID
RKALPDPAPA AGDGPAGRDP VTAYEEIVCQ VFAAVLDRSD VTADADFFAL GGHSLLSLRV
VARLRALLGV DVGVRDLFEA PTPAALAARL TTQTGRRPAV TRRGPDAPPV LSHFQRRLWL
IEQVYQTRGA YNVPLAVHVS DRLDLDVLRA AVRDLVARHE VLRTLVRSSD DGPDPVLLAP
EDAAVDVAEV QAAGPVADLL AELTAQPFDL ATQIPLRVRM ITGEQVDGCV LLLVCHHIAA
DEWSFAPLLR DLDTAYRARA AGRAPDWEPL PAQYSDYAAT LHDWLGEATD PASPLRRQLD
YWQHALQDLP DELDLPTDRP RPATASHRGG LARAELPPEL VEAVRRLAAQ HGVTVFMVVQ
AAVAVLLHRL GAGDDIPLGS PVADRADEAV HDTVGFFLNT LVLRVNLSGN PTFADLLDRV
RAVDLEAFAR ADAPFDAVVD TVKPPRAVSR HPLFQTMVSY QRRPSDVDRL FGAATRLVEV
PLDTAKFDLE FAFIEDGHGG AHIALNYAAD LFDHDSAEQL VARLRTVLEH ACADPCRPVA
GVEVVSGAER RRLVSEWNAT GRPVTAVRPD RAVAEWARRV PGAVAVRCGG AVWSYARLDA
EVERLAGLLV AGGVRPGQVV AVLLPRVPEL VAALLAVQRV GAVYVPLDPD FPAERLAFML
TDSGAVTVVT TATLEPTLPQ DTARICVDDP DLRPEPGTAV PSATVDGAAY ILYTSGSTGR
PKGVVVSHRN LANFLTDMAE RVPMGPEDSW LAVTTVSFDI SALELYLPLL AGATVVLAAP
DTVRDPAALA DLIAAERPTV MQATPTLWQM LADTAPHALH GLRVLVGGEA LPATLAETLA
ERAVEVTNVY GPTETTIWST ADRVRSGAPV TIGVPMANTR VYVLDAGLRL VPPGVAGELY
IAGEGVAWGY HGRFDLTAQR FVADPYGPPG SRMYRTGDVV RWRSDGRLDF LGRADFQVKI
RGFRVELGEI ETALARIDGI SQAVVVARND SGNHQRLVGY VVAERLVTPR ELRTALAETL
PAYMVPSAFV VVDEFPLTAN GKIDRKALPD PTPTADNAAP VREPATEAEA ALCAVYAEVL
GLDKVGADAD FFALGGDSVL TLRLVHRARS AGWEISARHV FRHPVVADLA AVAQPVTEEG
PAAPAPEEPL VSLDANQLAQ LESLWRKRR