Gene Tbd_0355 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTbd_0355 
Symbol 
ID3672141 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThiobacillus denitrificans ATCC 25259 
KingdomBacteria 
Replicon accessionNC_007404 
Strand
Start bp375083 
End bp385501 
Gene Length10419 bp 
Protein Length3472 aa 
Translation table11 
GC content65% 
IMG OID637709016 
Productfilamentous haemagglutinin-like protein 
Protein accessionYP_314113 
Protein GI74316373 
COG category 
COG ID 
TIGRFAM ID[TIGR01901] filamentous haemagglutinin family N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.06862 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGCCGCG CGCTTCCCGC GTGGTTCGAC AAGGAGAATC TGATGAATCG CGACCACTAT 
CGGCTGGTGT TCAACCCCAC TTTAGGCATG ATGGTGCCGG TGGCGGAAAT GGCGCGCCGC
TCTGGTAAAT CAGGCCAGCG CAAGGCCGCG AGCGGGGCAG CGTTGGCGCT GGCGGGGGTG
TTGTCGGCGG GGCCGGTGCA GGCCGAGTTG CCAGTGGCAG CGGCGAACTT TGTCACGGCC
GGATCGGCAC TATTGCCGAC TCAGGTCGGT AGCCGGTTGC TGGTCAATCA GACCAGTGAC
AAGGCCGTAC TCAACTGGCA GAGCTTTAAC GTTAGCACCG GGCACGAAGT GCAGTTCCGT
CAGATCGATG GAGCGGGACA ACTGGTGCCA GGGGCGAATT TCACCTCGCT CAACCGTATA
TGGGACCTCA ACCCGAGTGT GATCGCCGGC AGCATCACGC AGGCAGTCGG GCAGAAGGCC
AATGTGATCC TGGTCAACAC CAACGGCATC GCCTTTATGG GCGGGTCGCA GGTGAACCTC
AACACCTTCA CGGCGAGCAC ACTGAACATT GCCGACAGCT TCATTCTGAA TTCGTTTCTC
GGCGACCCGA CCAAGCCGCA GTTCGAAGGT ACGACAGGGT TCATCAAGGT ATTCGAGGGC
GCGCAGATCA CCGCGGGCAG CCAGGGGCGC GTGATGCTGA TCGCGCCGAC CGTGGTCAAC
AAGGGCAAAG TGACGGCCCC GGATGGCCAG GTGATTGCGG CGGCTGGCGC CAAGGTCTAT
CTGCGCTCGG CCACGGGCGA CGACGACAAC GTGCGCGGTC TCCTGGTCGA AGTGGAGCGC
CCCGCCGGTT TGGCCGATGC CGATAGCGCC AATGCTGACG TCAAGGATGG GGTGCTCGAT
GGCCAGGCCG TCGACCTGAC GAACGCGGCG GAGGACAAAC TGGGTCACGT CACGAACCTG
GGCGAACTGA CCGCGCAGCG TGGCAACGTC ACCATGGTCG GATTCGCGGT CAACCAGCTC
GGCATTGCGC GCGCGACGAC GTCGGTTGTG GCCAACGGCT CGGTGTATCT GATGGCGAAG
GACGGCATCA CGACCAACGC CGGCCTGCTC ACGGCCACCA ATATCGGCGC GACGAGCGCG
GGTCGGGTCG TGTTGGGGGA AAACAGCCTG ACCGAAATCC TGCCGGACGT AGCCGACCCC
ACGACAGGAT TGGATGGCAC GACGGGCACT GGTCTGGCAA AGATATCGCA GGTGAAGATA
CTGGGTCGGG ATATACGCAT GGCGGGCGGC GCGACCATCG ACGCGCCTGC GGCCGCGGTC
GAATTCAGGG CGACCGATAA TCCCAGCGAT CCAACCGTTC TGAGGGATGC CGGCCAGAAT
GCCTCCGATA CTGCGCGGAT CCATATCGCG AGCGGCGCGC GTATCGACGT GGCCGGCCTG
GAGAACGTGG CCGTGTCCGT GGCGCGCAAT AGCGTCGAGG TGGAGTTGCG CGGCGACGAA
CTGAAGGACT CGCCGGTCAA TCAGCAAGGC CCGCTGCGCG GTGAGAAGGT GTATGTGGAT
ATCAATCGCG CGTTGGCCAA TTCCGACGCC GGCAAGGCGA CGCTGATTGC GCGGGACAGC
CTCGAAAGCT ATCAGGCCAA ACTGGAGCGG GGCGTCGCCG AGCGCTCGAC CGCCGGCGGC
ACCGTGCACG TCCTCTCGGA GGGCGAGACC ATCCTGGAGC ACGGCGCGGT ATTCGACCTG
TCCGGTGGCA GCGTGAAATA CACCGCCGGC ACTGTGAAAA CCACGCTGCT GACCACCAAC
GGTAAACCGG TTGACATCGC GGACGCCGGC GGCCAAACGC GGTACGACGG TATCGCCACG
CGGTTCGTGA AAAACTACGG ACGCTGGAAC GTCAAGGAAG TCATCGATCT CGGGCAGAGC
TATCGCTACG ATCCCGGTTA CGTCGAAGGA CGGGACGCCG GCGCGCTGAA CGTGGTGGGC
ATGAAGGCGG TGGTGATGCA GGCCGACGTC CAGGGGCGCA CGACGACCGG TGAATTGCAG
CGCGACGCGG GCGTGTCCCC GGCAGGTGCG CGCTTGGCCA TCGGTCGGGA TGTGGTGGAT
TTCAGCACCA ACCGCCACGA CTACAAGCTG AACCAGCGTG TCGAAATTGC CGACAACGGC
GCCACCCTGC CGGCCGATTT CAAGTTTGGC GACGAACTGG CGCAAACCCT GAAAGACACG
CTGGTGCTGA AACCCGCGCT GATGGGCAAG GACAACGTGG CGCAGCTGGA GATCTTCAGC
AACCAGGCGG CCGAAGTGCG CGATGCGTTG CGCATGCCGA CCGGCGGCAG CGCGGCGATT
ACCGCCCAGG GCGTCGCGGT CAATGCGGAT ATTCACACCG ACGCCGGCAA GATTGCACTT
ACTGCGGAAC GGAACTACTT CAACGTCGAC GCACCGTCGC TGGACGTGAC GGTGGGGGAC
GGCGTCAGCC TGTCCGCGCG CGGCGGCTGG GTGAACGATT TACCTGGTAC CGGCGAAAAA
TCCTTGGATG CCGTGGCGGT CGACGGCGGC TCGATCAGGC TGGCCGCCGC GGGCAATGTG
GTGTTGGGGG ACGATACGTT GCTTGATGTT GGGGGCGGCG TGCGCGTCAA GCCCGACGGC
AAGATCAAGG CCGGCAACGG TGGCGATGTG ACACTGGAAG CCGGCCAAGC CCTCCGATTG
GGGGGCGAGG TGCACGGCGA AGCACTCGGC AAGGGCGGGA CCTACACCCT CAAAACGAAG
AAAATCCAGA TCGGCGGCGC AACGGACGCC GACACGCTTA ATCTCGACGC GGCATTCTTT
GAGCGCGGCG GCTTCGCCAA CTTCAATCTG ACCGGATTCG ACGGTGTCGA TATCGCCGAC
GGTACGACGC TGCGCCCGAC GGTCGTGAGC CGCGAACTGC TTGCAGGCTA TACCCTGCAA
CCGACCGGCA GCGACATCAG CGCGTTCAGC CAGTTGTTGA AACAGGATGA CCGGGTGCGT
CAGGCCGCGA ACGTGAGTTT CGCTGCGTCC AATACCAACG CAAAAATTCG GTTGGGCGAA
AGCGCCAGGA TTCTGGCGGA TGACCGGGCA AAGGTCAGCT TCAGTGCGCA TACCCGCGTC
GAACTGCTGG GTGAGGTGCA AGCGGCGGGC GGTCGCGTGT CCGCCATTGC AACGCGGGGC
GCAAACGATC CGTTCGATGC GACTGCAGCC GTCTGGCTGG GGTCGAACGC CGTGCTGGAC
GTTTCGGGCG TGGCGCGCAC CTACACGGAC AGCCGTGGTT TGACGCAGGG CGTGGTACTT
GCGGGCGGTA CTGTCACGCT GGGCGGGAGT GTGGCCGCCG ACGGCGCTGT GAAAGGGGGC
CAGGCGTACG TGGTCACGCA GGCAGGATCG CGTATCGATG TTTCGGGCGC GGCACCGGTG
CGGCTGGACG TGCGGAATGA AGCCGGCTTG TTGGGCCGCG ACGTGGGCAG CGATGCAGGC
GCCGTGACGA TCCGGACGGT GGAGGGCGCG TTGCTCGATG GCATGATCGT CGCCAACGGC
GGCGGCCCGA CGAACCGTGG CGGGTCGTTC TCGCTCACCT TGCCCGCGAT AGACCAGACG
ACGCGCTTGA ATGCCGGCGC GCCCGATCAT ACGCGCGTGT TGGCCCTGGC GTCGACGACT
GCGCCTCAGG CCGCGGGACT CACGGCCGGG GACGCAATTC CGATCGAACG CAACGGCAAT
ACGCGGATCG GCGCGAAGGC TCTCGAGGCC GCCGGGTTTG ATCGCATGCA CTTCAGCAGC
GAGAGCGGGA CCATCCGCCT GGAGAACGGC CTGAACGTGG GGGCTGAACC GGACATTCCA
CGCGCAATCC CCCTCAGGGA ACTGACGCTC GACGCGCCCC GCATCGAAAC GGCAGGCGGC
GACGTCGCGC TCGCGGCCGA GACCGTTCGC ATGGGCAACT ACGGCGCATT TGGCGACACC
GCAGGCGCTG CGCCGACGGC GAAGGGCACG CTGACCGTCA ACGCGCGCCA GATCGAACTG
GCAGGCAAGC TCGCGCTCGA GGGGATGGCG CAGGCCGCGC TGAACGGCAA GGACGAAATC
CGCCTGCTCG GGGGCATACC TGAAACCGTG ACCGTCGACG GTCGCGTATT CGACAAGCGG
CCGTATGGCG AACTCAAGAC CACCGCCAAT ATGGTATTCC ACGGGGCCGT GGTCGCGCCC
GCCAGCTACG TGCAATACCA AATACTTGCG CCCGGCAAGA CCCTGCGCTT CGAGCAAGGC
GGCGATGCGC CGCGACAGCC CTGGTCAGCG TTCGGCAGCC TGACAGCGAC GGCAAAGGAT
ATCGTCCAGG GCGGCAATCT CTGGGCACCG CTCGGCAAGA TCGATTTGCA GGCCGAAAAC
ACGCTGACGT TCGAGAACGG AAGTTTGACC TCGGTGGCCG CCGACGCGAA CAGCGTACTT
CCGTTCGGCA AGCTGCAAAA CGGCCGTACC TGGACATTCG ACCTGGGCGC ACCCGACACC
TACAGCATCG AACTGGCGGA TGTCGCGGAG CAGAAATCCA TACTTGCCGC GGCTTCGAAG
GTGGACATGC AGGCGGGCGC ACGCGTCGAT CTCTCCGGCG GAGGAGACGC CCAGGCTTAC
GAATTCACTG TCGGCCCCGG CGGGTCGCGC GACATCCTCG CCGACAAGAA CACCTACGCC
ATTCTCCCTG GGTTCACGGG AGGGGTTGCG CCGACCGACG CGGAGGAAAA AATCGATCGC
GCGAGCGGCG AGGCGGTCTA TCTCGCAGGC GTGGCAGGAC TGAAGGACGG GGTCTATACG
CTGTTACCCG CGCATTACGC GCTACTGCCG GGGGCCTACG CTATCAAACT GGATACCGGC
ATCAAGGACG TGATGCCGGG GCAGGCCTAC AGCCGCCAGG ACAGCGTACG CATCGCCGCG
GGCTATGTGA CGGACACGCG CGTTGGCGCA CCTAAGGACG CGAACTGGCA GGGCATACAG
GTCATGACTC ACGATCAGGT GCGCGCGCGG TCGGAATTTA CCCTGACGCG TGCGTCCGAG
TTCTTCGCGG ACAGCCGCAG CCGTCCGCAG GATGCCGGAT TGCTGTCTGT GAGCGCCAGC
GGTAGTGGCA GCGACGCGCT GAAGCTGGGT GCCGTATACG ACCTCGCGGC GGGATCGGGT
GGGCGGGGTG CTCAGGTGGA TCTCAGTGCG CTCAAGCTCG CGGTGACCAG CGGCGCGCCC
GCCGGACTCG ATCCGGAGAC CGTCGTTCTC GATGCCGCCA AGCTCAACGA GCTGGGGGCG
GAAAGCGTCT TGATCGGCGG TACCCGGACT ACGAGCGGCG ACACCACCAC GCTGAGCGTG
GCCGCGGAGT CGTTGACGCT CGCGAACGAT GCCGATCATG CGCTGAAGGC CGGCGAAGTC
ATGCTGGCGG CGACGAACAC GCTTGCGCTT GAGTCAGGCA GTGCGATCGA TGCGCAGGGC
GCGTCGGGCG ATGCCGGCCG CTACGAAACC GGCGGTAACG GTGCATTCGT GCGCGTGGCG
TCGACTCGCG CCAGCTTCGC ACGCACCGGC AGCCCGGATC GCAGCCAGGG CACGTTGACC
GGCGCGTCGG ACAGCACATT GATTGCCACC GACTCGATTA CGCTCGACGC GACGAAGGAC
AATGCATTCA ACGGCAGGAC CCGGTTCGAG CGGCACAAAA CCGAGAACGG TGCCGAGGTT
ACAACTCCCG TCGCCGGCCA TCTGGCCGTG GGTGCCAGCC GAATCAATTT CGGAGATGCT
CCCGTGGGGA GCGACGGCCT CACCTACACG CAAGGCGAGC TGAACGCCAT CGACCTCGCC
GGACTGACTC TCATCAGTTA TACGACCTTT GATCTCCATG GCGATGTGAC CGTCGGCAAG
CTCGACGGGG GCAAACCTGT ACTGCAAAGT CTGACTCTGC AAGGTGCCGG TCTGGCAGGT
TTGAATAACG CAGGCAAGAC CGCACGACTC AACGCGAACC AACTGACGCT GGCCAACCCG
AACAACGCCG CATCGTTTGC CGCAGGCCGC GCGCTGGGCA GCGGCAATCT CGAGGTGAAA
GCCGATACGC TTACGCTGGG CTCCGGCGAC AAGCAGATCC AGGGATTTGG TGCGGTCACG
ATCACGGCCA ATGAACTCGT GGGCTCCGGA ACAGGCACGC TGGAGGTGGA CGCGCCTGTG
ATGCTGAACG TCGCCCGCAT CAGCGGCCAA CGCGGCGCCG ATCAGGCCTT CAACGCGACG
AACGCGCTGA TCGTGGCGCA ACACACTGCC GACCGCGTGT TGGCACCGGT GACTGCGCTG
GGCGCCAAGT GGGCGTTGCA AGGCTCCTCG CTCGATTTCG ACAGCCACGC CGGGCTGCCC
TCGGGCACAT TCAAGCTGAC AGCCACGGCC GGCGATATCG AGCTGGGCGA AAACGCAGAG
ATCGACGTAG CCGGACGCTC GATCCAGTTC TTCGACGTCA CCAAACCCAG TTGGGGCGGC
ACCGCGGAAT TCGTCAGCGA AACCGGAAAT GTGGATTTTG CGGATGGATC GAACGTGGAC
GTGTCCGCTG CCGCGGGCGG TGTCGCGGGC ACGCTGATCG TGCGCGCGGC GAATGGTACG
TTCACACTCG CCGATGGCAG CGGCAGCGGC AGCGTCAACG GCTCGGCGCC GGGGGACGGG
GGGGGCTTGC GCGGCGAGGG CGCCCGCGCC GACGTCGACG TGAAATCGCT CGACACAGTC
GACGATGCAG GGAACCCGGT TGCAAGTTTC TCGACGTTCA ACACCGCGTT CAACACCGGT
GGATTCGACG GGGGGCGTAG TCTCCGCGTG CGCAGCGGCG ACGTAACCAT TGCCGAGACC
GATAAGATCA AGGCCCTCGA TATCCGCATC GCTGCCGACG GGGGCAAGCT TGACGTGGCG
GGCGAACTCG ACGCGTCGGG CACGGATGCA GGCCGCATCG AATTGTTCGC GAAAGGTGAC
GTGAACGTGA CAGGCACCGC CAGCCTCTCG GCCAAGTCCC GCGGCGCGAA CGAGGACGGC
GGCGACCTCG AGATCGGTAC ACGTGAAGGC AACCTTGATT TGGCGGAAGG CAGCACCATC
GACGTTGTGG GTGGCGCCGG CGGCCAAGGC GGTACCGTGT TGCTGCGTGC GCCGCGAAGC
GGCGTCGACG TCAACGTGAC TGCGCTCAAG AGCGCGGGTC TGGCCGGCGC CCGCGCGGTG
TCCGTCGAGG CGTTCAGGGT ATACAACGAG GGCGAGATCA ACGAAATTGG AACGTTGGAC
AACGGCAACG TCAGCCTGGC GGCGGCGGAT CTCGATACGA TCAAGGCTGA CAACGCTGCG
TTCGCGGGCC ATCTCGATGC CGAGAGCAAC TATGTGGATC ACTACGCGGC GATCAGGGAC
CGCCTTGGTC AGCCGACGCT CCACGTTCTG GCGGGCGCGG AGGTGCACGC GACGGGCGAT
CTCACGCTTG GCGAGAACTG GAACCTGAAG GACATGCGGG ACAACGGCGA AGCGGGCGTC
CTCACTTTGC GTGCCGAGGG ACATCTCAAC ATCAATGGCA ATCTGTCGGA CGGCTTCAGC
GTTGCAACGC CCTGCGCCGC GGCTACGTGC GTGGGCCGCA GTCCGACGTC GGCGGCGTTG
CTGGGCGACG ATTCCTGGTC GTACCGTTTG ACCGCGGGCG CGGACCGGGC CGCTGCCGAT
CCTATGACGG TGAAGCCCGG CGACAGGGAC TTCACGCTCG CCGCCGGCAA ACTGATCCGC
ACCGGCAGTG GCGACATTCG GGTGGCGTCC GGGCACGACA TCAGGCTCGT CGGCACCGAG
GCAGTGATCT ACACGGCAGG TCGCAACACC GGGACATTGG GCGGCTTCAC GTCACCGACG
CCTGCGGCGT CGACCTATTT TTCGCACGGG GGCGGTGACG TCAGCCTGGC CGCCGCAGGC
AGCATCGTGG GCAGTCCGTC CGCGCAGCTA TTTAACAACT GGCTATTCCG CCAGGGCAGG
CTGAATGCCG ATGCGTCGGC CTATACAACA CAGCCCGCGT GGTGGGTCCG CTTCGACCTG
TTCCAGCAGG GCGTGGGTGC GCTCGGGGGC GGCGATGTCA TGCTCGTTGC CGGAGGTTCG
GTCCAGAACG TGTCGGCTCA TGCGCCGACG CAGGCGCGCC TAGCGTCCAG CACGCCGGAT
GCCGGCGCCT TGACGAAAAC CGGTGGGGGC ACCGTGCGCG TGGAGGCGGG CGGTGACGTG
CTGGGTGGCC AGTACTATGC AGACCGTGGT GACGTGGTAC TGAAGGCCGG CGGCGAGGTG
GGCAGCGGAC AAGTGGCCTT CGACAAGCCG ATATATACCC TGCTGGCGGT AGGCGACGGC
GAGGCACGCG TGAGCGCGAT GAAGGACGTT CACATCGGCA CGGTGTTCAA TCCCCATCTG
TTCAATCAGT CGAGATCGGC ATCGAACAGC AAATCGACGT TCAACGTGGC GGGTACGAGC
GGGCGCGACA CTGCCTTCTC GACGTATACG GACCAAAGCG GCGCAATTCT GCATAGCCTG
ACCGGCGCGG CAACGCTGCA TAGCACCGAT GCCATCAGCG GGAAACAATT GCACGACTCG
GTTTATCTGT ACGCCCTGAA CGACAAGTCG TCCATCGATG CCTACGGCCT TGATCTCCTC
CCGCCCAGCC TGAGCATGGT CGCCTTCCAG GGGGATGTGG CGGTCGAAGG CGGCACACGG
GTGATGCTGC CCGCCGCCGA TGCAACTCTG GAACTGCTCG CGCGCGACTC GGTTCGCCTG
CACCAGACGC TGGTGATGAG CGACCGCGAT CCCGCAAGCA TTCCAAGCCC GGCATTGCCG
GCCAATCCTC GGGTGGATTC CGTCAACAAC CCGAAAAACC TGGTCGCTGA GCTGAGCAGT
ATCGACAAAG CCGATCCCAC CCTCCACGCG GCCGTGCCCG TACATGCGGC CGACCCGCAG
CCCGTGCGCG TCCATGCGAT GGAAGGGGAT GTCGTCGGTA GGTCCGTCGC TTCGGGGGGC
GCCGCGACGA TTCTGGATGT CTCCAAGGCT TTTGATGTGC GGGCGGGTCG CGACGTGGTG
AACGTCAGCA TCGAGGCGCA GCACGTCAAC GCAGACAATG ACCGGAGCCG GGTGCAGGCG
GGCCGCGACA TCCTTTTTGC GACCGGCGCT GTACGTACCG AAGGCGACCA TATCCATGTG
GGGGGCCCGG GCGTGCTGGA CGTGATCGCA GGGCGCGACG TCGATTTGGG CACCTCGGGC
GGCATTCTGA GCCGGGGCAA TACGGTCAAT CCCGAACTTC CGGAGCTTGG TGCGGATATT
CGAGTCGCCG CAGGGGTCGG CGACCGCGGG ATCGACTACA GTGGCGCGGT CGACCGTTTG
CTGGCCAAAC TCGACGCGGG TGCGCCTGAC GACGCGACGC TGTGGCAGGC GCGCTGGATG
ACTGGGGACG ACAGTCTGAC CGCGGATTCC GCACGCGCTG CGGTTCAGGG CGTCAAGTCG
CAAGGCGTCT TGGTGTATGA AGAACGCGTA CGGGCCATGC TGTTCACCGC GCTGCGCGAA
ACCGGACGCG ACTTCAACGA CGCGGACAGC GACTACGCTG GCCACTATGC GCGCGGCTAT
GCGGCCCTTG AGCTGGTATT CCCCGGCATC GGCGACAAGT ATCCGGATGG CGAATTCAAG
AACTATCAGG GGGGCATCAA CCTGTTTGCC AGCCGGATTC AAACCCAAAG CGGCGGGAAT
ATCGAGTGGG TCGTGCCCGG TGGAGACATG GTCGTGGGGC TGGCGAACAC GCCGGAGGCG
CTCCTGAATC TTGAAGGCGG CGAGACCGGA AAACACGACG CCTTGGGTAT TGTCGCCGCG
AAGGAGGGCG ATATCCAGGG ATTCACGCGC GGCGACATGC TGGTCAACCA GTCGCGCATC
CTGACCATTG GAGGAGGCGA CGTGTTGTTG TGGTCCAGCG AGGGCGACAT CGACGCCGGC
AAGGGCAAAA AGACCGCGGT CACGGTGCCG CCTCCGCTCA TTCTGGTCGA CGGCAAAGGC
AACGTGACCC AAGTGTTGCA GGGCGCTGCG AGCGGCAGCG GTATTGGTGC GTTGCAACCG
TTGGGTGGGA CTGCGGGCGA CGTCAACCTG ATCGCCCCCA AAGGCACCGT CAACGCCGGC
GATGCCGGCA TCCGCGCCGG CAACCTCAAC ATCGCCGCGC AGGTGGTTTT GGGCGCAGAC
AACATCAGCG TCTCGGGCAA CTCGGCCGGC ACGCCGGTAG CGGACACCAG CGCGGTGACG
GCCGCCTCGT CCGGTGCGAG CAACGCGGGC GACGACGTGT CATCGACAAC CGCATCGCTG
TCGCAGAACC TCGCGGATGC GGCGCGCGCG GCAGAGCAAC TCAAGCAGGC GTTCAAGCCG
ACGTTCATTT CCGCGGAGGT GATCGGGCAC GGGGAGTGA
 
Protein sequence
MRRALPAWFD KENLMNRDHY RLVFNPTLGM MVPVAEMARR SGKSGQRKAA SGAALALAGV 
LSAGPVQAEL PVAAANFVTA GSALLPTQVG SRLLVNQTSD KAVLNWQSFN VSTGHEVQFR
QIDGAGQLVP GANFTSLNRI WDLNPSVIAG SITQAVGQKA NVILVNTNGI AFMGGSQVNL
NTFTASTLNI ADSFILNSFL GDPTKPQFEG TTGFIKVFEG AQITAGSQGR VMLIAPTVVN
KGKVTAPDGQ VIAAAGAKVY LRSATGDDDN VRGLLVEVER PAGLADADSA NADVKDGVLD
GQAVDLTNAA EDKLGHVTNL GELTAQRGNV TMVGFAVNQL GIARATTSVV ANGSVYLMAK
DGITTNAGLL TATNIGATSA GRVVLGENSL TEILPDVADP TTGLDGTTGT GLAKISQVKI
LGRDIRMAGG ATIDAPAAAV EFRATDNPSD PTVLRDAGQN ASDTARIHIA SGARIDVAGL
ENVAVSVARN SVEVELRGDE LKDSPVNQQG PLRGEKVYVD INRALANSDA GKATLIARDS
LESYQAKLER GVAERSTAGG TVHVLSEGET ILEHGAVFDL SGGSVKYTAG TVKTTLLTTN
GKPVDIADAG GQTRYDGIAT RFVKNYGRWN VKEVIDLGQS YRYDPGYVEG RDAGALNVVG
MKAVVMQADV QGRTTTGELQ RDAGVSPAGA RLAIGRDVVD FSTNRHDYKL NQRVEIADNG
ATLPADFKFG DELAQTLKDT LVLKPALMGK DNVAQLEIFS NQAAEVRDAL RMPTGGSAAI
TAQGVAVNAD IHTDAGKIAL TAERNYFNVD APSLDVTVGD GVSLSARGGW VNDLPGTGEK
SLDAVAVDGG SIRLAAAGNV VLGDDTLLDV GGGVRVKPDG KIKAGNGGDV TLEAGQALRL
GGEVHGEALG KGGTYTLKTK KIQIGGATDA DTLNLDAAFF ERGGFANFNL TGFDGVDIAD
GTTLRPTVVS RELLAGYTLQ PTGSDISAFS QLLKQDDRVR QAANVSFAAS NTNAKIRLGE
SARILADDRA KVSFSAHTRV ELLGEVQAAG GRVSAIATRG ANDPFDATAA VWLGSNAVLD
VSGVARTYTD SRGLTQGVVL AGGTVTLGGS VAADGAVKGG QAYVVTQAGS RIDVSGAAPV
RLDVRNEAGL LGRDVGSDAG AVTIRTVEGA LLDGMIVANG GGPTNRGGSF SLTLPAIDQT
TRLNAGAPDH TRVLALASTT APQAAGLTAG DAIPIERNGN TRIGAKALEA AGFDRMHFSS
ESGTIRLENG LNVGAEPDIP RAIPLRELTL DAPRIETAGG DVALAAETVR MGNYGAFGDT
AGAAPTAKGT LTVNARQIEL AGKLALEGMA QAALNGKDEI RLLGGIPETV TVDGRVFDKR
PYGELKTTAN MVFHGAVVAP ASYVQYQILA PGKTLRFEQG GDAPRQPWSA FGSLTATAKD
IVQGGNLWAP LGKIDLQAEN TLTFENGSLT SVAADANSVL PFGKLQNGRT WTFDLGAPDT
YSIELADVAE QKSILAAASK VDMQAGARVD LSGGGDAQAY EFTVGPGGSR DILADKNTYA
ILPGFTGGVA PTDAEEKIDR ASGEAVYLAG VAGLKDGVYT LLPAHYALLP GAYAIKLDTG
IKDVMPGQAY SRQDSVRIAA GYVTDTRVGA PKDANWQGIQ VMTHDQVRAR SEFTLTRASE
FFADSRSRPQ DAGLLSVSAS GSGSDALKLG AVYDLAAGSG GRGAQVDLSA LKLAVTSGAP
AGLDPETVVL DAAKLNELGA ESVLIGGTRT TSGDTTTLSV AAESLTLAND ADHALKAGEV
MLAATNTLAL ESGSAIDAQG ASGDAGRYET GGNGAFVRVA STRASFARTG SPDRSQGTLT
GASDSTLIAT DSITLDATKD NAFNGRTRFE RHKTENGAEV TTPVAGHLAV GASRINFGDA
PVGSDGLTYT QGELNAIDLA GLTLISYTTF DLHGDVTVGK LDGGKPVLQS LTLQGAGLAG
LNNAGKTARL NANQLTLANP NNAASFAAGR ALGSGNLEVK ADTLTLGSGD KQIQGFGAVT
ITANELVGSG TGTLEVDAPV MLNVARISGQ RGADQAFNAT NALIVAQHTA DRVLAPVTAL
GAKWALQGSS LDFDSHAGLP SGTFKLTATA GDIELGENAE IDVAGRSIQF FDVTKPSWGG
TAEFVSETGN VDFADGSNVD VSAAAGGVAG TLIVRAANGT FTLADGSGSG SVNGSAPGDG
GGLRGEGARA DVDVKSLDTV DDAGNPVASF STFNTAFNTG GFDGGRSLRV RSGDVTIAET
DKIKALDIRI AADGGKLDVA GELDASGTDA GRIELFAKGD VNVTGTASLS AKSRGANEDG
GDLEIGTREG NLDLAEGSTI DVVGGAGGQG GTVLLRAPRS GVDVNVTALK SAGLAGARAV
SVEAFRVYNE GEINEIGTLD NGNVSLAAAD LDTIKADNAA FAGHLDAESN YVDHYAAIRD
RLGQPTLHVL AGAEVHATGD LTLGENWNLK DMRDNGEAGV LTLRAEGHLN INGNLSDGFS
VATPCAAATC VGRSPTSAAL LGDDSWSYRL TAGADRAAAD PMTVKPGDRD FTLAAGKLIR
TGSGDIRVAS GHDIRLVGTE AVIYTAGRNT GTLGGFTSPT PAASTYFSHG GGDVSLAAAG
SIVGSPSAQL FNNWLFRQGR LNADASAYTT QPAWWVRFDL FQQGVGALGG GDVMLVAGGS
VQNVSAHAPT QARLASSTPD AGALTKTGGG TVRVEAGGDV LGGQYYADRG DVVLKAGGEV
GSGQVAFDKP IYTLLAVGDG EARVSAMKDV HIGTVFNPHL FNQSRSASNS KSTFNVAGTS
GRDTAFSTYT DQSGAILHSL TGAATLHSTD AISGKQLHDS VYLYALNDKS SIDAYGLDLL
PPSLSMVAFQ GDVAVEGGTR VMLPAADATL ELLARDSVRL HQTLVMSDRD PASIPSPALP
ANPRVDSVNN PKNLVAELSS IDKADPTLHA AVPVHAADPQ PVRVHAMEGD VVGRSVASGG
AATILDVSKA FDVRAGRDVV NVSIEAQHVN ADNDRSRVQA GRDILFATGA VRTEGDHIHV
GGPGVLDVIA GRDVDLGTSG GILSRGNTVN PELPELGADI RVAAGVGDRG IDYSGAVDRL
LAKLDAGAPD DATLWQARWM TGDDSLTADS ARAAVQGVKS QGVLVYEERV RAMLFTALRE
TGRDFNDADS DYAGHYARGY AALELVFPGI GDKYPDGEFK NYQGGINLFA SRIQTQSGGN
IEWVVPGGDM VVGLANTPEA LLNLEGGETG KHDALGIVAA KEGDIQGFTR GDMLVNQSRI
LTIGGGDVLL WSSEGDIDAG KGKKTAVTVP PPLILVDGKG NVTQVLQGAA SGSGIGALQP
LGGTAGDVNL IAPKGTVNAG DAGIRAGNLN IAAQVVLGAD NISVSGNSAG TPVADTSAVT
AASSGASNAG DDVSSTTASL SQNLADAARA AEQLKQAFKP TFISAEVIGH GE