Gene Sros_6558 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_6558 
Symbol 
ID8669867 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp7206626 
End bp7218947 
Gene Length12322 bp 
Protein Length4106 aa 
Translation table11 
GC content73% 
IMG OID 
Productnon-ribosomal peptide synthase 
Protein accessionYP_003342014 
Protein GI271967818 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00138815 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGGCGACGG TGCTTATCCC GGGTTCCTTC CTGCCATTGT CCGCAGCGCA GAGTGGAATG 
TGGCACGCTC AGCGGATCGC CCCTGAGGAC CCGATCCATA TCGCCCAGTA CATTGAGATC
TCGGGGCCGG TGGACCCGGT GCTCTTCGCC TCGGCCGTCC GCCTGGCGGC CCGGGAGGTC
GACGCGATCC ATGTCCGGAT CGTCGAGGGC GGGCAGATCG TCGAGGAGCG GGAGCCCGCC
TGTCCCTTCC TCGACCTCGG AGACGAGCGG GCCGCCCTGG AGTGGATGCG GGCCCAGCTC
GCCTCCCCCC TGGGGGAGGG CTCGCTGCTG GCCACCGCCC TGCTGCGGGT CGCCGACGAC
CGCTACCTCT GGTACCTCCG CTGCCATCAC GTGATCATGG ACGGCTACAG CGGGCCGATG
ATCGCCCAGC GCCTGGCGGA GGTCTACACC GCGCTGGCCG AGGGCCGCGA CCCCGACCCC
GGCGGCCTCG GGTCACTGGC CACCCTGCTG GAGGAGGACG CCGCCTACCG GGCCTCCGAC
CGCTTCGAGG CCGACCGCCG CCACTGGCTG GGCAAACTCG CCGACCTGCC CGCCGTACCC
GCCCTCTCCT CCGGCACGGC CGTCGCCACC TCGCGCTTCC ACCGGTGGAG CGCGATGCTG
GGGGAGCGTG ACGCCGCCGC GTTGACGGAA GGGGCGCTCT CCCACGGGAC GGCCGTGTCC
GGGCTCATGA TCGCCGCGGT CGCGGCCTTC GTCGGCCGCC TCACCGGGGC CCAGGAGGTG
GTCCTCGGGC TGGCGGTCAC CGCCCGGACC ACGCCGACCG CCCGGCGCAC CCCCGGCATG
GTCTCCAACG TGCTCCCGCT CCGCCTGGCC GTACGGCCGC AGACGACCTT CGGGGAGCTC
GTCGGGCAGG TCACCCGGGA GGTCGGGCGG CTGCTGGTGC ACCAGCGCTA CCGGTACGAG
GACCTGCGCC GGGAGGTGCG CGGGCGGCTG TTCGGGCCGG TCGTCAACAT CATGCGCTTC
GACTACGACC TGAAGTTCGC CGGTCACGCC GCCACCGCGC ACCCGCTGTC CACCGGGCCG
ATCGAGGACC TGTCGATCAA CCTCTACGAC GGCTCCGACG GGCGCGGCGT CCGCATCGAC
TTCGACGGCC ACCCCGAGCT GTACGGCGAG GCCGAGCTGG CCGACCACCA CGACCGGTTC
CTGCGCTGCC TGGCCCAGAT CGCCCAGGCC AGCCCGGACC TGCGCGTCGG CGCGATCGAC
CTGCTCGACG AGGCCGAGCG GGCGCTCCTC GGCGACTGGG GCGGCGCGCC GCGCGAGGTG
CCGGACGGCA CCGCCGTCAC CCTGTTCGAG GAGCAGGTAC GGCGGCGCCC CGAGGCCGTG
GCCGTCGTGG CCGGGGACAC CGAGGTCACC TACGCCGAGC TGAACGTCCG GGCCAACCGG
CTGGCGCACC ATCTCATCGC CCGGGGCGCC GGACCCGACC GCTACGTCGG GCTGTCGGTG
CCCCGCTCGC TCGACATGGT CGTCGCGTTC CTGGCGATCC TGAAGTCCGG TGCCGCCTAC
CTGCCGCTGG ACCCCGACTA CCCGCGGGAC CGCCTCGCGG TGATGCTCGC CGACGCCGCG
CCGCCGATCG TGGTCACCTG TGCGGCGGCC GGGCTCCTCC CGCCGCCCGG GGCGGACACG
GTGGACGCCC GGACCGGAGA GAGCCTCGCC GGAGCCGGAG CCGGAGCCGG AGCCGGAGCC
GGAGCCGGCG ACGGAGGCGG AAGCGGCGAC GGAGGCGGGG CCGGGAGGGC GGGCACCCGG
ATCGAGGACG GTCTCTCCGG CGCGGAAGTC GTCATGATGG ACACCTGGAC CGGGGACGGC
CTCCCCGGTT CCGATCCGGT GACCGCGCTG CTGCCCGGCC ACCCGGCCTA CGTCGTCTTC
ACCTCCGGCT CGACCGGCCT GCCCAAGGGC GTGCTGGTCA CCCACGCGGG CCTGCCGGGT
TACGCGCGGA CCGAGATCGA CCGGTACGCG GTGACCGAGG ACAGCCGGGT GCTCCAGTTC
GCCTCGATCG GGTTCGACGG GGCGGTGCTG GAGTGGCTGA TGGCCTTCTC CTCGGGGGCG
GCGCTCGTAC TGGCCCCCGC CGGCGTCTAC GGCGGGGAGC CGCTCGGCCG CTTCCTCGCC
GAGACGCGCG TCACCCACGC CTTCATCACC CCGGCCGCGC TGGCGACGAT CCCGGAGCGG
CCGCTCCCCG ACCTGCGGAC GCTGCTCGTG GGCGGCGAAG CCTGCCGCCC CGAGCTGGTC
CGCCGCTGGT CCGCCGGCCG TCGGATGATC AATGTGTACG GACCGACCGA GACCACGGTC
GTGGTCGCCA CCAGCGACCC GCTGACCTCT CCCGACGACA CCCCGATCGG CCGCCCGGTC
TACGACACCC GCCTGCACGT CCTGGACCCC GGCCTGCGCC CGGTCCCGCC CGGCACGGCC
GGCGAGCTGT ACGTCGCGGG CCCGAGCGTG GCCCGGGGCT ACCTCAACCG GCCCGGTCTG
ACGGCCGAGC GGTTCGTCGC CGATCCGACC GGGGCCGGCG GGCGGATGTA CCGCACCGGA
GACCTGGTCC GGTGGGGGGC CGACGGCCAG CTCCAGTACC TGGACCGGGC CGACCGGCAG
GTCAAGGTGC GCGGCTTCCG GATCGAACTG GGAGAGATCG AGAGCGTGCT CGCCGGTCAC
CCCGGCGTCG GCCAGGCGGC CGTGCTGGCC CGCGACGACC AGCCCGGGGA CCGGCGCCTG
GTCGCCTACG TCACCGGTAC GGCGGACGCC GACGAGCTGC GGCGGCATGC CCGGCGGGCC
CTGCCCGACT ACATGGTCCC GGCCGCGATC GTGGTGCTGG ACGCGATGCC GCTGACCGCC
AACGGCAAGC TGGACCGGCG GGCACTGCCC GCGCCGGAGC GGGAGACCTC CGGCCGGGAG
CCCGCGACGG ATCGGGAACG GCTGCTGTGC GCGCTGTTCG CCGAGGTGCT GGGGGTGGAG
CGGGTCGGCG CCGACGACGG CTTCTTCGAC CTGGGCGGCG ACAGCATCCT CGCGATCCGG
CTCGTGGCCC GCGCCCGCGA GGCCGGGCTG GGACTCACGC CCAAGCACGT CTTCCAGCAC
CGGAGCCCGG AGGCTCTCGC GCTCGTCGCG CACGAGGCCG TGGAGACGGT CGTCACCGGC
GACGACGGCG TGGGAGCCGT CCCGGCCACC CCGATCATGG CCTGGCTGGC GGAGCGGGGC
GGGCCGATCG ACGGGTTCAA CCAGACGGTG GTCCTGCGGG TCCCCCCGGG GCTTGGCCTG
GATCATCTGA TCGTCGCGGT GCAGGCCGTG CTGGACTGTC ATGACGTGCT GCGGTTGAGG
GTGGCCGGGC CGGGGGTGTC GGGGCTGGAG GTGATGGCGC GCGGGGTGGT CCGGGCGGTG
GAGCACGTGC GCCGGGTCGA GGTCGCCGGG GAGGTGGAGG AGGCGGTCGC CGGGCAGGTC
GAGTTGCTGC GGTGCGGGTT GGATCCGGTG GCGGGCAGGA TGGTTGGCGT CGCGTGGCTG
GATGCCGGGC CGGAGGTCTC CGGCAGGCTC GTGCTCACCG TCCATCATCT GGCGGTCGAC
GGTGTGTCGT GGCGGATCCT GCTGCCGGAT CTGTTCGCCG CCTGGGAGGC GGCGGTTCAA
GGGAAGGCTC CGGTGCTGGC CCCGGTCCCG ACCTCCTTCC GGACCTGGGC CCACCGCCTG
CGGGACGAGG CCCGCGACCG AGGCGAGGCC CGCGACGGGG GCGACGCTCA TGACAGGACC
GCCGAACTCG ACCGCTGGAC CGAGATCCTC GACGGCCCCG CCGGGGACGG GAGCGCGACT
CCCCGGTGGT GGGGGACGTG GGGGGAACGG CGCGAGACGG TCGTGGAGCT TCCGGCCGAG
GTGACGAGGC CGCTGCTCGG CAAGGTCCCC GGGGCGTTCC ACGGACGGGC CGACGACGTG
CTGCTCGCGG GGCTCGCGGC GGCTGTCGCC AGATGGAGCG GGGACCGGTC GGTCCTGATC
GACCTGGAGG GGCACGGCCG GGAGGACGTC TTCGCCGGAG TGGACCTGTC CAGGACCGTC
GGCTGGTTCA CCGCCATGTA TCCGGTCCGG ATCGACGCCG GCCCGGTCGA CTGGAGCGAC
CTCAGGAACG GGGGCCAGAG CGTGGCCCAG GCCGTCAAAC GGGTCAAGGA GCAGCTCCGT
GCCCTGCCCG ACTCCCTGAG CTACGGCCTG CTCCGCTATC TCAACCCCGG CACGGCGCCC
GTCCTGGCCG CCCTGCCCCG GCCCGAGATC GGCTTCAACT ACCTCGGCCG GATCACGACC
TCGGGCCAGG ACTGGGAACC GGCGACGGAC GGGCCGTCCG GCGGGCTGTC CGGCGGCCAG
GACGCCGGCG CCCCGCTCAG GCACGGCATC GAGATCAATG CGGTCGCCCT CGGCGACACC
CTCCGCGTCA CCTGGACCTG GTCGCCGCAC CACTACACCG AGGACCGGAT CGTCGAGCTG
GCCACCGCCT GGTCGGAGGC GCTGACCGGC ATCTCCCGGC ACACCGGAGG CGGCCTCACC
CCCTCCGACG TCACCGCCCG GCTCACCCAG GACGACCTCG ACGGCTTCGG CCCGGACCTC
CAGGACGCCT GGCCGCTCGC CCCCCTCCAG CGGGGTCTGT TCTTCCACTC CATGCTGGCC
GTGGACGTCT ACACCGCCCA GCTCGTGCTG GACCTCACCG GCCCCCTCGA CGCCGCCAGG
CTCAGAACCG CCGCCGAACG CCTGGTCCGC AGGCACCCCG GCCTGCGCGC CTCCTTCGAG
TTCCGCGACG GACCCGTCCA GCTCGTGCAC CGCCGGGTCG AGGTCCCCTG GCGGGAGGTC
GCCACCGCCG ACGCCGCCGG GGTGGCCGCC GAGGAGCGGG CGCACCGCTT CGACCTCTCC
AGGCCGCCGC TGCTGAGGTT CGCCCTGGTA CGGCTCGCGC CCGGCAGGCA CCAGCTGATC
CTCACCAACC ATCACATCCT GCTCGACGGC TGGTCCACCC CGCTGCTGGC CGCCGAGCTG
TTCGCCCTCT ACACCGGGGA CGAGCCTTCC GCGGCCCCGC CGTACAAGGG GTATCTGGAG
TGGCTGGCCC GCCAGGACCA CGCGGCGGCG ACGGCGGCCT GGGACCGGGC CCTGGACGGC
CTGGACGATC CCACGCTGGT CGCCCCGCAC GCCCCCGCCG ACCCGGTGCC GCCCGGCCGC
GTCACCGCCG AGCTCGCCGA CGGCCCGACC CGGGCGCTCA CCACGCTGGC CCGCACCCAC
TCCACCACGC TGAACACGGT CATGCAGGCC GCCTTCGGGC TGCTGCTCGC CCAGCTCACC
GGCGGCGACG ACGTCGTCTT CGGCGGCACC GTCTCCGGCC GCCCGCCCGA GCTGCCCGGC
GTCGAGCGGA TGGTGGGCCT GTTCATCAAC ACGCTGCCGG TCCGGGTACG GCTCCGCCCG
GCCGAACCGG TGGGAGACCT GCTCCGCAGG CTCCGCGACG AGCAGGCCGA GCTGCTGGCC
CACCACCACC TCGGGCTGAG CGAGATCCGG CGCGGCGTCC TGTTCGACAC GCTGCTGGTG
ATGGAGAACT ACCCCCTGGA CCCGCGCACC ACGATCGGGG ATCTCCGGCT GACCTCGGCG
GACGTGGCCG ACGCGACCCA CTATCCGATC ACGCTGCTGG TCATCCCGGG GGAGCGGCTC
CGCTTCCGGC TGCAGTACCG GCCCGACGTG TTCACCGAGG AGGAGGCCGA GGGGCTGCTG
GACCGGTTCC GGCACCTGCT CACCACGCTG GCCGCCGCGC CGGACACCCC CGTCGGCCGC
CTCGGCGGTC TCTTCCGCGA CGAGCGGGAG CTGCTCCTGC GCGAATGGAA CGATACGGCG
GCGGCCGTAC CCGAGGTGAC GCTGGCCGGG CTGTTCGAGG AACAGGTCGC GCGGACCCCG
GACGCGACGG CGCTGGTGTC CGAGGGCGTG GAGCTGAGCT ACGCCGCGTT CAACGCGCGG
GCGAACCGGC TGGCGCGGCT GCTGGCCGAG CACGAGGCCG GGCCCGAGCG GGTGGTGGCG
CTGATGCTGC CCCGGTCGCC GGACCTGCTG GTCGCGATGT ACGCCGTGGT CAAGGCGGGC
GCCGCCTACC TGCCGATCGA TCCCGGCCTC CCGGCCGGGA GGATCGCGTC GATGCTCGAC
GACGCGCGCC CGGTCCTCGT CATCGACCAC GACTGGCTGG CCGGCGCGGA CGTGTCCGGA
TGTTCGGCGG AGAACCTGGA GGTACGGCCG GCGCCGGACA ACCCCGCCTA CGTGATCTAC
ACCTCGGGCT CGACCGGACG GCCCAAGGGC GTCGTGGTCT CCCACCGGTC GATCGTCAAC
CGCCTGCTGT GGGCGCAGTC CCGGTACGGC CTGCGCGCCG ACGACCGGGT CCTGCAGAAG
ACACCCGCCG GGTTCGACGT GTCGGTCTGG GAGTTCTTCT GGCCGCTCCA GACCGGCGCC
GCCCTGGTGA TCGCCCGGCC CGGCGGCCAC CGGGACCCGG CCTACCTCGC CGGGCTGATC
GACGCCGAAC GCGTCACGAC CGTCCACTTC GTCCCGTCCA TGCTCGCCGT CTTCCTCGCC
GCCTCGGACG GCCCGGCGCC GGCCTCCCTC CGGCGGGTCA TCTGCAGCGG CGAGGCGCTC
CCGCCGGAGG TCGCCGACCA GGCGGTCGCG CGGCTGGGCG TCCCGGTGCA CAACCTCTAC
GGCCCGACCG AGGCCGCGGT GGACGTCACC TACTGGGAGC ACCGCCCCGA TCCGGACGGC
GGGTCCGTCC CGATCGGCCG CCCGGTCTGG AACACCCGCG TCTACGTGCT GGATCCGTGG
TTGCGGCCGG TTCCGGTGGG AGTGGCCGGG GAGCTTTACC TGGCCGGGGT GCAGCTGGCG
CGGGGGTATC TGGGGCGTGG CGGGTTGACG GCGGAGCGGT TCGTGGCGTG TCCGTTCGGG
GGTCCGGGGG AGCGGATGTA CCGGACGGGT GATGTGGTCC GGTGGCGTGC CGATGGGGGG
TTGGACTACG TCGGTCGTGC TGATTTCCAG GTGAAGGTGC GGGGGTTCCG GATCGAGTTG
GGTGAGGTGG AGGCGGTGCT GGCCCGGCAT GGGTCGGTGT CGGGGGTGGC GGTGGTGGTG
CGTGAGGACC GGCCGGGGGA TCGGCGGCTG GTCGCGTACG TGGTGCCCGC TCCGGCGGGT
CCGGTCGAGG GGGACATGCT CGCTGTTCCG GCGAGGCGGG CCGGGGGAGA AGCGGTTGCC
GCTCCGGTGG GGCGGGTCGT GCCGGCGGAG GTGGACGCGG CGGAGTTGCG GCGGTGGGTG
GGGGAGGTGT TGCCGGAGTA CATGGTGCCG TCGGTGGTGG TGGTGCTGGA TGTGTTGCCG
GTGACGCGGA ACGGGAAGTT GGATCGGGCG GCGTTGCCGG TGCCGGAGGT GTCGGTGGGG
TCGGGGCGGG GGCCGCGGTC GGTGGTGGAG GAGGTGTTGT GTGGGGTGTT TGCGGAGGTT
TTGGGGTTTG ACGGTGTGGT GGGGGTGGAT GATTCGTTTT TTGATTTGGG TGGGGATTCG
CTGCTGGTGA TGCGGTTGGT GAGTCGGGTT CGGTCGGTGT TGTCGGTGGA GTTGCCGGTG
CGGGTGGTGT TTGAGGCGTC GTCGGTGGCG GCGTTGGCGG TGTGGGTGGG GGGTGTTTCG
GGGGGTGTGC GGCCTGGGGT GGGGCGGGTT GTGCGTTCGG GGGTGGTGCC GTTGTCGTTT
TCGCAGCGGC GGTTGTGGTT TTTGAATCGG TTGGAGCCGT TGTCGGCGGT TTATAACTTG
CCGGTGGTGT TGCGGTTGTC GGGTGGGGTG GATGTGGGGG TGTTGCGGGG GGCGTTGGGT
GATGTGGTGG GTCGGCATGA GAGTTTGCGG ACGGTGTTTC CTGAGGTGGG TGGGGTGGCG
TGCCAGCGGA TCCTCGACCC CGCCGAACCG GCGCTGGAGC ACGTGCACCT TGAGGAGGCC
GAGCTCGCGG ACGCTCTGCT CGCCTCCGTG GGCCGGGGAT TCGATCTCGC CGTTGAGCCG
CCGCTCCGCG CGACGCTGTT CACCCTGGGC CCCGACGAGC ATGTGATCGC CCTGGTGTTG
CATCACATCG CGGGGGACGG TTGGTCGATG GCTCCGCTGG CGCGTGATGT GATCACCGCT
TATGAGGCGC GGTCTCGTGG GCGGGTGGCG TCGTGGGCGC CGTTGCCGGT GCAGTACGCC
GATTATGCGG TGTGGCAGCG GGAGTTGCTG GGGGAGGAGT CGGATCCGGG GAGTCTGGTC
AGCCGTCAGG TGGGGTTCTG GCGTGCGGCG CTGGCGGGGT TGCCGGATGA GATCGCGTTG
CCGGTGGACC GGCCCCGGCC CGCCGTCGCC TCCTACCGGG GCGGGTCGGT CCCCGTGTCC
ATTGGCGTGG AGGAGGTTCG CCGCCTGAGG GCGCTGGCGC GGGAGGAGAA CGCGAGCCTG
TTCATGGTGG TGCAGGCGGC GTTGGCGGCG TTGCTGACCC GGTTGGGTGC GGGCACGGAT
GTGCCGATCG GGTCGGTGGT GGCGGGCCGG GTGGATGAGG CGCTGGACGA TCTGGTGGGC
ATGTTCGTCA ACACGTTGGT GCTGCGGACC GACACCGGGG GTGATCCGGG TTTCCGTGAG
CTGGTGGGCC GGGTGCGGGA GGTGGATCTG GCGGCTTATG CGCATCAGGA TCTTCCGTTC
GAGCGGTTGG TGGAGATCGT CAGTCCGGCC CGTTCGATGG CCCGCCACCC CCTCTTCCAG
GTGGCCCTGA CCTTCCAGAA CAACCCCGCC GCGGAGCTCG AACTCGACGG GCTGTCGATC
CGGCCGGAGC CGGTGGAAGC CGGCGTCGCG AAGTTCGACC TGCTGATGTC CCTGACCGAG
ACGGCCGGAG AGCTGACGGG GACCCTCGAA TACGCCACCG ACCTGTTCGA CCCGGAGACC
GCCGAGGGGA TCGTGTCGAG GTTCCTCCGT CTCGTCCGGG CCGTGACCGC CGATCCCGAC
GTCTCCCTCA GCGCGATCGA CATCCTCGAC GCCCGTGAAC GCCACACCAT CCTCCGCGAC
TGGGCCGGCA CCGGCTCGTC CCCCGGGGTC CCCTCGACGA TCGCCGAGGA GTTCGAGGCG
CAGGTGGCGC GGTCGCCGCA TGCGGTGGCG GTGGTGGGGT CGGGGGTGGA GCTGTCGTAT
GGGGAGTTGG ATGCGCGTGC GGGGGCGTTG GCGCGGGTGT TGGTGGGGTT GGGGGTGGGT
CCGGAGCGGT TCGTGGCGTT GGTGGTGCCG CGTTCGGTGG AGTTGGTGGT GGCGGTGGTG
GCGGTGGTGA AGGCGGGGGG TGCGTATGTG CCGATCGATC CGGGTTATCC GGCGGATCGG
ATCGCACACA TCGTGCGGGA CGCCCGCCCG GTGCTGGCTG TCACCGTCCC GGGGAGCGAG
GGGCTGCTCC CGGCCGGCCT GCCGAGGGTC GTGCTGGACG GCTCCGTCAT CACCGCTGTT
CCCGACGCCG CTGTGGCCGG TGGCGAGGTG GGGCGGTTGG TGCCTGCGCA TCCGGCGTAT
GTGATCTTCA CGTCGGGGTC GACGGGGCGG CCGAAGGGGG TGGTGGTCCC GCATGGGAAT
GTGACGCGGT TGCTGGCCTC GACGGAGGGG TGGTTCGGTT TCGACGAGAC GGATGTGTGG
ACGTTGTTCC ACTCGTATGC GTTCGATTTT TCGGTGTGGG AGTTGTGGGG GGCGTTGTTG
TATGGCGGGC GGTTGGTGGT GGTGCCGTTT GAGGTGAGTC GGTCGCCGGG GGAGTTCGTG
GCGTTGGTGG CCGAGTGTGG GGTGACGGTG CTGAATCAGA CGCCGTCGGC GTTTTACCAG
TTCATGCGGG CTGAGCGGGA GCGTGCGGGG GTGGAGCTGT CGCTGCGGTG TGTGATCTTC
GGCGGCGAGG CCCTTGATCC GGGACGCCTG GACGACTGGT ACGGCCGCCA TCCCGAGACC
GCGCCGGTGC TGGTGAACAT GTACGGCATC ACCGAGACCA CGGTGCACGT CACCTATGCC
GCGCTGGATC GGGAGGCGGC GGTGAGCGGG GTGGGAAGCG TCATCGGCGT GGGCATTCCC
GATCTGCGGG TGTATGTGCT GGATGAGTTT TTGTGTCCGG TGCCGCCTGG GGTGGTGGGG
GAGTTGTATG TGGCGGGGGC GGGGGTGGCG CGGGGGTATG TGGGCCGGGC GGGGTTGACG
GCGGAGCGGT TCGTGGCTGA TCCGTTCGGG GTCGGTGGTG GGCGGATGTA TCGGTCGGGG
GATCTGGCGC GGTGGGATCG TGCGGGGCGG TTGGAGTTTC TGGGGCGGGT GGATCAGCAG
GTGAAGATCC GGGGGTTCCG GATCGAGTTG GGTGAGGTGG AGGGGGTGTT GGTGGGGCAT
CCCGCTGGTG GCGGATGCCG CGGTGGTGGT GCGGGAGGAC CGGCCGGGGG ACCGGCGCCT
GGTCGCCTAC GTGGTCGGCA CGGCCACCCC CGACGCGCTC CGCGCATGGG CCAGGGAGGC
ACTGCCGGAT TACATGGTTC CGGCGGCGGT GGTGGTGCTG GAGGCGCTGC CGCTGACGGC
CAACGGCAAG CTGGACCGGC GGGCCCTGCC CGTCCCGGAG ATCACCTCCT CCGGGCGGGA
ACCCTCCACA CCGCGCGAGG CGCTGCTGTG CGCGCTGTTC GCCGAGGTGC TGGGAGTGGA
ACGGGTCGGC GCCGACGACG GCTTCTTCGA CCTGGGCGGC GACAGCATCA TCGCGATCCA
GCTCGTGGCC CGCGCCCGGC AGGCCGGGGT GGTGTTCGGC CCCAGGGACG TCTTCCGCCA
CCAGAGCGTG CGGGAGCTGG CCGCCGTCGC CACCGACGGG CAGGAGACGG AACACGAGCC
GGAAGGGGCG GGGATCGGCC CGCTGCCGGC CACCCCGATC ATGGCCTGGC TGGCGGAGCG
GGGCGGGCCG ATCGACGGGT TCAACCAGAC GGTGGTCCTG CGGGTCCCCC CGGGGCTTGG
CCTGGATCAT CTGATCGTCG CGGTGCAGGC CGTGCTGGAC TGTCATGACG TGCTGCGGTT
GAGGGTGGCC GGGCCGGGGG TGTCGGGGCT GGAGGTGATG GCGCGCGGGG TGGTCCGGGC
GGCGGAGCAC GTGCGCCGGG TCGAGGTCGC CGGGGAGGTG GAGGAGGCGG TCGCCGGGCA
GGTCGAGTTG CTGCGGTGCG GGTTGGATCC GGTGGCGGGC AGGATGGTTG GCGTCGCGTG
GCTGGATGCC GGGCCGGAGG TCTCCGGCAG GCTCGTGCTC ACCGTCCATC ATCTGGCGGT
CGACGGTGTG TCGTGGCGGA TCCTGCTGCC GGATCTGTTC GCCGCCTGGG AGGCGGCGGT
TCAAGGGAAG GCTCCGGTGC TGGCCCCGGT CCCGACCTCC TTCCGGACCT GGGCCCACCG
CCTGCGGGAC GAGGCCCGCG ACCGAGGCGA GGCCCGCGAC GGGGGCGACG CTCATGACAG
GACCGCCGAA CTCGACCGCT GGACCGAGAT CCTCGACGGC CCTGACCCAC TCATCGGCAG
CAGGCCGCTC GACCCGCGGA TCGACACCGT CGCCACGCTG CGCAGCCTCC GCCTGACGCT
GCCACCCGGA CAGGCAGAGC CCCTGCTGAC CAGCGTCCCC GCCGCGTTCC ACGGCCGGGC
CGGCGACGTG CTGCTGACCG GGCTCGCCCT GGCCGTGGCG CACTGGCGCA GGAAGCGTGG
CGGCCGGGGC ACCTCTGTCC TGCTGGACCT GGAGGGGCAC GGCCGGGAGG AGATCTTCCC
CGGCGTCGAC CTGTCCAGGA CCGTCGGCTG GTTCACCGCC ATGTATCCGG TCCGGCTCGA
CGCGGGCATC GGCGGCTGGA CCGACGAGCG AGCCGTGACC CAGGCGATCA AGCAGGTCAA
GGAGCAGCTC AGAGCGGTGC CCAACCCCCT GGGCTACGGC CTGCTCCGCC ACCTCGACCC
CGTGGCCGGC CCTGAACTGG CCGGGCTTCC CCAGCCGCAG ATCGTGTTCA ACTACCTCGG
CCGGGTCGCC GTCGCCGAGG GGGACTGGAA CCTGGTCCCG GCCGGGTTCG GCGGCCACGA
CCCCGGCATG CCGGTGGCCC ACACCCTGGA GATCAACGTG ACCACCCATG ACCGGCCGGA
CGGCCCCCAC CTGGAAGCCG TCTGGTCCTG GCCGGAAGGG GTGCTCACCG GAGCGGAGGT
CAGGGACCTG GCCGGGACCT GGTCCGAGGC CCTCGAAGGG CTGGCCGCGC ACGGCCGGGG
CGGCTACACG CCGAGCGACC TCCTGGTCGA CCTGGACCAG AACGAGATCG ACCGCATCCA
GGCCGCGTGG GAGAAGCGAT GA
 
Protein sequence
MATVLIPGSF LPLSAAQSGM WHAQRIAPED PIHIAQYIEI SGPVDPVLFA SAVRLAAREV 
DAIHVRIVEG GQIVEEREPA CPFLDLGDER AALEWMRAQL ASPLGEGSLL ATALLRVADD
RYLWYLRCHH VIMDGYSGPM IAQRLAEVYT ALAEGRDPDP GGLGSLATLL EEDAAYRASD
RFEADRRHWL GKLADLPAVP ALSSGTAVAT SRFHRWSAML GERDAAALTE GALSHGTAVS
GLMIAAVAAF VGRLTGAQEV VLGLAVTART TPTARRTPGM VSNVLPLRLA VRPQTTFGEL
VGQVTREVGR LLVHQRYRYE DLRREVRGRL FGPVVNIMRF DYDLKFAGHA ATAHPLSTGP
IEDLSINLYD GSDGRGVRID FDGHPELYGE AELADHHDRF LRCLAQIAQA SPDLRVGAID
LLDEAERALL GDWGGAPREV PDGTAVTLFE EQVRRRPEAV AVVAGDTEVT YAELNVRANR
LAHHLIARGA GPDRYVGLSV PRSLDMVVAF LAILKSGAAY LPLDPDYPRD RLAVMLADAA
PPIVVTCAAA GLLPPPGADT VDARTGESLA GAGAGAGAGA GAGDGGGSGD GGGAGRAGTR
IEDGLSGAEV VMMDTWTGDG LPGSDPVTAL LPGHPAYVVF TSGSTGLPKG VLVTHAGLPG
YARTEIDRYA VTEDSRVLQF ASIGFDGAVL EWLMAFSSGA ALVLAPAGVY GGEPLGRFLA
ETRVTHAFIT PAALATIPER PLPDLRTLLV GGEACRPELV RRWSAGRRMI NVYGPTETTV
VVATSDPLTS PDDTPIGRPV YDTRLHVLDP GLRPVPPGTA GELYVAGPSV ARGYLNRPGL
TAERFVADPT GAGGRMYRTG DLVRWGADGQ LQYLDRADRQ VKVRGFRIEL GEIESVLAGH
PGVGQAAVLA RDDQPGDRRL VAYVTGTADA DELRRHARRA LPDYMVPAAI VVLDAMPLTA
NGKLDRRALP APERETSGRE PATDRERLLC ALFAEVLGVE RVGADDGFFD LGGDSILAIR
LVARAREAGL GLTPKHVFQH RSPEALALVA HEAVETVVTG DDGVGAVPAT PIMAWLAERG
GPIDGFNQTV VLRVPPGLGL DHLIVAVQAV LDCHDVLRLR VAGPGVSGLE VMARGVVRAV
EHVRRVEVAG EVEEAVAGQV ELLRCGLDPV AGRMVGVAWL DAGPEVSGRL VLTVHHLAVD
GVSWRILLPD LFAAWEAAVQ GKAPVLAPVP TSFRTWAHRL RDEARDRGEA RDGGDAHDRT
AELDRWTEIL DGPAGDGSAT PRWWGTWGER RETVVELPAE VTRPLLGKVP GAFHGRADDV
LLAGLAAAVA RWSGDRSVLI DLEGHGREDV FAGVDLSRTV GWFTAMYPVR IDAGPVDWSD
LRNGGQSVAQ AVKRVKEQLR ALPDSLSYGL LRYLNPGTAP VLAALPRPEI GFNYLGRITT
SGQDWEPATD GPSGGLSGGQ DAGAPLRHGI EINAVALGDT LRVTWTWSPH HYTEDRIVEL
ATAWSEALTG ISRHTGGGLT PSDVTARLTQ DDLDGFGPDL QDAWPLAPLQ RGLFFHSMLA
VDVYTAQLVL DLTGPLDAAR LRTAAERLVR RHPGLRASFE FRDGPVQLVH RRVEVPWREV
ATADAAGVAA EERAHRFDLS RPPLLRFALV RLAPGRHQLI LTNHHILLDG WSTPLLAAEL
FALYTGDEPS AAPPYKGYLE WLARQDHAAA TAAWDRALDG LDDPTLVAPH APADPVPPGR
VTAELADGPT RALTTLARTH STTLNTVMQA AFGLLLAQLT GGDDVVFGGT VSGRPPELPG
VERMVGLFIN TLPVRVRLRP AEPVGDLLRR LRDEQAELLA HHHLGLSEIR RGVLFDTLLV
MENYPLDPRT TIGDLRLTSA DVADATHYPI TLLVIPGERL RFRLQYRPDV FTEEEAEGLL
DRFRHLLTTL AAAPDTPVGR LGGLFRDERE LLLREWNDTA AAVPEVTLAG LFEEQVARTP
DATALVSEGV ELSYAAFNAR ANRLARLLAE HEAGPERVVA LMLPRSPDLL VAMYAVVKAG
AAYLPIDPGL PAGRIASMLD DARPVLVIDH DWLAGADVSG CSAENLEVRP APDNPAYVIY
TSGSTGRPKG VVVSHRSIVN RLLWAQSRYG LRADDRVLQK TPAGFDVSVW EFFWPLQTGA
ALVIARPGGH RDPAYLAGLI DAERVTTVHF VPSMLAVFLA ASDGPAPASL RRVICSGEAL
PPEVADQAVA RLGVPVHNLY GPTEAAVDVT YWEHRPDPDG GSVPIGRPVW NTRVYVLDPW
LRPVPVGVAG ELYLAGVQLA RGYLGRGGLT AERFVACPFG GPGERMYRTG DVVRWRADGG
LDYVGRADFQ VKVRGFRIEL GEVEAVLARH GSVSGVAVVV REDRPGDRRL VAYVVPAPAG
PVEGDMLAVP ARRAGGEAVA APVGRVVPAE VDAAELRRWV GEVLPEYMVP SVVVVLDVLP
VTRNGKLDRA ALPVPEVSVG SGRGPRSVVE EVLCGVFAEV LGFDGVVGVD DSFFDLGGDS
LLVMRLVSRV RSVLSVELPV RVVFEASSVA ALAVWVGGVS GGVRPGVGRV VRSGVVPLSF
SQRRLWFLNR LEPLSAVYNL PVVLRLSGGV DVGVLRGALG DVVGRHESLR TVFPEVGGVA
CQRILDPAEP ALEHVHLEEA ELADALLASV GRGFDLAVEP PLRATLFTLG PDEHVIALVL
HHIAGDGWSM APLARDVITA YEARSRGRVA SWAPLPVQYA DYAVWQRELL GEESDPGSLV
SRQVGFWRAA LAGLPDEIAL PVDRPRPAVA SYRGGSVPVS IGVEEVRRLR ALAREENASL
FMVVQAALAA LLTRLGAGTD VPIGSVVAGR VDEALDDLVG MFVNTLVLRT DTGGDPGFRE
LVGRVREVDL AAYAHQDLPF ERLVEIVSPA RSMARHPLFQ VALTFQNNPA AELELDGLSI
RPEPVEAGVA KFDLLMSLTE TAGELTGTLE YATDLFDPET AEGIVSRFLR LVRAVTADPD
VSLSAIDILD ARERHTILRD WAGTGSSPGV PSTIAEEFEA QVARSPHAVA VVGSGVELSY
GELDARAGAL ARVLVGLGVG PERFVALVVP RSVELVVAVV AVVKAGGAYV PIDPGYPADR
IAHIVRDARP VLAVTVPGSE GLLPAGLPRV VLDGSVITAV PDAAVAGGEV GRLVPAHPAY
VIFTSGSTGR PKGVVVPHGN VTRLLASTEG WFGFDETDVW TLFHSYAFDF SVWELWGALL
YGGRLVVVPF EVSRSPGEFV ALVAECGVTV LNQTPSAFYQ FMRAERERAG VELSLRCVIF
GGEALDPGRL DDWYGRHPET APVLVNMYGI TETTVHVTYA ALDREAAVSG VGSVIGVGIP
DLRVYVLDEF LCPVPPGVVG ELYVAGAGVA RGYVGRAGLT AERFVADPFG VGGGRMYRSG
DLARWDRAGR LEFLGRVDQQ VKIRGFRIEL GEVEGVLVGH PAVADAAVVV REDRPGDRRL
VAYVVGTATP DALRAWAREA LPDYMVPAAV VVLEALPLTA NGKLDRRALP VPEITSSGRE
PSTPREALLC ALFAEVLGVE RVGADDGFFD LGGDSIIAIQ LVARARQAGV VFGPRDVFRH
QSVRELAAVA TDGQETEHEP EGAGIGPLPA TPIMAWLAER GGPIDGFNQT VVLRVPPGLG
LDHLIVAVQA VLDCHDVLRL RVAGPGVSGL EVMARGVVRA AEHVRRVEVA GEVEEAVAGQ
VELLRCGLDP VAGRMVGVAW LDAGPEVSGR LVLTVHHLAV DGVSWRILLP DLFAAWEAAV
QGKAPVLAPV PTSFRTWAHR LRDEARDRGE ARDGGDAHDR TAELDRWTEI LDGPDPLIGS
RPLDPRIDTV ATLRSLRLTL PPGQAEPLLT SVPAAFHGRA GDVLLTGLAL AVAHWRRKRG
GRGTSVLLDL EGHGREEIFP GVDLSRTVGW FTAMYPVRLD AGIGGWTDER AVTQAIKQVK
EQLRAVPNPL GYGLLRHLDP VAGPELAGLP QPQIVFNYLG RVAVAEGDWN LVPAGFGGHD
PGMPVAHTLE INVTTHDRPD GPHLEAVWSW PEGVLTGAEV RDLAGTWSEA LEGLAAHGRG
GYTPSDLLVD LDQNEIDRIQ AAWEKR