Gene Sros_1128 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_1128 
Symbol 
ID8664403 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp1141497 
End bp1153232 
Gene Length11736 bp 
Protein Length3911 aa 
Translation table11 
GC content73% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003336870 
Protein GI271962674 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGAGT GGCCGGGCGT CCGGGCGGGC ACGCGCCCCG GACGGCGGGG CCGGGATGTC 
GTCGGCGCGG TGGCATGGAC GCCCCGCCAC AGAAGAAGGG TCGCCCGGAT CATGGCCCTG
GCCCTGACCG GTTCGGCGCT GACCGGCACG CAGGCAGCCG CCGCCGCGGC GGCCGGCACG
CTCCTGTTCA GCCAGCCCTT CCGCAACAAC ACGGCCAACG GGACCGGCGC CGTGGTGCTG
CCCGCGCTGC CCAGCGGCAC GGGCACGACG AACTTCGCGT GCCTGACCGC CTCCGGTAAC
ACCAGCACCG GCGTCCTGCG CAGCTGCACC ACCAGCACGG ACAGCGCGGG CTCCGGCAAG
CTGCGCCTCA CGAACGCGAC GACCAGCAAG GCGGGCGGCG TCTTCAGCGC CACCAGCGTG
CCCACCTCCC AGGGCCTCGA CGTCACGTTC AACACCTACC AGTACGGCGG AGGCGGCGCC
GACGGCATCA CCTTCGTGCT GGCGGCGGTC GACCCGGCCA ACCCCCAGTC CCCGGCCAAC
ATCGGCCAGC TCGGCGGTGC GCTGGGCTAC TCCGCGAACG GCGGTTCTCC CGGCCTGGCC
TACGGCTACC TCGGCATCGG CTTCGACGTG TACGGAAACT TCAGCAACAG CACCTACCAG
GGGAGCGGCT GCACGAACCC CGCCTACATC GGGACGGGCA GCGTCCGGGT GCCCGGCCAG
GTCCTCGTCC GCGGGCCGGG GAACGGCACC GTCGGATACT GCGCGCTCAA CAGCACGGCC
ACCTCGACCA GCTCCAGCGC CCTCGCCCTG CGGGCGAGCG CCCGTACGGC CGTCCCGGTC
CAGATCGGCA TCAACCCCAC CAGCTCGGTC CTCACCACCG CCGCCGGCCT GGCGGTCCCG
GCCAACAGCT ACCGGATGGT GGTCACCCTG GTGGGCGGTG CGACCAGGAC GCTGACCGGT
ACCCTCCCCT CGGTCACCTC GGGCCTCTAC CCGTCGAGCT GGCTGAACGC CAGCGGGATC
CCCCGGCAGC TGGCGTTCGG CTGGGTGGCC TCCACCGGAG GCGTGACCGA CTTCCACGAG
ATCGACGAGG CCGCGGTCTC CACGATCAGC GCCGTCCCCG AGCTGACGGT CGCGCAGACC
GGCTACATTG CCTCGACGCT GGCGCCGGGC GACCCCGTGA CCTACAACGT CGTCGCCGGC
GTCGCCGCGG GCCTCCCGGA GACCTCCCCG GTCTCGATCA CCCAGACCAT GCCGGCCGGG
ACGGTGCCCG TGGGCGCCTA CGGGACGGGA TGGGTCTGCG ACGCCCCGTC CGGGCGCTCC
ATAACCTGCA CCAACGGCAA CGGCCCGTTC GCGGCCGGGG CGGCGCTGCC GGCGCTCACC
GTGGTGGGCA TCGTCACCGG GGGCAACGTC ACGCCGGCGC TGGTCCAGAG CGCGACCGTG
GCCACGGCCT CGTCGATCGA CGCCAGCCCC GCCTACTCGT CGTCGACGAC CGCCGGCACG
CTCCCGGCCG CACCGGGCGG CATCGCCGTG AGCCCGGCAC TGGGCTCGAT CGCCGGCGGC
AACACCGTGA CCGTGAGCGG CACGAACATC TCCAACGCCA CCGCCGTCGA GATCGGCACC
ACCGCCCAGC AGGAGGCGGG CACCCCGGTC GTCCTGCTGC CCTGCGCGGC CGGGGTCACC
ACCGGCTGCT TCACGATCAA CGCCAACGGC ACGCTGACCA TCCCCTCCAT GCCCGCCAGG
GCCACCAACG GCGCGGTGAA CATCAACATC GTCACCCGCG GACTGGACGC GGCGGCCACC
TACACCTACG TCGCCAGCCC CGGCACGCCC ACCACGCCCA CGGCCGTGGC CGGGGTGACC
AGCGCGACGG TGAGCTGGAC GGCGCCGGCG GGCAACGGCG GCGCCATCAC CGGCTACATC
GTGACCCCGT ACCGGAACGG CGTGGCCCAG ACCCCGGTGA GCTTCGACGC CTCGGCCACC
AGCAGGACGC TTACCGGGCT GACCGCCGAC GTGCCGTACA CCTTCACCGT CGCCGCGGTG
AACGCGATCG GCACCGGCTC GGCCGGCCCG GCCTCCAACC CGGTCGTCCC CTACAACGTC
CCCGGCCGGC CCGTCATCAC CGCGGCCACC GCCGGAACCT CCTCCGCCAC CCTCACCTGG
ACGGCGCCGG CGGGCAACGG CAGCGCCATC ACCGGCTACG TCGTCACGCC CTACGTCAAC
GGCGTCGCGC AGCCCACCCA GACGTTCAAC AGCGCCGCGA CCACCCAGAG CGTCACCGGA
CTGACCCCGG GGACCGCGTA CACGTTCACC GTGACCGCGG TGAACGCGGC CGGCCCCGGC
CAGCCGTCCG AGCCCTCGGC CACCGCCACC CCCAACTCCC CACCCGCGTT CACCTTCCCC
GCGCCGCCGG CCGGAGAGGT GGGGGCCGCC TACAGCGTCC CGCTGACGGT CAGCGCCGGC
ACGGCGCCGT ACACGTGGTC GGTCGGCGCG GGCAGCCTGC CGCCGGGCCT GACCCTGAAC
GCCTCCACCG GCGTGCTGTC GGGCACCCCG ACCGCGGCGG GCGGCTACTC CTTCACCGCC
CGGGTCACGG ATGCCGGCAA TGTGAGCACC ACCCGGGAGG TCACCCTGGT CATCGCGCCG
AGACCCGCGT TCACCTTCCC CGCGCCGCCG GGCGGCGAGG TGGGCGTCGC CTACAGCGTC
CCGCTGACGG TCAGCGGTGG CACGGCGCCG TACACGTGGT CGGTCGGCGC GGGCAGTCTG
CCGCCGGGCC TGACCCTGAA CGCCTCCACC GGCGTGCTGT CGGGCACGCC GACCGCGGCG
GGCGGCTACT CCTTCACGGT GAAGGTGCTC GACGCCCAGA ACCAGAGCGA CACCACGGCG
GTGAGCCTGA CGATCGTGCC GCAGCCGGCG TTCACCTTCC CCGCGCCGCC GGCCGCGCAG
GTGGGGGTGA CCTACAGCGT CCCGCTGACG GTCAGCGGCG GCACGGCGCC GTACACGTGG
TCGGTCGGCG CGGGCAGTCT GCCGCCGGGC CTGACCCTGA ACGCCTCCAC CGGAGAGCTG
TCGGGCACCC CGGCCGCGAC GGGCAGCCAC CCGGTCACCT TCCGGGCGGT CGACGCGAAC
GGCCAGGCCA CCACCAGGGC GGTGACGCTG GTCGTCACGT CCGGGCCCCT GGTCGTCGTC
AAGACGGCCA GCGCCTCCTC GGCGGTCGCC GGCGGCACGG TCGGCTACAC GATCACCGTG
AACAACACCG GGCCGAGCGC CTTCACCGGT GTGACGGTGA ACGACGCGCT GGCGGGGATA
CTGGACGACG CGGCCTACAA CGGCGACGCG GCCGCCACCG CCGGCGCGGT CTCCTTCGCC
GCCCAGACCC TGACCTGGAC CGGCGACGTG GCCGCCGGCA CGACCGTGAC CATCACCTAC
TCGATCACGG TGAACAGCCC CGGCACCGGC AACAAGGTGC TGGCCAACGC GGTGACCTCA
CCGACGGTCG GCAGCACCTG CCCGGCCGGC GGCGGCGACC CCCGATGCTC CGCCACGGTG
ACCGTCGCCG GGCTGTCCAT CGTCAAGACC GCCGACGTCA CGACGGCGAC ACCCGGCGGG
ACGGTCCGCT TCACCGTCAC GGCCACCAAC AACGGCCAGA CCCCGTACAC CGGGGCGACC
TTCGGCGACG CCCTGGCGGG CGTGCTGGAC GACGCGGTCT ACAACGGCAA CGCGACCGCG
ACCAGCGGCA GCCTGTCGTT CAGCGGCTCC ACCCTGACCT GGACCGGGAA CCTCGCGGTC
GGCGCGAGCA CCACCGTCAC CTACACGGTG ACGGTGCGCA ACCCCGACCC CGGAGACCGC
AGCCTCGCGG GCACGGTCCT GTCCGGCACC CCGGGAAGCA CCTGCCCGCA GGGGAACCCC
GGCCCGCAGT GCACCGCCGT GGTCACCGTC CTGGTCCCGG CCCTGGCGAT CACCAGCAGC
GCGGACGCCA CCACCACGAC CCCCGGATCG GTCGTGCGCT ACACCTTCAC GGCCTCCAAC
ACCGGGCAGA CGCCGTACGC CGGGACGAGT TTCACCACCT CTCTGGTGGG CGCGCTGGAC
GACGCCGCGT TCAACGGCGA TCTCGCCGCG ACGTCGGGCA GCGCCGTCCT CAACCCCGAC
GGCACCATCA CGTGGACCGG AGACCTGGCC GTCGGCGCGG CGGTCACGGT CACCGGCTCG
GTCACGGTGA AATCCCCCGA CAACGGAGAC AGGGTCCTGA GGACCTCCGT CACCTCCGGC
GCGCCGGGCA GCACCTGCCC GGTGGGAAAC CAGTCACCCG CGTGCCTCAC CGGCGTCTCG
GTCCTGGTCC CCGGTCTGAC GATCACCAAG ACGGCCGACG TGTCGGCCAC GACCCCCGGA
TCGGTCGTGC GCCACACGAT CGCGGTCACC AACTCCGGGC AGACGCCGTA CACCGCCGCG
ACGGTCGCCG ACGCGCTGGC CGGGGTGCTG GACGACGCGA CCTACAACGC GGACGCCGCC
GCGACCAGCG GTTCGGTCGG CTACGCCGGC TCCACGCTGA CCTGGGTCGG CGACCTGGAC
GTCGGGGCGA GCGCCACGAT CACCTACTCC GTCACCGTGC GCGACCCCGA TCCCGGTGAC
ATGACGCTCA CCGGCACGGT CTCCTCGCCG ACCACCGGCA GCAACTGTCC CGCCGGGAGC
GGCGACTCCC GGTGCGCCGG CAGCGTGACG GTCCTCGTCC CGCAGCTGAC GATCACCACC
GCGACCGGCG GGGCCACCAC GACCCCGGGT GCCGTCGTGC CGTACACCGT CACGCTCGCC
AACACCGGGC AGACCCCGTA CACGGGCGCC GGCGCCAGGT TCGTCATCGC CGACGTGCTC
GACGACGCGA CCTACAACGG CGACCTCACC ACGGACGCGG GCAGCCTCAG CGTGGCCCCG
GACGGCGCGA TCCTGTGGGC CGGGGACATC GCGGCCGGCG CGACGGTCAC CATCACCGGC
TCGGTCACCG TGCACGCCCC CGTCACCGGC GACAAGGTGC TGAGGACCTC CGTCACCTCC
GCCGCGCCCG GCAGCACCTG CCCGGTCATC GGCGCGACGT CGCCCGGCTG CTTCACCGTG
GTCACCGTCC TGGTCCCGGC CCTGACGATC ACGAACACCG CCGACACCCA GTCGGCCACA
CCGGGAGACA CGGTCACCTA CACGATCACG GTCGCCAACA CGGGAGAGAC CCCCTACACC
GGTGCCCGGG TGACCGAGTC GCTGACCCGG GTGCTGGACG ACGCGGTCTA CAACGGCGAC
GCGGCCGCGA CGACCGGTAC GGTCACCTTC GCCGGGACGG ATCTGAGCTG GAGCGGTGAC
CTGGCGGTCG GCGCGAGCGC CACGATCACC TACTCCGTCA CCGTGCGCGA CCCCGATCCC
GGCGACCGGC AGATCGCCGC CGTCGTGATC TCGCCCACCC AGGGCGGCAA CTGCCCCGCC
GGCGGCACCG ACCCCCGGTG CGCCGCGGCG GTGGCCGTCC TCGTACCGGA GCTGACCATC
TCCAAGAGCG CGGACGCGAC GACGGCGGCA CCCGGGTCCA CCGTCCAGTA CACGGTCACC
GTCACCGACT CCGGGCAGAC GCCGTACACC GGTGCCACCG TCACCGACCT GCTGGCCGGG
GTGCTGGACG ACGCGGTCTA CAACGGCGAC GCGGCCGCGA CGACCGGCAC CGTCGGCGTC
GCCGGGACGG ACCTGAGCTG GAGCGGTGAC CTGGCGGTCG GGGCGAGCGC CACGATCACC
TACTCCGTCA CCGTGCGCGA CCCCGACCCC GGCGACGCGC TGCTGACGAG CACCGCCGTC
TCGCCGGCGC GGGGGAGCAA CTGCCAGGCC GGGAGCACCG ACCCCCGGTG CACCGTCTCG
GTGCCGGTGG CGCGCCTCGT GCTGGAGCAG GGCTACACCC GGACCGGTGC GGCACCCGGC
TCCGTGGTCC GCCTCAACGC CACCTTCACC AACACCGGGC AGGTGCCCTA CACGGGGATC
AGGGTCTTCA GCGCCAGCGG CGACACCGTC GACGACGCCA TCCCCAACGG CGACCAGGTC
GCCGACTCCG GCACGCTCGT CCTCGACGCC CAGGGCATCA CCTGGACCGG CAACATCCCC
GTCGGGGGCG TCGTCAACAT CACCGGCACC CTGACGCTCA AGAACCCGCC CACGGGCGAC
CGGACCCTCA CCGGCACGCT GGTCTCCGAG GCGCCGGGCA CCACCTGCCC CCCGGGCGGC
TCCGACCCCC GCTGCACGTC CCGCCTCGAC GTGCTCGTCC CCGGTCTGAC GATCACCAAG
GCCGCCGACA CCGCGGCCAC GGTCCAGGGC GGCACCGTCG GATACACGGT CACCGTCACC
AACTCGGGGC AGACGCCGTA CACGGGGGCG GCGTTCACCG ACGCGCTGGC CGGGGTGCTG
GACGACGCGG TCTACAACGG TGACGCGGCC GCGACGACCG GTACTGTCGG CGTCGCCGGG
ACGGATCTGA GCTGGAGCGG TGACCTGGCG GTCGGGGCGA GCGCCACGAT CACCTACTCC
GTCACCGTGC GCGCCCCCGA CCCCGGCGAC AGAAGCCTGA CCGGCACCGT CTCCTCGCCC
ACCACCGGCA ACAACTGCGC CCCGGCGAGC GGCGACCCGC GATGCACCAG CAGCGTGATC
GTCCTCATCC CCGCCCTGAC GATCACCAAG AGCGTCACTC CCACGACGGC GGTGCCGGGA
AGCACACTCA CCTACACGAT CACGGCGGCC AACACCGGGC AGCTCCCGTA CACGGGGGCG
GCGTTCACCG ACGCGCTGGC CGGGGTGCTG GACGACGCGG TCTACAACGG CGACGCGGCC
GCGACGACCG GTACGGTCAC CTTCGCCGGG ACGGATCTGA GCTGGAGCGG TGACCTGGCG
GTCGGCGCGA GCGCCACGAT CACCTACACG GTCACCGTCG ACAACCCGGT GACCGGCGAC
AGGAACCTGG CGAGCACGAT CACCTCCGCC ACCCCCGGCA CCACCTGCCC CGCCGGAGGG
ACCGACCCGC GCTGCGGCAC CGGCGTCCCG GTCACCCAGG CCACCACGCT GACGTTCGAC
AAGTCCGCGG ACACCCGATC GGTCGCGCAG GGCGAGGTGG TCACCTACAC CATCACGATC
AGCAACAGCG GGCTGATCCC TTACAACGGC GCGGCGTTCA CCGACTCGCT CGCCGGGGTG
CTGGACGACG CCGCCTACAA CGGCGACGCC GCGGCCGGCA CGGGCCTCGT CAGCGTCGCC
GGCCCGCTCC TGAGCTGGAC CGGGAACGTC CCGGCGAACG GATCGACCAC GGTCACCTAC
TCGGTGACGG CCGGCACCCC GGGCACCGGC GACGACATCC TCACCAGCAC CCTGGTGTCG
CCGTCCCCGG GCGGCAACTG CGAGGCGGGC GGCGGCGACC CGCGCTGCGC GGCCACGGTG
ACCGTGGCCA GGCTGTCCAT CGTGACCACG GCCGACGCGC CGACCACGGA ACCGGGCGAC
GTGGTGCGCT ACACCACGGT GATGACCAAC ACCGGGCAGA CGCCGTACAA CGGGACCAGC
GTCCTGTTCA ACGGATACGG CGGCCTCGAC GACGCCGTCC CCGGCGGCGA CCAGGTCGCC
ACCTCCGGGT CGCTGTCGCT GGGCCTCGAC GGGCTGACCT GGACCGGGAG CATTCCCGTG
GGGGGCAGCG TGACGCTGAC CGGTAGCGTC ACGGTGAACA ACCCCGACCT GGGCGACAGG
GTGATCCCCC TCACCGTGGT CTCCGCGGCG CAGGGCAGCA CCTGCCCCGT GGCCACCGCT
CCCGGCTGCA CCGTCATCGT CAACGTGCTG ATCCCCGAAC TGACGATCAC CAAGGCCGCC
GACCGGAACG CCGCCGTCCC CGGCGGCGCC GTGGCCTACA CGATCACGAT CGCCAACACC
GGGCAGACCC CGTACACCGG GGCGACCGCC ACCGACTCGC TCGCCGGCCT GCTGGACGAC
GCCGCCTACA ACGGCGACGC GGCCGCGACG ACCGGCACGG TCGGCTTCGC CGGCCAGACC
CTGACCTGGA GCGGCGACCT GGCGGTGGGC GCGACCGCGA CCGTCACCTA CTCGGCCACC
GCCGACACCC CGGACGTGGG CGACAAGCTC CTGACCAACT CCGTCGTGTC CACCGAGGCA
GGCAGCACCT GCCCGCCCGC CAGCGCGAAC GCGGCGTGCA GCGCGAGGGT CGTGGTCCTC
ACCCCCGCGC TGACCATCGT GAAGACCGCC GACAGGGCAT CGGCCACGCC CGGGGACACG
GTCACCTACA CGGTGAACGT CACCAACACC GGGCAGGTCC CCTTCGCGGC GGCGGACTTC
GCCGACGCGC TGGCCGGGGT GCTGGACGAC GCGGTCTACA ACGGCGACGC GACCGCGACG
ACCGGTACGG TCACCTTCGC CGGGCAGGCC CTCGGCTGGA CCGGAGGCCT GGCCCCCGGC
CAGGGGGCCA CCGTCACCTA CAGCGTCACG ACCGGCAGTC CGGGCACCGG CGACCAGCGG
CTGACCGGCG AGGTCACCTC CACCACCGCG GGCACCACCT GCCCGGCCGG CGGCACCGAC
ACCCGCTGCT CCAACACGGT CCTGATCTCC AGGATCACGA TCACGGCCTC GGCGGACGTC
GCCACGGCCA TCCCGACCGG GGTGGTCCAC CACACGGTCA CGATCGCGAA CACCGGGCAG
ACGCCGTACG GCAGCGCCGT CGTGGACGGC CTGCTCGCCG ACGTCTTCGA CGACGCCGCC
TACAACGGCG ACGGGACGGC CTCGGCGGGC AACCTCACCT TCGTGCCCGG GAGCGGCCAG
GCCAGATGGG AGGGCCCGCT CGCGGTCGGC GACACCGTCA CCGTCACCTT CTCGGTGACG
GTACGGAACC CGGACCCCGG CGACAAGGTC ATGAACGCCG TCATGACCTC CGGCACCCCG
GGGAACAACT GCCCAGCCGG GAGCCCCGCC CCGGCCTGCG CCTCGGCGGT GACCGTGCTG
ACCCCGGTCC TGGCCGTCTC CAAGAGCGCG GACAGGAGCA CGGTCACGCC CGGCGGCACG
ATCGCCTACA CGATCACGGT CGCCAACACC GGCCAGGCGC CGTACACCGG GGCGACGGTG
ACCGACCGGC TCACGCGGGT GCTCCCGGAC GCCGTATACA ACGGCGACGC GGCCGCCACG
GCCGGCACGG TCACCTTCGC CGGCTCCGAC CTGACCTGGA CCGGTGACCT CGCGGCCGGC
GCGAGCGCCA CGATCACCTA CACGGTCACC GTGCGCGACC CCGATCCTGG CGACAAGCAG
ATCGTCAACA GGGCCTTCTC CGACACCCTG GGCAGCACCT GCCCGTCCAC CGGGTCCGTT
CCGGCCTGCA CGACGCTCGT CACCGTGCTC GTCCCGGCCC TGAGGATCGT CAAGGCCGCC
AACACCGTCG TCGCCACCCC AGGCGAGACG GTCGGCTACA CGGTCACGGT CACCAACACC
GGGCAGACGC CGTACACCGG GGCGACCGTC GCCGACGCGC TGGCCGGGGT GCTGGACGAC
GCGGTCTACA ACGGCGACGC GGTCGCGACG AGCGGCACGG TCACCTTCGC CGGCTCCGAC
CTGACCTGGA CCGGCGACCT CGCGGCCGGG GCGTCGGCGA CCGTCGACTA CTCGGTCACC
GTCGACATCC CCGACACCGG CGACAGGCTG CTCACCGGCG CCGCCACGTC GAACGCGCCC
GGCAGCACCT GCCCGGCGGG CACCACGGAC CCGGCGTGCG TCTCGACGGT CACCGTCCTG
ATCCCCGGCC TGGCCGTCTC CACCGTCGCC GACCGGGCGA CCACGACCCC CGGCGGCACC
GCGCGGTACA CGGTCACGAT CGCCAACACC GGCCAGACCG CCTACAGCGG CATCAGCGTC
AGCGACGTGC TGACGGAGGT GCTGGACGAC GCCGCCTACA ACGGCGACGC CACCGCCACG
GCGGGCACCG TCGTGTTCAG CGGTCCCGTC CTGACCTGGA CCGGCGACCT GGCGACAGGC
GAGACGGTGA CCGTCGCCTA CACCGTGACC GTCGCCGATC CCGACACCGG GGACAAGGTC
ATGACCGGTA CCGTCGCCTC GTCGGCACCG GGCAGTACCT GCCCGGTGGG CTCCACCGCG
CCCGCCTGCG GCGCGACCGT CACCGTGCTG ATCCCCGCCC TGGACATCGT CAAGACCGCC
GGCGCGCCGG CCACCGTCCC CGGAGGCACG GTCGGCTACA CGATCACGGT CACCAACTCC
GGGCAGACGC CGTACACCGG GGCGAGCGTC GCCGACTCAC TGCAGGGCCT GCTGGACGAC
GCCGCCTACA ACGGGGACGC CGCCGCCACC ACCGGGGTGC TCGCCTACGC GGAGCCGGTG
CTGACCTGGA CCGGCGACCT GGCGGTCGGC GCGAGCGCCA CGATCACCTA CTCCATCACG
GCGAACGGCA CCGCCACCGG TGACAAGACC CTCACCAACG TCGTCACCTC GGACGCTCCG
GGAAGCACCT GCCCCGCCCA GGGCACCGCC CCGGCGTGCT CCACGCTCGT ACGCCTGCTC
GTGCCGGAGT TGACGATCGT CAGGAGCGCC GACCGGGCGA CCGTCGTCGC GGGCGGCACC
GTCCGCTACA CGATCACGGT CACCAACACC GGCGAGACCG GCTACCCCGG CGCCACGGTC
ACGGACCGGC TGGCGGGGGC GCTGGACGAC GCGGTCCACA ACGGCGACGC CGTCGCCACC
ACCGGGGTGC TCGCCTACGC CGAACCGGAG CTGACCTGGA CCGGCGACCT CGCGGTCGGC
GCCACGGTGA CGATCACCTA CTCGGTCGCC GTGGCCTACC CGGCCCGGGG TGACCGGCTC
CTGTCCGGGA CCGTCGTCTC CGCCGTACCC GGTTCGACCT GCCCGGCGGG CGGGACCGAC
CCGCGCTGCA CGGCGACGGC GACCGTCCTG GTCCCGGCGC TCGGCATCAC CAAGACCGCG
GACACCGGCG GCGAGGTCGT CGCGGGCGGC ACCCTCCGCT ACACGGTCGT CGTCACCAAC
ACCGGTGAGG CGCCCTACGA CGCCGCCACG GTCACGGACC GGCTGGCGGG AGTGCTGGAT
GACGCGGTCT ACAACGGCGA CGCCGTCGCC ACCACCGGCG TCCTGGCCTA CGCCGAACCG
GAGCTGACCT GGACGGGCGC GCTACCCGTG GACGCCAGCG CCGTCGTCAC CTTCAGCGTC
ACCGTCGCCG ACCCGGCCAC CGGGAACGCC GAACTGGACA ACCAGGTCAC CTCGACGACC
ACCGGCAGCA CCTGCCCGGC CGGTGGGACC GACCCGCGCT GCTCCGTCGT GACCTCCGTG
GCAGCCACGA GCATGACCCT GACCGGCGCG ACCGAGGACT TCACACTGAC CGGCCCGCCG
AACACGACCG TGCGGGGCGA GGACGTGGTC ACGATGACGG TCGTCACCAA CAGCGTCGAC
GGCTACACCG TCACCGCGCG GGCGGCCGCG GCCGAGCTCT CCCCGGCGCA GCCCGGCGTG
ACCGTCGGCA TCCCCGTCGC CAACCTGCGC GTGCGCGAGC ACGGCACCTC GACCTTCCGG
TCCCTGTCCA CCACCGACCC GGTGCTCGTG TACGACAAGC CGCTCCCGTC GGCACCGGGC
GGGGACGGCA TCAGCAACGA CTACGAGGTC GACATCCCAT TTGTCCCGAC CGGCCGCTAC
ACGGTGACGA TCGACTACGT CGCGACGGCC AGATGA
 
Protein sequence
MSEWPGVRAG TRPGRRGRDV VGAVAWTPRH RRRVARIMAL ALTGSALTGT QAAAAAAAGT 
LLFSQPFRNN TANGTGAVVL PALPSGTGTT NFACLTASGN TSTGVLRSCT TSTDSAGSGK
LRLTNATTSK AGGVFSATSV PTSQGLDVTF NTYQYGGGGA DGITFVLAAV DPANPQSPAN
IGQLGGALGY SANGGSPGLA YGYLGIGFDV YGNFSNSTYQ GSGCTNPAYI GTGSVRVPGQ
VLVRGPGNGT VGYCALNSTA TSTSSSALAL RASARTAVPV QIGINPTSSV LTTAAGLAVP
ANSYRMVVTL VGGATRTLTG TLPSVTSGLY PSSWLNASGI PRQLAFGWVA STGGVTDFHE
IDEAAVSTIS AVPELTVAQT GYIASTLAPG DPVTYNVVAG VAAGLPETSP VSITQTMPAG
TVPVGAYGTG WVCDAPSGRS ITCTNGNGPF AAGAALPALT VVGIVTGGNV TPALVQSATV
ATASSIDASP AYSSSTTAGT LPAAPGGIAV SPALGSIAGG NTVTVSGTNI SNATAVEIGT
TAQQEAGTPV VLLPCAAGVT TGCFTINANG TLTIPSMPAR ATNGAVNINI VTRGLDAAAT
YTYVASPGTP TTPTAVAGVT SATVSWTAPA GNGGAITGYI VTPYRNGVAQ TPVSFDASAT
SRTLTGLTAD VPYTFTVAAV NAIGTGSAGP ASNPVVPYNV PGRPVITAAT AGTSSATLTW
TAPAGNGSAI TGYVVTPYVN GVAQPTQTFN SAATTQSVTG LTPGTAYTFT VTAVNAAGPG
QPSEPSATAT PNSPPAFTFP APPAGEVGAA YSVPLTVSAG TAPYTWSVGA GSLPPGLTLN
ASTGVLSGTP TAAGGYSFTA RVTDAGNVST TREVTLVIAP RPAFTFPAPP GGEVGVAYSV
PLTVSGGTAP YTWSVGAGSL PPGLTLNAST GVLSGTPTAA GGYSFTVKVL DAQNQSDTTA
VSLTIVPQPA FTFPAPPAAQ VGVTYSVPLT VSGGTAPYTW SVGAGSLPPG LTLNASTGEL
SGTPAATGSH PVTFRAVDAN GQATTRAVTL VVTSGPLVVV KTASASSAVA GGTVGYTITV
NNTGPSAFTG VTVNDALAGI LDDAAYNGDA AATAGAVSFA AQTLTWTGDV AAGTTVTITY
SITVNSPGTG NKVLANAVTS PTVGSTCPAG GGDPRCSATV TVAGLSIVKT ADVTTATPGG
TVRFTVTATN NGQTPYTGAT FGDALAGVLD DAVYNGNATA TSGSLSFSGS TLTWTGNLAV
GASTTVTYTV TVRNPDPGDR SLAGTVLSGT PGSTCPQGNP GPQCTAVVTV LVPALAITSS
ADATTTTPGS VVRYTFTASN TGQTPYAGTS FTTSLVGALD DAAFNGDLAA TSGSAVLNPD
GTITWTGDLA VGAAVTVTGS VTVKSPDNGD RVLRTSVTSG APGSTCPVGN QSPACLTGVS
VLVPGLTITK TADVSATTPG SVVRHTIAVT NSGQTPYTAA TVADALAGVL DDATYNADAA
ATSGSVGYAG STLTWVGDLD VGASATITYS VTVRDPDPGD MTLTGTVSSP TTGSNCPAGS
GDSRCAGSVT VLVPQLTITT ATGGATTTPG AVVPYTVTLA NTGQTPYTGA GARFVIADVL
DDATYNGDLT TDAGSLSVAP DGAILWAGDI AAGATVTITG SVTVHAPVTG DKVLRTSVTS
AAPGSTCPVI GATSPGCFTV VTVLVPALTI TNTADTQSAT PGDTVTYTIT VANTGETPYT
GARVTESLTR VLDDAVYNGD AAATTGTVTF AGTDLSWSGD LAVGASATIT YSVTVRDPDP
GDRQIAAVVI SPTQGGNCPA GGTDPRCAAA VAVLVPELTI SKSADATTAA PGSTVQYTVT
VTDSGQTPYT GATVTDLLAG VLDDAVYNGD AAATTGTVGV AGTDLSWSGD LAVGASATIT
YSVTVRDPDP GDALLTSTAV SPARGSNCQA GSTDPRCTVS VPVARLVLEQ GYTRTGAAPG
SVVRLNATFT NTGQVPYTGI RVFSASGDTV DDAIPNGDQV ADSGTLVLDA QGITWTGNIP
VGGVVNITGT LTLKNPPTGD RTLTGTLVSE APGTTCPPGG SDPRCTSRLD VLVPGLTITK
AADTAATVQG GTVGYTVTVT NSGQTPYTGA AFTDALAGVL DDAVYNGDAA ATTGTVGVAG
TDLSWSGDLA VGASATITYS VTVRAPDPGD RSLTGTVSSP TTGNNCAPAS GDPRCTSSVI
VLIPALTITK SVTPTTAVPG STLTYTITAA NTGQLPYTGA AFTDALAGVL DDAVYNGDAA
ATTGTVTFAG TDLSWSGDLA VGASATITYT VTVDNPVTGD RNLASTITSA TPGTTCPAGG
TDPRCGTGVP VTQATTLTFD KSADTRSVAQ GEVVTYTITI SNSGLIPYNG AAFTDSLAGV
LDDAAYNGDA AAGTGLVSVA GPLLSWTGNV PANGSTTVTY SVTAGTPGTG DDILTSTLVS
PSPGGNCEAG GGDPRCAATV TVARLSIVTT ADAPTTEPGD VVRYTTVMTN TGQTPYNGTS
VLFNGYGGLD DAVPGGDQVA TSGSLSLGLD GLTWTGSIPV GGSVTLTGSV TVNNPDLGDR
VIPLTVVSAA QGSTCPVATA PGCTVIVNVL IPELTITKAA DRNAAVPGGA VAYTITIANT
GQTPYTGATA TDSLAGLLDD AAYNGDAAAT TGTVGFAGQT LTWSGDLAVG ATATVTYSAT
ADTPDVGDKL LTNSVVSTEA GSTCPPASAN AACSARVVVL TPALTIVKTA DRASATPGDT
VTYTVNVTNT GQVPFAAADF ADALAGVLDD AVYNGDATAT TGTVTFAGQA LGWTGGLAPG
QGATVTYSVT TGSPGTGDQR LTGEVTSTTA GTTCPAGGTD TRCSNTVLIS RITITASADV
ATAIPTGVVH HTVTIANTGQ TPYGSAVVDG LLADVFDDAA YNGDGTASAG NLTFVPGSGQ
ARWEGPLAVG DTVTVTFSVT VRNPDPGDKV MNAVMTSGTP GNNCPAGSPA PACASAVTVL
TPVLAVSKSA DRSTVTPGGT IAYTITVANT GQAPYTGATV TDRLTRVLPD AVYNGDAAAT
AGTVTFAGSD LTWTGDLAAG ASATITYTVT VRDPDPGDKQ IVNRAFSDTL GSTCPSTGSV
PACTTLVTVL VPALRIVKAA NTVVATPGET VGYTVTVTNT GQTPYTGATV ADALAGVLDD
AVYNGDAVAT SGTVTFAGSD LTWTGDLAAG ASATVDYSVT VDIPDTGDRL LTGAATSNAP
GSTCPAGTTD PACVSTVTVL IPGLAVSTVA DRATTTPGGT ARYTVTIANT GQTAYSGISV
SDVLTEVLDD AAYNGDATAT AGTVVFSGPV LTWTGDLATG ETVTVAYTVT VADPDTGDKV
MTGTVASSAP GSTCPVGSTA PACGATVTVL IPALDIVKTA GAPATVPGGT VGYTITVTNS
GQTPYTGASV ADSLQGLLDD AAYNGDAAAT TGVLAYAEPV LTWTGDLAVG ASATITYSIT
ANGTATGDKT LTNVVTSDAP GSTCPAQGTA PACSTLVRLL VPELTIVRSA DRATVVAGGT
VRYTITVTNT GETGYPGATV TDRLAGALDD AVHNGDAVAT TGVLAYAEPE LTWTGDLAVG
ATVTITYSVA VAYPARGDRL LSGTVVSAVP GSTCPAGGTD PRCTATATVL VPALGITKTA
DTGGEVVAGG TLRYTVVVTN TGEAPYDAAT VTDRLAGVLD DAVYNGDAVA TTGVLAYAEP
ELTWTGALPV DASAVVTFSV TVADPATGNA ELDNQVTSTT TGSTCPAGGT DPRCSVVTSV
AATSMTLTGA TEDFTLTGPP NTTVRGEDVV TMTVVTNSVD GYTVTARAAA AELSPAQPGV
TVGIPVANLR VREHGTSTFR SLSTTDPVLV YDKPLPSAPG GDGISNDYEV DIPFVPTGRY
TVTIDYVATA R