Gene Arth_3098 details

Gene Information

Locus tag: Arth_3098
Symbol:
ID: 4444331
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 3472121
End bp: 3482695
Gene Length: 10575 bp
Protein Length: 3524 aa
Translation table: 11
GC content: 68%
IMG OID: 639690925
Product: amino acid adenylation domain-containing protein
Protein accession: YP_832577
Protein GI: 116671644
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1020] Non-ribosomal peptide synthetase modules and related proteins
TIGRFAM ID: [TIGR01733] amino acid adenylation domain
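
The listed gene and protein lengths can be cross-checked against the start and end coordinates. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, and that the final codon of this unspliced CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of the database.

```python
# Coordinates copied from the record above.
START_BP = 3472121
END_BP = 3482695


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(n_bases: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if n_bases % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return n_bases // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 10575 3524
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.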


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACCCGA ACTCCTTGAA TGCCCGGTCA TCCGCTGCTG AAGGAAGGCT GGACCTGACG 
GCGGCGCAGC GCGGCATCTG GTACGCGCAG AAACTGGCGC CGCAGAATCC GATGTACCAG
ATCGGCCAGT TCGTCGAAAT CGAAGGGCCG TTGGAGGCGG ACGTCCTGGC CAGGGCGGTA
GCGTGCGCGG TCAGTGAAAC CGACGCCCTC AACGTGGCAT TCGGCGAGGA CTCTTCCGGC
CCTTTCCAGT ATCCGCGCTC CAACCGTTCC GGCCTGGTGG TCACCGACTT GAGCGGGACG
AACGACGGCG GCAACCGGTC CGGCGGCAAC CGCGGCGCGA GCGCCGCCCG GGAACTTATG
GACTCTGACC TTGCCATGCC GAGGGACGTT GCGGCCGACG AACTGCTGCA CACGGAGCTC
ATCACACTCT CGGAGACCAA GCACTTCTTC TACCAGCGTG TCCACCACCT GCTGCTGGAC
GGATACTCCG CGGTGCTGGT CCTCAAGCGG GTGGCAGAGC TGTACAACAG CCTCCTCGGT
CGGGACAGCG AGGCTACCGC CCCCGCCATC TTCGGGTCGT TGTCTGAGCT GGTGGACACA
GAGAGCGGGT ATGCCGGGTC CCCGGCTGCA GACGGCGACC GCAGCTACTG GGAAGAGCAG
TTGCGTGGCG CTGCCGTCCC GGCGGGCCTG GCCGGCCAGC CCCAAGGAAC CGCACGCTCG
CTGATTCGCG CCGTCAGGGC CCTCCCCCGG GAAACCGCTG CGGCGGTGGA GGCTGCCGCT
GCCTCGGCAC CGGCACTGGT ACTCACCGCA GCCTCGCTCT ACGTGCACCG CATCACCGGC
GAGCGCGATG TGTCCCTGGC ACTGCCGGTG ACCGCACGGC GCGGCAAGCT GGCGAAATCG
ACGCCGTCCA TGCTGTCCAA CATCGTTCCG ATCCGGATGG GCATCGCGCC CGGCGCCAGC
GTCGGAGAAA CCATCACCGC CATGGGCGCC AAACTGCGCG GGGCACTCAT CCACCAGCGA
TTCCGCTACG AGAACCTTGC GGCACAGTCC GGCTATGTTG GCCCGTCGGT GAACATTCTT
CCGGCATTGG ATGCCATTTC CTTTGGCCCC GCCCGCGGAA CCATGAACAT CCTCTCCACC
GGCCCCATCG ACGACCTCTC CATCATCGTT CACGGCCTCG GCCAGGGAGG CGGAGTGACG
GTACAGTTCG AGGCCAATGC CGAGCTCTAC ACTGCGCCGC AGCTGGAACA GCACCTCGAC
CGCTTCGTTC GGATACTCGG GGCCGTGGCC ACCCTGCCCC ACGCGACCAT GGCGTCGATG
CCGGTCACCA CCGCCGAGGA GGAGCGCTTG CTCCTCGCGG CCGGCGACGC CGGGGACGCC
GCCCTCCCAG GCCATACGAT CGTGGAAGAG TTCCAGCTCA ACGCCCGGAA CTCCGGAGAC
CGGACCGCCG TCGTGGCTCC CGACGGCGAG CTGACCTTCG CTGAGCTGGA GCGTCGCTCC
AACCAGCTGG CCAGGTTCCT GAAAGGGCAC GGGGCCGGCC CCGGCAAAAC CGTGGCCGTG
CGGCTGGACC GCAGCGTGCT CCTTCCGGTC GCGCTCCTTG CAGTCCTCAA GTCCGGCGCG
GCCTACCTCC CCCTGGACCC CGACTATCCC GCGGGACGTG TTGAGGGGAT GCTGGAGGAC
GCATCGCCGG TCCGCCTGCT GACCTCGGCC GCCTTCACGG GAAGCGCCGC GAGCCATGAA
GAGCTGGAGA CCAGCGTTCC GGTCACCGTC CTGGACTCGG CCCTGATGGT GTCCTGCCTC
GACGGCAAAG ACCCGTCAGC GCCGGAGCCC TCGGCCGGGC AGCACGACCT TGCCTACGTC
ATCTTCACTT CGGGCTCCAC CGGGCGGCCA AAGGGCGTCG GCGTGGGACA TCTTGCGCTC
CTGAACCTTT ACACCTCCCA CCGGGACAAC ATCTTTGCGC CGGCGGAGCA GCGGCTAGGC
CGGAAACTGA AGGTTTCGCA CACCGCCGGT TTGTCCTTCG ACGCCTCGTG GGACCCCATC
CTGTGGCTCA TCGCCGGCCA TGAACTCCAT GTGGTGGACA ACCTCACCCG CCGCGACCCG
GAGGACCTCA GCCGGTACCT GTCGGCCACC GGCATCGACT CGATCGAAAC CACCCCGTCC
TTCGCCAAGG TGCTGCTTTC CGGCGGCCTC TTCGACCAGG GGACCCACCC GACGGTGGTG
GCCCTTGGCG GCGAAGCCGT GGACGCGTCG CTGTGGAGCA CGCTGGCGGA AAAGAACGGC
GTGGTGGCCT ACAACTTCTA CGGCCCCACC GAGACCACCG TGGACTCACT CACCGCGGTC
ATGGAACCCG GCACCGAACC CACTTTGGGT GATTCTGTGG CCAACAGCCG CCACTACATC
CTTGACTCGG GCCTGAACCC GGTGCCCGTG AACGCCATCG GCGAGCTGTA TGTGGCAGGC
ATCAACCTGG CACGCGGCTA CGTTGACCAG CCGGGCCTGA GCGCCGAACG CTTCGTGGCT
GACCCCTTCG TGCCGGACGG CTCCCGGATG TACCGCACCG GCGACGTCGT CCGGCGGCTC
CCTGACGGAA CCCTTGAGTT TAGGGGCCGC ATGGATGCGC AGGTGAAGAT CCGGGGATTC
CGGATCGAGC TCGCCGAAAT CGAGGAAGCC CTGCGCGGCC TAGCCGGCGT TGATCAGGCT
GCGGTCACTG TGTCCAAGAA CCGGGCGGGC TACGACCAGC TGCTGGGCTT CGTCACGCCG
GCCGGCGGCC TGGAGGATGA ACTGGACGTG GCGGAACTCC GGCGCCAGGT CCGCAGGCAG
CTTCCCGATT ACATGGTGCC CGCCTCCATC GTGCAGATCA CTGCCATACC GCTGACCCCC
AACGGCAAAC TGGACACCCG CGCCCTGCCG GCCCCGGCCC GGGAAACAGC CGTCAGCTCG
CCCCGCAACG AACGTGAGCG GCTCGTGGCG GACGCCTTCA AGGAAGTTCT GGGACTCGAC
GCCGTCGGGC TGGATGACGA CTTCTTTGAA CTCGGCGGCC ACTCCTTGCT GGCCACCAGG
CTCGTGGCCC ACCTGCGTGA TACCGCAGGC GTGGCGCCGG CGCTGCGCAC GGTGTTTGAG
CATCCCACGG TCACTTCACT GGCGGAGACG CTGGAACTGG CAGCAGCCAA CGCCCACCCG
CTCACGCCAA CGGAACGTCC CGCAGCAATG CCGCTGTCCT TTGCCCAGCG CCGGCTGTGG
TTCCTGAACC GCTTCGATCC CGAGTCCGGC GCCTACAACA TCCCGGTTGT CCTCGACCTG
AAGGGCCGGC TGGAGGTCTC CGCGCTCCAC CGGGCAATCA ACGATGTTGC GGCCCAGCAT
GAAACCCTCC GAACGCTCTT CCCCCTTGCC GACGGCGAAC CCGTCCAGCA GGTCCTTCCC
GCCGGCGAGC GGCCAGTGGA CCTGCTGGGT GTCCAGTGCA CCGCCGGGGC ACTCGCCGAC
GCCGTGGCGG CCGAAACCCG CCGTGGCTTC GACGTGGCCC GGGAGCTGCC CATCCGGGCG
GTCCTCTTCC AGCTGGCCCC GGACCATCAC GTGCTGGCCA TTACGCTCCA CCACATCGCC
GCGGATGGCT GGTCCCTGGC GCCGTTGGCG CGAAGCCTCT CCGTGGCCTA CAACGGCCAT
GTGTCCGGTC ACGGCGCCCT GCTCCCGCCG CTCCCGCCGC TCCCGCCGCT CCCGCCGCTC
CCGGTCCAGT ATGCCGACTA CACGCTCTGG CAGCGCGACG AACTTGGCAG CGAAGAAGAC
CCGGACAGCC CCATCTCCCG GCAGCTGGAA TTCTGGGCCA GGGAGCTCAA AGGCGCACCG
GAGGAACTCC GGCTTCCCTT CGACTTCGCC CGCGGCGCGC AGCCCGCGGG CGAACCGGCT
TCGTCGGTTC CGCTAGCGCT GTCAGCCGAA ACAGCAAACC GGCTCAACCA GCTGGCCCGG
GAACACAATG CCAGCCTGTT CATGGTGCTG CAGGCAGCGC TGGCAGCACT GCTCACCAAG
GCAGGCGCCG GCGAGGACAT TCCCCTGGGC ACCCCCGTTG CCGGACGGAC GGACACGCAG
CTCAACGAAC TCGTGGGCTT CTTCGTGAAC ACGCTGGTGC TGCGGACCAC CACGTCCGGG
AACCCCACGG CGGCGGAGCT GGTGGAAAGC GTCAGGTACA CCAACCTGCA CGCGTACGCC
AACCAGGACG CACCGTTTGA ACGCGTGGTG GAAGAACTCA ACCCCGCCAG ATCCCAGCAC
CGGCACCCGC TGTTCCAGGT GATGCTCACG CTGCAGAACA ATGCGCCTGC GGGGTTGTCC
ATGGACGGCC TCGAGGCCAG CGCAGACGCC AGCCACGAGC CGGGCGGGGC GAAGTTCGAC
CTTCTGCTGG ACCTTGCTGA GGAAGCGCAC GACTGCAGCA TCCGGGGTGC ACTCGCGTAC
AACCCGGCGC TGTTCGCCCG GGCCACTGCC GAACAGCTTG CGGCCGGTTT CCGGGCGGTT
GCGGAGCAGT TCGCCGCTAA TCCCGGCATC ACCTTGGACC GATTGCAGGT CCAGTCACCC
GGCCAGCTAG CACGCGTGAT GGAGCAGAGT CGCGGCGTCC AGGCAGCCTC CCCGTGGAAT
ACGGTCCTGG ACGCCTTCCA GGACACCCTT GAACGGACAC CGGACGCGCC GGCCCTCACG
GACGGCTGCG GACCGGCTGC CACGTTCAGC CAGCTGCACT CACGGGTGAA GTCACTGGCC
AAGGGCCTGG TGGCGTCCGG CGTGGAGCCC GGAGACCGGG TGGCGGTGGC CCTGCCGCGA
TCGTCCGACG TCGTTGCTGC CGCCCTGGCC GTACTGGCCG CCGGCGCCGT GTACCTCCCC
GTCGATCTCT CCTATCCGGC TGCACGGATC CGCATCATCC TCGAAGACGG CGGACCCGCC
GTCGTCATTG CTGCTGCCGG GGACCACGCC GCTGAGTTTC ACGGCAAAGG CGCGGAAGGC
CCCCGGATTC TCGACGTCGA CGCCCTCCTG CAGGCCGGCG CGGGAGTCCC GGATGCCACG
CTGGCCGGGC GGTACCCCGA CGCCGATGAT CTCGCCTACG TGCTCTACAC CTCCGGTTCC
ACCGGCCGGC CCAAGGGGGT TGCCGTGGCG CACAGCGCCC TTGCCAACCT CTTCGGCCAC
CACCACCGGA CCCTCTTTGC CCCGCGATTG GCGGCGTCCG GTGCAGAACC GGTGGCGGTG
GCCCACATCG CCGGGCTCGG CTTCGACGCC GCGTGGGATC CGATGCTCTG GATGATCGCC
GGCGCCGAGC TGCATGTGGT GGGCGACGAC ATCCGCAGCG ATGCCGAAGC CCTCGCCCGG
TACTGCGTTT CCCACGGAAT CGACGTACTG GAAACCACTC CCTCCTACGC AGCCCAGCTG
CTGCAATGCG GCCTGCTGGA TGCGCCGCGG GCACACCCCC TGCTTCTTGC CCTGGGCGGT
GAAGCGGTCA GCCCGGAGCT GTGGCAGCAG CTGGCCTCCA CAGCGGGCGT GGAGGCGTAC
AACTTCTACG GCCCCACCGA ATTCACGGTG GACTCCGTGA CGGCCCGCAT CACCGGGGCA
ACACCCACTA TCGGCCGCGG CATCGGCAAC ACCGACGCCT ACGTGCTGGA CCAGTTCCTG
GCCCCGGTTC CGGCCGGCGT GCCAGGCGAG CTGTACCTCG CCGGGCCCGG CGAGGCTCGC
GGCTACGATC AGCGCCCCGG CGAGACCGCC GCACGATTCG TGGCCAACCC CTTCGTTGCC
GACGGCAGCC GGATGTACCG CACCGGCGAC CTGGTCCGGA GAGCCGCGGA CGGGTCGTTG
GAATTCCTCT CCCGTACCGA CGATCAGGTC AAAGTCCGAG GCTTCAGGAT CGAACTGGGC
GAAATCGAGG CAGCCGTGGC GTCCCACCCG GACGTGTCCC GGGCCGTGGC CGTGGCCGAC
GGTGATCCCG CCCATCGCGT GGTTGCCTAT TACACGGGTG CGGCCAGCCC GGCGGAATTG
CGCGGCGTGG CCGGTGAAAA ACTGCCGGAC TACATGGTGC CCGCCGTGTT CATGAACGTG
CCCGCCATCC CGCTCACGGC CCATGGCAAA CTGGACCGGA AGGCGCTGCC TGCGCCGGCG
TCGGACACGG GTACCGGACA GGGGGCCGCA CCGGCGACGG CGGACGAGCA CACCATGTGC
GGAATCTTTG GCGAAGTGCT GGGTGCCGAC AACGTGACGA TGGGCGACGA CTTCTTCGTG
CTCGGCGGGC ACTCCTTGCT GGCCATCACC ATCATGGGCC GGATCCGTGA GGCCTTCGGC
ACTGAACTGC CGCTGCGGAC CCTCTTCGAC CGGCCCACCC CCGGGGGCCT GCTGGCGGCC
ATCGGCCAGC GGAACGGCAT GGCGGACGGC CCGGTCATAA CGCGTCCGGT CACGGCTACT
GACGGCGTCC CGCCCGCTTT CCACGAAGGA GACTCGCAAC CACTGGCGGA GTGGCTGGAC
AGCGAGGCCG CGGTCCGGCC TGAGCGGCTG GAACTTTCCT TTGCCCAGCA GCGCATGTGG
TTCCTCAACC AGCTGGATCC GGGATCCTCC GACTACAACA TTTCCCTCGC CGCGCGCCTG
GGCGGGGAGT TGGACGAACG GGCCCTGGCT GCTGCTGTCA GCATGCTGTT CAGCCGGCAC
GAGATATTGC GGACCCTCTA CCCCGCCACG GATGGCGTGC CGGAGCAGCT GATCCTTGAA
CCGGATACCG ATCCGCGGGC CGTTGGGCTG CTTGACATCA CCGACTCGGA TTCCGAGGCC
GGGGTCACGG CCCTGCTGCG ACAGGACGCC GAACGCGGCT TCGACGTCCG GTCCGAGCTT
CCGCTGCGGG CACGGCTTAT CCGGACCGGA TCCGCCGGCG CCGGCGGGGC CGGAACAGGC
GAATGGGTCC TCCACCTGGT GATGCACCAT ATTGCAAGCG ATGGCGCGTC CTTGGCCCCG
CTGGTCCGGG ACCTGTCGGT TGCCTACCAG TCCGCCCTTG CCGGACATTC CGGTCCGCCG
CTGGCTCCAC TGCCGCTGCA GTACGCGGAC TTCAGCGCCT GGCAGCGGCA GCAGCTGGAC
CGAGGAGCTG CCGCCGGCCC GGACGCCAGT TTCGGAGCAT CCGCCATGGC ACCCAAGGTG
GAGCACTGGC GGCGCACGCT GGCCGGCATT CCGGCGGAAC TCCTGCTCCC GGCGGACCGC
CGGCGTCCCC GCGAAGCGCG CCAGCCCGGA GGCCAGCTGT CATTCCGGCT GCAACCCGCG
GCCGTGGACG GGCTCAACAG CCTGGCGGCG TCGGTCAATG CGAGCCAGTT TATGGCCCTG
CACGCCGGCC TGGCCGCCTT CCTTCACCGC ACGGGCTGCG GTGACGATCT GGTCATTGGT
TCCCCCACGG CGGGGCGGAC AGATCCGGCG CTGAACGATC TGGTGGGCTT CTTCGTCAAC
ACTCTGCCGC TGCGCGTCGG TGCGGACGGA GACCCGAGCC TGCGGAGCAT GGTCTCGCGC
TCCCGGGAAA GCATCCTGGC TGCGTTCGAC AACGACGTAC CCTTCGAGCG GCTTGTTGAG
GCCGTCAATC CGGACCGCGA ACTGGGCCGC CATCCGCTGT TCCAGACAAT GCTCACCGTG
GACAGCGAGG CGCCGGCGGT TCCGCAGCTG CCAGGCGTCG TGGTCACGCC CGTGCCGGAG
ACGGCCTCGG GCGAGGCCAA GTTCGATCTC TCCTTCACCT TCAGGCCGGA CGCCGGAGAC
GGCCTGGCCG GCACCCTCGA TTACAACGCG GCCATGTTCG ACGAGGCGAC GGTCCGGCGA
ATGGTGCACA GCTTCGGCCG GTTCATCGGG CTCGCCGCCG CGTCCCCCGA CACGCCCTTG
TCGCTGCTGC CCCTGCTTGA GGACCATGAG GCCCGCGCCC TGATGACGTC CACCGCCAAT
CCCGCCGTCA CCGCCGGCAA GCAGGCGGAA GGCATCTTGT CCGCCCTGGC ACAGACGGTG
GCTGCAACGC CCGCCGCCAC CGCGGTGTCT GCGGACGGGA AGACCCTGAC GTTTGCGGAG
CTGGCCGCCT CCGCTTCACG GATCGCCGCT GCCCTGACAG CCGGGGGCGT CGGCAGCGGG
GATGTGGTTT CCGTGATGCT CCCCCGCTCC CCGGGCACTG TTGAAAGCAT GTTCGGCGTC
ATGGCCGCGG GCGCCGCCTA CAACCCCATC GACACGGAAT ATCCGGACGA CCGCGTGGCC
GCCATCTTCG AGGACGCGGC TCCCCCGGTG ATCGTCACCA CCAGGGCCGT GGCAGGGCGG
GTCCGGCAGA TCATTGCATC ACTCCCCGGG GCCGGTCCGC GGCTGGTCCT GCTGGAGGAG
CTCGCAGGCG CCCCGCAGGC GGCCAAGGGA TCCGACAACG AACCGTCAGC AGCCGTGTTT
GCCCGGCCCG GCCCCCGCGA CCTGGCATAC GTGATGTTCA CGTCAGGCTC CACGGGCCGG
CCAAAGGGTG TCGAAATCAG CCATGGCGCC CTCGCGTCGC TTCTTGCATC GCACCGGCAC
ACGTTGCTGG CAGACACCGG TGGCCCGCGG CGCGTGGCCC ACACCACCGG GGTGGGCTTC
GACGCATCCT GGGATCCGAT CCTCTGGATG GTGGACGGCC ACGAGCTGCA CCTGATCGAC
GACGCAACGC GCCGGGATTC CGAGCGGCTC GCCGCCTACT TCGCCGAGCA TGGAATCTCA
GTCTGGGAGA GCACCCCCGG CTACCTGCGC CAGCTGCTCG GCGAGCCCGC TTTCACCGCG
CTCCTGGACG CGCGCGCTGC CGCTGCGGAT CCGTTCCGCC TCGCCCTGGG CGGAGAAGCG
TTCGACGCCG GGCTGTGGGG CACCGTTTCC GCTCACCCGG GCCTTGAGGC GTGGAATCTC
TACGGCCCCA CCGAGGCCAC GGTGGACACG GTGCTTGCAA GGGTGGGAGA CACATCCGCG
CCTGTGCTCG GACAGCCTAC GGCGGCCACC CGGCTATACG TCCTCGACGC CAGGTTGCAG
CATGTCACGG CCGGCGCGGC GGGCGAACTG TACGTCGCCG GCCCCCAGCT GGCCCGAGGC
TACAGGGGCA GGCCGGACCT GACATCCGAA CGATTCGTGG CCGACCCCTT CGCCGGCCGC
GGCGAACGCA TGTACCGCAC CGGCGACGTG GTGTACCGGC ATGCCGACGG CAGGCTGGTG
TTTGCCGGAC GCAACGACGA CCAGCTGAAG ATCCGCGGCT TCCGGGTGGA GCCGGGTGAA
GTGGAGCGGG CGGTGCGCAG CACCAAAGGC GTCCGGGAGG CGGTGGTTCG CGCAGCGGTC
AACGACGCCG GCACCCGCCT CGTCGCGTAC GTCGTCCCCG CGAACAGTCC GGCCATGGCC
GATGCTGAGC TCTCCGACGT CGTCAGAACG CACGTTCGCG GGCTCGTGCC CGATTACATG
GTGCCGTCCG CCGTCGTCGT CCTCGACAAA ATCCCGCTGA CCCAGCACGG CAAGGTGGAC
GCGTCCGCGC TGCCGGACCC GGGCCGGACC GAACGCAGCG GAGGAAAGGC GCCGCGGAAT
CCGAAGGAAA AAACGGTGGC GCGGATCTTC GCCGAGGTGC TGTCCCTGGA CCGCGCCGGA
GTGGACGAGT CGTTCTTCGA ACTGGGCGGG CATTCCTTCC TGGCCCAGCC GCTGATCGCC
AGGATCAACG CCGCGCTGGG GACCTCGCTC CAGGTGCAGT CGCTGTTCCG TTCCCCGACG
GTTGAGGGCC TTCTCCGTGA AGCCGCGCAA GGCGGGGAGG AGAGCACGGC GGACAGCCTG
AAGCAGCTGT TGCCGCTGCG CACGGCCGGC TCCAAGCCGC CGCTCTTCGC CGTGCACCCT
GCTTCGGGCA TTGGCTGGGG ATATGCCTCC ATGCTGGGCC GGCTGGATCC CGAACGGCCC
CTCATCGGGC TGCAGATGCC GGGAATGGAG CCCGGCCGGA CCCATCGTGT GGGTGCATCC
ACCCTGACGG AACTCGCGGA CGACTACATT GCCCGGCTGA GGTCAGTCCA GCCCGAAGGC
CCGTACCACC TGATGGGCTG GTCCTTCGGC GGACATTTGG TCCACCGGGT GGCCACCAGG
CTTCAGGCGC TGGGCCACGA GGTGGCCTTC CTCGCCATCC TCGATGCCTT CCCTGGCAAC
CAGGAACATA ATGCGGACGT CGGGACCGGA CCTGCCCTGT GGGCAAGCTA CCTCGCGGCG
CAGGGCTTCG AGCTGACGCC GGAAGAGGCC GCAAACCTCG ACGGAGTGCG CGCACTGGAG
ATACTCCGGG AGAACCACAA TCCGCTGGGA AGCGTTCCGC TGGATTCGGC GAATGCAATG
GTGGAGAACT TCCCGGACCT TGCCCGCCTG ATCCGCGGCC AGGAACCGGA GGTGTTCCAC
GGCGACCTGC TGTTCTTCCG GGCCACGCGG GACGTCCCGG AGGGGACTCC GGGGACCGAT
GCCTGGCAGC CCTTCATCAC CGGGGCGGTC ACCGATGTGG CCGTGGAAGA GCGCCACTCG
CAGATGCTAA GTGACGCGGC TCTCAGTGTG ATTGTGCCCG AAATAGCAAT CCGGCTGGAC
GTCTCCACCG AATAA
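
The gene sequence above is printed in blocks of 10 bases, so the spaces and line breaks have to be removed before any analysis. The following is a minimal sketch of that clean-up together with a GC-fraction calculation; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first line of the sequence is used here, so the printed fraction is not expected to equal the 68% reported for the whole gene.

```python
def clean_sequence(block: str) -> str:
    """Strip spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGAACCCGA ACTCCTTGAA TGCCCGGTCA TCCGCTGCTG AAGGAAGGCT GGACCTGACG"
seq = clean_sequence(first_line)
print(len(seq))                   # number of bases on the first line
print(f"{gc_fraction(seq):.1%}")  # GC fraction of this line only
```

Passing the complete gene sequence to the same two functions would give the figure that corresponds to the GC content field.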
 
Protein sequence
MNPNSLNARS SAAEGRLDLT AAQRGIWYAQ KLAPQNPMYQ IGQFVEIEGP LEADVLARAV 
ACAVSETDAL NVAFGEDSSG PFQYPRSNRS GLVVTDLSGT NDGGNRSGGN RGASAARELM
DSDLAMPRDV AADELLHTEL ITLSETKHFF YQRVHHLLLD GYSAVLVLKR VAELYNSLLG
RDSEATAPAI FGSLSELVDT ESGYAGSPAA DGDRSYWEEQ LRGAAVPAGL AGQPQGTARS
LIRAVRALPR ETAAAVEAAA ASAPALVLTA ASLYVHRITG ERDVSLALPV TARRGKLAKS
TPSMLSNIVP IRMGIAPGAS VGETITAMGA KLRGALIHQR FRYENLAAQS GYVGPSVNIL
PALDAISFGP ARGTMNILST GPIDDLSIIV HGLGQGGGVT VQFEANAELY TAPQLEQHLD
RFVRILGAVA TLPHATMASM PVTTAEEERL LLAAGDAGDA ALPGHTIVEE FQLNARNSGD
RTAVVAPDGE LTFAELERRS NQLARFLKGH GAGPGKTVAV RLDRSVLLPV ALLAVLKSGA
AYLPLDPDYP AGRVEGMLED ASPVRLLTSA AFTGSAASHE ELETSVPVTV LDSALMVSCL
DGKDPSAPEP SAGQHDLAYV IFTSGSTGRP KGVGVGHLAL LNLYTSHRDN IFAPAEQRLG
RKLKVSHTAG LSFDASWDPI LWLIAGHELH VVDNLTRRDP EDLSRYLSAT GIDSIETTPS
FAKVLLSGGL FDQGTHPTVV ALGGEAVDAS LWSTLAEKNG VVAYNFYGPT ETTVDSLTAV
MEPGTEPTLG DSVANSRHYI LDSGLNPVPV NAIGELYVAG INLARGYVDQ PGLSAERFVA
DPFVPDGSRM YRTGDVVRRL PDGTLEFRGR MDAQVKIRGF RIELAEIEEA LRGLAGVDQA
AVTVSKNRAG YDQLLGFVTP AGGLEDELDV AELRRQVRRQ LPDYMVPASI VQITAIPLTP
NGKLDTRALP APARETAVSS PRNERERLVA DAFKEVLGLD AVGLDDDFFE LGGHSLLATR
LVAHLRDTAG VAPALRTVFE HPTVTSLAET LELAAANAHP LTPTERPAAM PLSFAQRRLW
FLNRFDPESG AYNIPVVLDL KGRLEVSALH RAINDVAAQH ETLRTLFPLA DGEPVQQVLP
AGERPVDLLG VQCTAGALAD AVAAETRRGF DVARELPIRA VLFQLAPDHH VLAITLHHIA
ADGWSLAPLA RSLSVAYNGH VSGHGALLPP LPPLPPLPPL PVQYADYTLW QRDELGSEED
PDSPISRQLE FWARELKGAP EELRLPFDFA RGAQPAGEPA SSVPLALSAE TANRLNQLAR
EHNASLFMVL QAALAALLTK AGAGEDIPLG TPVAGRTDTQ LNELVGFFVN TLVLRTTTSG
NPTAAELVES VRYTNLHAYA NQDAPFERVV EELNPARSQH RHPLFQVMLT LQNNAPAGLS
MDGLEASADA SHEPGGAKFD LLLDLAEEAH DCSIRGALAY NPALFARATA EQLAAGFRAV
AEQFAANPGI TLDRLQVQSP GQLARVMEQS RGVQAASPWN TVLDAFQDTL ERTPDAPALT
DGCGPAATFS QLHSRVKSLA KGLVASGVEP GDRVAVALPR SSDVVAAALA VLAAGAVYLP
VDLSYPAARI RIILEDGGPA VVIAAAGDHA AEFHGKGAEG PRILDVDALL QAGAGVPDAT
LAGRYPDADD LAYVLYTSGS TGRPKGVAVA HSALANLFGH HHRTLFAPRL AASGAEPVAV
AHIAGLGFDA AWDPMLWMIA GAELHVVGDD IRSDAEALAR YCVSHGIDVL ETTPSYAAQL
LQCGLLDAPR AHPLLLALGG EAVSPELWQQ LASTAGVEAY NFYGPTEFTV DSVTARITGA
TPTIGRGIGN TDAYVLDQFL APVPAGVPGE LYLAGPGEAR GYDQRPGETA ARFVANPFVA
DGSRMYRTGD LVRRAADGSL EFLSRTDDQV KVRGFRIELG EIEAAVASHP DVSRAVAVAD
GDPAHRVVAY YTGAASPAEL RGVAGEKLPD YMVPAVFMNV PAIPLTAHGK LDRKALPAPA
SDTGTGQGAA PATADEHTMC GIFGEVLGAD NVTMGDDFFV LGGHSLLAIT IMGRIREAFG
TELPLRTLFD RPTPGGLLAA IGQRNGMADG PVITRPVTAT DGVPPAFHEG DSQPLAEWLD
SEAAVRPERL ELSFAQQRMW FLNQLDPGSS DYNISLAARL GGELDERALA AAVSMLFSRH
EILRTLYPAT DGVPEQLILE PDTDPRAVGL LDITDSDSEA GVTALLRQDA ERGFDVRSEL
PLRARLIRTG SAGAGGAGTG EWVLHLVMHH IASDGASLAP LVRDLSVAYQ SALAGHSGPP
LAPLPLQYAD FSAWQRQQLD RGAAAGPDAS FGASAMAPKV EHWRRTLAGI PAELLLPADR
RRPREARQPG GQLSFRLQPA AVDGLNSLAA SVNASQFMAL HAGLAAFLHR TGCGDDLVIG
SPTAGRTDPA LNDLVGFFVN TLPLRVGADG DPSLRSMVSR SRESILAAFD NDVPFERLVE
AVNPDRELGR HPLFQTMLTV DSEAPAVPQL PGVVVTPVPE TASGEAKFDL SFTFRPDAGD
GLAGTLDYNA AMFDEATVRR MVHSFGRFIG LAAASPDTPL SLLPLLEDHE ARALMTSTAN
PAVTAGKQAE GILSALAQTV AATPAATAVS ADGKTLTFAE LAASASRIAA ALTAGGVGSG
DVVSVMLPRS PGTVESMFGV MAAGAAYNPI DTEYPDDRVA AIFEDAAPPV IVTTRAVAGR
VRQIIASLPG AGPRLVLLEE LAGAPQAAKG SDNEPSAAVF ARPGPRDLAY VMFTSGSTGR
PKGVEISHGA LASLLASHRH TLLADTGGPR RVAHTTGVGF DASWDPILWM VDGHELHLID
DATRRDSERL AAYFAEHGIS VWESTPGYLR QLLGEPAFTA LLDARAAAAD PFRLALGGEA
FDAGLWGTVS AHPGLEAWNL YGPTEATVDT VLARVGDTSA PVLGQPTAAT RLYVLDARLQ
HVTAGAAGEL YVAGPQLARG YRGRPDLTSE RFVADPFAGR GERMYRTGDV VYRHADGRLV
FAGRNDDQLK IRGFRVEPGE VERAVRSTKG VREAVVRAAV NDAGTRLVAY VVPANSPAMA
DAELSDVVRT HVRGLVPDYM VPSAVVVLDK IPLTQHGKVD ASALPDPGRT ERSGGKAPRN
PKEKTVARIF AEVLSLDRAG VDESFFELGG HSFLAQPLIA RINAALGTSL QVQSLFRSPT
VEGLLREAAQ GGEESTADSL KQLLPLRTAG SKPPLFAVHP ASGIGWGYAS MLGRLDPERP
LIGLQMPGME PGRTHRVGAS TLTELADDYI ARLRSVQPEG PYHLMGWSFG GHLVHRVATR
LQALGHEVAF LAILDAFPGN QEHNADVGTG PALWASYLAA QGFELTPEEA ANLDGVRALE
ILRENHNPLG SVPLDSANAM VENFPDLARL IRGQEPEVFH GDLLFFRATR DVPEGTPGTD
AWQPFITGAV TDVAVEERHS QMLSDAALSV IVPEIAIRLD VSTE
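
The relationship between the gene sequence and the protein sequence can be illustrated by translating the first line of the gene and comparing it with the first 20 residues listed above. This is a sketch, not the database's own method: it uses the standard codon-to-amino-acid assignments (which translation table 11 shares with the standard code) and does not handle the alternative start codons that table 11 permits, which is harmless here because the gene starts with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # remove block spacing and line breaks
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence and of the protein sequence from the record.
gene_start = "ATGAACCCGA ACTCCTTGAA TGCCCGGTCA TCCGCTGCTG AAGGAAGGCT GGACCTGACG"
protein_start = "".join("MNPNSLNARS SAAEGRLDLT".split())

print(translate(gene_start))                   # MNPNSLNARSSAAEGRLDLT
print(translate(gene_start) == protein_start)  # True
```

The same function applied to the last 15 bases of the gene (`GTCTCCACCG AATAA`) yields `VSTE`, matching the end of the protein sequence, with the final `TAA` read as the stop codon.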