Gene Ent638_3119 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEnt638_3119 
Symbol 
ID5111672 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEnterobacter sp. 638 
KingdomBacteria 
Replicon accessionNC_009436 
Strand
Start bp3391534 
End bp3402192 
Gene Length10659 bp 
Protein Length3552 aa 
Translation table11 
GC content61% 
IMG OID640493318 
Productputative outer membrane adhesin like proteiin 
Protein accessionYP_001177834 
Protein GI146312760 
COG category 
COG ID 
TIGRFAM ID[TIGR01965] VCBS repeat
[TIGR03661] type 1 secretion C-terminal target domain (VC_A0849 subclass) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0912825 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACTCCGC CGGGGGCGCT CATTACCCGC AAAAAATCTG GCTTCGTCTT GCACGCCATT 
AAAAAAATCT GTCGGGAGTC ATTATATATG AGCCAAATCT CTGTCATTTC GAAACTCACT
GGCGTGGAAA CCACCACGGA AGGTAATCAA GTCTCTTTGG GTCAATCCTC AATTGTTAAG
CTCCACGTGG GTAGAGCGGA TATTTCCCAC TACGCCCGTA ACGGAAACGA CCTGGTCGTC
AGCCTGAACT CCGGCGAAAC CATTACGCTT AAAAACTTCT ACGTCGGTGA TGCGCAGGGC
GCGAGCATGC TGGTGCTCGA AGAAAGCGAC GGCGCATTGT GGTGGATTGA AGACCCCACC
GCAGTGGAAC ATTACGAAGC TATCTCCTCG ATTGACGCCC TTATGGCAGC GTCAGGGGGC
GACGCGTCCG GCGGTGCCGC GATTTGGCCG TGGGTACTCG GTGGCGCAGC TGTCGCGGGG
GGCATTGCCG TGGCGGCGGG TAGCGGAGGA GGAGGTGGTG GTGGAAGTGG TGGCGGGAAT
AATAATAATC CCGGCAACCT GGGGAATTCC GAGAACCCTC TCAATTCAGA CACGACGCCA
CCCAACGCAC CGACCAATCT CGCCTTCTCC ACGGACGGTA CTACCGTGAC CGGTACGGCC
GAACCCAACA GCACTATCAC GCTCAAAGAC GCAAACGGTA ATTTGGTTGG CACAGGCCAG
GCTGACAGCG ACGGGAAATT CACCATCGAA TTGGGCACGC CGCTGATAAA CGGCGAGCAA
ATCACCGCAA CTGCGACCGA CGCCGCAGGA AATATCAGTC AAGATGGCCA CGTTACCGCG
CCCGATCTCG TAACACCTGA TGCTCCTACC CTGATTCTGG TGAACGATGA CGCCGGGAGC
ATCACCGGCC CTCTCATTCA AAACCAGGTG ACCGATGATG CCCGCCCGAC GCTGAGCGGT
AGCGGCGAAC CCGGCACTCT GATCACGATT TACGACAAAG GCGTGCAAAT CGGCACCACC
CAGGTAGGCG CTAACGGTAG CTGGACGTTT ACGCCCGGCA CGGCCCTGTC TGAAGGTAAT
CACTCCCTGA CCCTCACCGC GACCGATGCC GCGGGCCATG TCAGTGTCCC CTCCGATGCT
TTTACCTTGA TGGTGGATAC CCTCGCGCCA CCAGCACCGG TGATGACACT CAATCCTGCG
GGCACTGAGG TAACCGGGAC CGCAGAGCCG AACAGCACCA TCACCATTAC CAGCAACAAT
CAGCCGATCG CGACGGGCAA AGCCGATGGC AACGGCAATT TTGTCATCCC CCTGTCGCCA
GCACAGATTG ATGGCGAGAC CATTCGCGTT GTCGCCACCG ATGAGGCGGG GAATACCAGT
CTTCCCGCCA CAACGACCGC CCCCGATAAT ATCGCGCCCG CCATGCCAGC AAATCTGGCT
GTCGCAGCCG GCGGGAACAG TGTCACAGGC ACCACAGAAC CTCACTGCAC CGTCACGGTA
AAAGCCCCTA ACGGCGATGT CATTGGTGAA GCCACAGCGG ATGGCGACGG TCATTTTACC
GTCCCCATTT TTCCACCGCA CCTCAATGGC GAAGTGCTGC TCGTTCTGGC AACGGACACA
TCAAGCAATA CCAGCCTGCC GGGTCAAGCC GATGCGCCTG ACACCACCAA ACCGTTAGCC
CCCGATAATC CTGTCGTTTC TGGTGACGGA ACCAAAGTCA CCGGAACCGC AGAACCTGGC
AGCACCGTGA CCATCCGCGA AGATGGCGTA AAAATTGGCG AAGGCAAAGC GGACGATCAG
GGCAACTTTA GCGTGACCAT CGCCCCGCCA AAACTGAACG GCGAAATCCT CACCGCCGAG
GCCGCCGATA AAGCCGGAAA CACAGGGCCA ACGGCGAACG CCACCGCCCC GGATATCACC
CCTGCGCAGA CGCCGACGAT AGTTTCCGTG GAAGATAACG CGGCTAACGT GACTGGTCCT
GTCCCGCAAA GCGGGCTGAC CAACGACAGT ACGCCGACAA TCACCGGCAC TGGCGAGCCA
GGAACGCTGG TTTACATCTA TAGCGGTGAC AACCAAATTG GCACCGCCAA CGTGCTCTCC
AACGGCAGTT GGTCGTTTAC GCCAACTGTC CATTTACCGG AGGGCGGACA TGTTTTAACC
GCCGTCGCCG TGGACGATGC GCTCAACCGC AGTGAAACGT CAAACAGCTG GAGTATTACC
GTCGACAGCC TCGCGCCTGC GGCCCCGGTC ATAACCCAAG TGGTTGATGA CGTACCGGGC
CGAACCGGCG CACTCGATAT CAACGAAGTC ACCAACGATA ATCGCCCAAC GCTTAACGGC
ACAGGTGAAC CTGGCACAAC CATCAGCATT CGTCTTGACG GCACGCAGAT TGGCACGGCG
CTGGTCAACG ACGGCGGAGC CTGGACTTAC ACTCCTACGA TTGTTTTCCA GAACGGGCAA
CACACCCTCA CCGCAACTGC CATTGATAAA GCAGGCAACG TTAGCGCAGC ATCTGGCGGA
TTTACCTTCA CGGTGGATAC GACAGCACCG CCGCCGCCAT CCATTACGAC GGTCACGGAT
AACACCGGTG ACGTTAAGGG AATCCTCACC AGCGGTTCAC CGACGGACGA AACCCATCCG
GTCATGCAAG GCACGGCACC CGCGGGCACC ACGATTGCCA TTTACGACGG TACGACGCTG
CTAGGATCTG CCGTGCTCGA TGGCAGCGGC GGCTGGAGCT TTACCCCACC GAGTACGCTG
ACGGACGGCA CGCACGTTTT AACGGCGGTG GCGACAAATG CCGCAGGAAC CTCCACCCCT
TCTGGCTCGT TTACGCTGGT GGTGGACACC GTTGCACCCG CCACGCCTGA TTCGCCAGAC
ATCACCGTCA ACCCGGACAA TGCCCCCATC GGTACCGCGC TCAACCCGGG CGAAGCAACG
CGCGATACCT CGCCAACGCT GAGCGGCACC GGGAATGTGG GTGATACGGT CACGATTTAC
ATTGATGGCG TGAAGCAGCC CGGGGCGGTC ATCGTCGATG ATGACGGGAA ATGGAGCTGG
TCGCCTGTTC CGCCGCTGAC CAATGGCTCG TATGACATCG AGCTAACCGT CACCAATAAA
GACGGCGCGG GCAATGAAAG TGCCCCGTCA CAGCCGGTCA CCATTGTCAT TGATACCGTT
GCGCCGACCA CGCCAGCTAC GCCAGTGGTA ACGGATAACG TCACGGAGAT AACCGGCCCT
GTCGCCGATA ACGGCAGCAC CAACGATCCG CGTCCGGTGA TAAGCGGGAC GGGCACGCCG
AACGACGTGA TCACCATTTA CGATAGCGTC GACGGTGCGC CGAAAAGTGA AGTAGGCCAG
GTCACAATCG GTGCTGACGG CAACTGGAGC TGGAGGCCGG ATACTCCCCT GACACAAACG
TCGCACACGT TCACCACCAC CGCGACGGAC GAAGCGGGCA ACGTCTCGGG AACGTCTATC
GCCATCAAAG TGACTATCGA TACCGATGCC CCCCTGCCCC CGGCGATCAC TGATGCGGGC
GGCGTAAGCA ATAACGGCGC GACGCAGGAT ACCACCCCGA CCATTTCGGG AACGGGCGTC
AGCGGCGATA CGATCCTTAT TTATAATAAC GGCGTGCAAA TCGGGACGGC GACGGTTGCA
GGCGGCGTCT GGAGCTTTAC GCCTACTACC GCGCTGAGCG AAGGGCCACA CACGCTGACG
GCGGCGCAAG TCGACGCGGC CGGTAACGTC AGCCCCCTGA GTCCGATTTA CACCGTCACC
GTCGATACCA TCGCGCCAAC CACGCCGCTG ATTGACAACA TATCTTCCAG CACGCTCGCT
AACGGCGTTC TTTACACCAA CGACAATACG CCAACGCTGA CCGGCACCGG CGAGCCGCGC
ACCGTGATCA CCGTCTCTAT TGACGGGACT GCGTCTACGG TCACCGCTAC GGTTCAACCT
GACGGGACCT GGAGCTGGAC GTCACCCACG GCGCTTCCTG ATACCCCTCA TGTCATCACC
GTCACGTCCA GCGATGCGGC GGGAAATACC TCTGGCACGT CGACCACTAA CGTGACGGTG
GATACCGATG CGCCCGCAGC TCCGGTCGTG ACGGCGCTGG CTATCGAAGG CACGCCGATT
ACCGGTACGG CTGAAGCGGG CTCACTGGTG ATTATCACGG GCCCCGGCCC GGGCGGCACC
ACCATTGAGT TGGGTCGCGG CATTGCGGTG GGCGGGAATT TCTCTATCGC CCTTTCGCCT
GCGCAAACCA ACGAAACCAC GCTTACGGTC AGGGCGACAG ATGCGGCGGG CAACCTCAGC
GATCCCACCA CCTTTAACGT CGCGGATGCG CCCGATCTGC CCGACGTGCC GGTCATCACC
TCAATCGCTG ATAACAACGG CACGGACAGC ATCGAAGTGA AAGGGGGAAG TTCCGACGAC
ACCACGCCGG TTATCAGCGG AACCGGTCCA GAAAACAGTA CCATTACCCT GTATCTGAAC
GGGGTGGAGA TCGCCACGAT TGGGCTGGGC GCGGGGCAAA CCACCTGGAG TTACACGGTT
CCGGCGGGAA GCGCGTTGGC AGAAGGTACC TACAATTTCA CGGCCACCGC GACGATCGGC
GGAGCCACCA GCGGGCTTTC CGCGGCGGCG ACGGTGACGA TCGACCTGAC GGCACCGAAC
ATCCCTGCGA TTGGTGCCGT GACTGATGAC GTGGGTCCTG GCACCGGCCC CCTGACCAAC
AATCAAATTA CCAATGACAG CCAACCGACG CTGACCGGCA ATGCGACGGC GGGCGATACC
ATTTCCGTCT ACAGCAACGG CACGCTGCTC GGCAGCGTGT TAGTAGGTCC AACGGGCGCG
TGGAGCTTTA CCCCTCCGAG CGCGCTGGCC GAGGGCAGCA ATGTGCTGAC CATCAGGGCC
ACCGATCCGG CCGGGAATCA AAGTGGTGCT TCCGCCCCGT TCACTATTGT GGTCGATACA
GTCTCTACCA CGCCGGTGAT TGTGGGCGCG GAAGATAACG TCGGCACAAC AGAAAACATT
CCGACGAACG GCGTCACCAA CGACACCACG CCGACACTCT CCGGTACGGC GGAAGCCAAC
AGCGTCATCG CGATTTTCGA AGGTGCAACG CAGATCGGCA CCGCCATCGC CAACGGCAGC
GGCGCGTGGA CATTCACCCC TGCCGCCGCC CTGAGTGAAG GCTCACACAC CTTCACCGTC
AAAGCGACCG ATCCGCAGGG TAACGTCAGC ATTGCCTCCA ATGCGTACAC GGTCGTCATC
GACATCACCC CGCCAGCGGT ACCGGTGCTG AGCACGGTTA ACGACAACGT CACCGGCGGT
GCGTTCGGCA ATTTAACAGC GGGACAAGTG ACCAACGACG CGACCCCAAC GCTGAGCGGA
ACGGGCGTGA CGGGCAGCAC GATCCATATT CTCAATAACG GCATTGAAAT CGGCACCGCA
ACCGTCGCCA ACGGCACGTG GACCTTCACG CCGCCCGCTA ATCTGCCCGA CGGCGCTTAC
AACATCAGGG TTAACGCCAG CGATGCGGCG GGCAACGTCT CCGCGAATTC ACCAGTATTC
TCATTTACCG TCGACACCAC CCCACCCGCC GCACCGATTG TGCTCACGGT GCTGGATGAT
GTCGGCCCGG TGATCGGGGA AATCACCAGC GGCGACAGCA CCAACGATAA TCGGCCAACC
TTTAACGGCA CGGGCGAAGT CGGGGGCACC ATTACCCTGC TGAATGACGG ACAACCGTTC
GGCACCGCCA TCGTCAACGC CCAGGGGAAC TGGACATTCA CCCCGACCGC GCCGCTGAGC
GAAGGCACAC ACACCATTAC CCTTAGCACG ACCGATGTCG CGGGCAATAC CAGCACGACC
ACCAGCACGT TCGAACTGAC GGTCGACACC TTCGCGCCTT CTGCACCGGC CATTATTAAC
GCGACCGATA ATGTGGGAAG CGTCCTGACT CCAGTGACCA ACGGCAAAAC CACCGATGAC
ACCACGCCAA CGCTGAACGG GACGGCGGCG GGCAACGCAA CGGTGACTAT TTATGAAAAC
GGCGGCGTCG TGGGTACCGT TCAGGCCAAC GCATCAGGCG AATGGAGCTT TACGCCAGGC
AGCGCGTTAA GCAACGGGAG CCACACCTGG ACCGCCACCG CGACCGACGC GGCGGGTAAC
GTCAGCGTGG CGTCGCCTGG CTTCACGGTG ATTGTCGATA CCATCGCGCC GCTCGCCCCA
GTGATTACGC AGGCGTTTGA CGACGTTGGC ACTGTCACCG GGCCGCTGAG CAACGGGCAA
ACCACCGATG ACACCGTTCC GCGTCTCATC GGGACCAGCG AACCTAACGC CACCATCAAC
ATTTTTGAAG GGAACACGCT CGTTGGCACA ACCACCGCCG ATGCCAGCGG TAATTGGGCC
GTCACGCTCA ACACCACGTT CCCTGAAGGG CCGCACCAGT TCGTGGCGCG GGCGACCGAT
GCGGCGGGCA ACACCGGCGA TCCGTCGTCA CCATTTAATC TGAACATTGA CCTCACGCCG
CCCGCCATTC CGTTGCTGGT GAGCGTGGTC GATGACGTGG GCACCACGGC CACGATCAAC
AGCGGACAGA TAACCAACGA CGCGCAGCCT ACGCTCAGCG GGACGGCGGA AGCGGGTTCG
ACGATTAAAA TCTACGATAA CGGCGTGCAA ATCGGCAGCG TAACCGCTGC AGACGGCACT
TGGAGCTTTA CGCCAACGCC AGCGTTAGCC GACGGCCAGC ATCCGCTGAC GATCACCGCG
ACCGATCCGT CGGGCAATAC CAGCGTTGCG ACGACGCCGT TTGTGCTGAA CCTGGATGCC
ACCCCGCCAA ATGCGCCGAT CATCACAACG ATTGTCGATG ATGTCGGCCC GAACCTGGGA
ACGATCGCGG GCGGCACACC AACCAACGAT ACGCAACCGA CGCTGAACGG CACCGCGGAA
GCCAACGCGG TGGTGCGCAT TTACGACGGC GGTACGCTGG TCGGCACCGT CACGGCGGAT
GCCAACGGCA ACTGGACGCT GCCGCAAACC TCCACCATTC TGACCAACGG CCAGCACAAC
TTCACCGCAA CCGCCACCGA TGCTGCGGGC AACACCAGCG CGCCGTCGTC CATCTCCTCG
GTCGTGGTCG ATACCATCGC TCCTAACCTG CCGACCACGC TCGCGGTCAT CACCAACGGG
ACGCACGTCA CCGGTGTTGC CGAAGCGGGC AGCACCGTGA CGATAACCAC CAGCGGCGGA
ACCGTGCTGG GCACCGCCAC GGCGGATGGC ACCGGCAGCT TTAACGTCAC TATTTCACCG
CCGCAAACCA ACGGCGAATC ACTGCTGGCG TTTGCAACCG ATAAGGCTGG CAACGTCGGC
GGCAACGCAA CGGTGGTCGC GCCGTTTACC AACCTGCCAA ACGCGCCGGT GATTGTGACG
ATTGACGATA ACGTCGGTAC GCTGATCGGC AATTTAACCA ACGGGAAAGC GACCGACGAC
ACCACGCCGA CGCTGACCGG CACCGCGCAG CCAAACTCCA CCGTCACGCT GTATAACAAC
GGCGTGGCAA TGGGCACCGC GACCGCCGAC ATCAACGGCG ACTGGTCGTT TACCACGCCA
GTTCTCAGCC AGGGCTCGCA CGCCTTCACC GCCACGGCCA CCAACGTGGT GGGCGGCGTC
GGTCCGGTAT CATCGCCATC CACCATTATT GTCGATACCG TCGCGCCGAA CGCGCCAACC
GGGACCTTCA ACGCCGACGG TAGCGTCCTG ACTGGCAGCG CCGAAGCGGG CAGCACGGTG
ACGATTCGCC TGCCGGACAA TTCGACGTTC ACTACAGTCG CCAACAGCAG CGGCACGTAC
AGCTACACCT TCCTGAACAA ACAGACGGAA GGCAACACCC TGCAAATCAC CGCCACCGAT
GCCGCTGGGA ACACCTCTAC GCCAGGTTCG GTGCTGTCAC CGGTTGTGGC GCTGTCGGCA
AGTACCAACG TTGAGGAGCT GGATATCAGC ACCACGGCCA CGGTCACTAA CGCGCAGTAC
AGCGATTACG GATTCCTGCT GGTCGGCGCG CTCGGCAACG TACTGACGTT ACTGGGCAAC
GACACGGCGC AGGTGGGGTT CGTGGTTTCA GACGGCGGCA ACGCGGATAT CTCGATCAAC
GCGAACGCCA CGGGCGTGGT GCTCTCGCTG CTCAACACGC TGGAGGTGGT GGTTCAGCGC
TGGGACGGCG TCAATAACAC CTGGACGACG GTCGTTGATA CCGGCCTGCC GCAGTTTGCC
AACCTGCTCA CGCTCGGCGC GAGCGGCGTG ACGCTGAACA TGACCGGGCT GGAAAACGGC
CAGTATCGCG TCCTAAGCTA CAACACCAAT CTGCTGGCCA CCGGGTCTTA CACAAGTCTC
GACGTAGACG TGACCGAAAC CAGCGCCGGG GTCATCACCG GGATCTCAAC CCAAAGCGGC
AACGTCATTT ATGACCTTGA CCCGACGACG GGCAGCGATA ACGCGCCAGC GGGAACCCGC
ATCACGGCCG TGACCGACGC GCAAGGTAAC GCAGTCAACG TGACGGCGGA CGGCACCATC
ATTCAGGGGC AGTACGGCAC GCTGACCATC AACCTGAACG GGAGTTACAC CTACACGCTG
ACCAACACCA GCGCTGCCGT GCTGGGTCGC ACCGAGAGCT TCACCTATAC CATCGGGCAT
AACGGTGCCA CCGCCTCGGC AAAACTGGTG ATTTCACTCG GCGCAAATAC CGTCACCAAT
AGCGTCACAG CGGTCGACGA TACGGCATCG CTGACCTACG ACACCAGCGT ACACGCGATC
AACAACGGCC CGTCGTCCCA GGGCGGATTC ACCGTCGCGG GCGTTAATCT TGGCAGCACG
CTGGGGCTGA ACCTACTCGA CGATCTGAGC AATCCGATCA TCTACAGCGT GCAGGAAGGC
ACCACCCGCA CCATGACTAT TCAGGCCTCG GTCGGCGGCG TGGCGCTGGC GTCGGTGTTT
GACCTGTATA TCTATAAATT TAACGATGCG ACCCAAACCT TCGAGCAGAT GCGCGTTCAG
CCGGGCTGGC TGCGCGCGCC GCTGCTGGGC GGCACCTCGG GCACGCTGAC GCTGAACCTG
CCTGCGGGCG AATACCTGTT CCTGCTCAAT ACCGCAGCCG GGATCACGGT GCTCACGGGG
TATACCCTTA ACGTGCTGGA GGATCACGTT TATAACGTGG CGAGCGTGGG GGCATCGACG
ACGGGTGACG TGCTGGCGGA CGATATTGCG CCGCAGGGCA CGCTGGTCAC CGAGGTCAAC
GGTGTCGCCG TTAACGCCAC GGGCGTGACC GTTATTCAGG GTGAATACGG TACGCTGACG
ATTAACGCTC AGGGGAGCTA CACCTATACG CTGCGCAGCG GTATCGGCGC GGATCACATC
AAAACGCCGG ATACGTTCGT CTACACCATC ACTGCGCCAA ACGGGGATAA AGACACGGCG
TCGCTGAATA TCACACCAAC CGCACGGGCG ATGGATGCAG TGAACGACGT CAGCGCCGTG
ATGGACGTGA CCTCCCTGCA CCACACCACT GCGTATTCTG ACACCACCGT GGGCACCGCA
AGCTGGACCA CCGCGCTGTT AGCGCCGACT CAGGGCAGCG GCAGCGGCAC ATTTGTGGTG
GATGCCAACA CGGCGCTGCA TAACGTGGCG CTGCACTTCA ACGTCGCATC GCTGCTGGCG
CTGGGTGGGT TGACGGTGAG CTGGTCGATC AGTAATGGAT CGACGGTTAT CCGCAGCGGT
TCGTTTGCCG GCGGTGCGCT GCTGGGCGGC AAAATCGATA TCAACCTCGG CGGACTGGAT
CTGGACGCCG GAACCTACAC GCTGAACTTC ACCGGCAACG TGCCGGGGCT GAGCGTCGGC
GGCATCCTCA TCACCCCAAG CGTGACGGGG ACGGCCTACT CGCTGAGCCA GTTCGACTCC
ACCAGCGGGC ATACGGTTAA TGGCAATATC TTTGACGGCA GCGACTCGCA AGGGGCGACG
GATCAGCTGC ACTCCGTGGA TACGCGCCTG AGCATCACCG GCTATAACGG CGTCACCACC
ACGCTGGATC CGTACACCGC CAGCACTGCA ACGTCGACCA TTCAGGGGCA TTACGGCACG
CTGTCCATCG GCGCTGACGG GCATTACACC TACACGCTCA ACACCGGGGT TTCACTTTCC
AGCATCACCT CGAAAGAGGT CTTCAACTAC ACCCTGACCG ATGCGGCAGG CAAAACGGAC
AGCGCCTCGC TGACCATCAA CATGGCACCG CAGTTCATCA GTTCGGAGCA TAACGATCTC
ATCACCGGGA CGGCTTACGC TGACACCTTG ATTTACCAGG TGCTTAACAA CACCGTGGGA
AATGCCACGG CGGGCAACAG CAGCGGCGAT CACTGGACTA ATTTCTCGAT AAGCCAGGGG
GACAAAATTG ACATCGGCGA TCTGCTGGTG GGCTGGAACG GACAGTCCTC TACGCTTGGG
AATTACGTCC ACGTGACCCA AAGCGGTAAT AACACTGTGA TCTCCATCGA CCGCGATGGC
GCAGGGAGTG CCTACACTAA TACAACGCTT GTTACGCTGG ACAACGTCCA GGCCACCTAC
GACGAGTTAG TCAACCAGCA TCACAACATC ATTACCTGA
 
Protein sequence
MTPPGALITR KKSGFVLHAI KKICRESLYM SQISVISKLT GVETTTEGNQ VSLGQSSIVK 
LHVGRADISH YARNGNDLVV SLNSGETITL KNFYVGDAQG ASMLVLEESD GALWWIEDPT
AVEHYEAISS IDALMAASGG DASGGAAIWP WVLGGAAVAG GIAVAAGSGG GGGGGSGGGN
NNNPGNLGNS ENPLNSDTTP PNAPTNLAFS TDGTTVTGTA EPNSTITLKD ANGNLVGTGQ
ADSDGKFTIE LGTPLINGEQ ITATATDAAG NISQDGHVTA PDLVTPDAPT LILVNDDAGS
ITGPLIQNQV TDDARPTLSG SGEPGTLITI YDKGVQIGTT QVGANGSWTF TPGTALSEGN
HSLTLTATDA AGHVSVPSDA FTLMVDTLAP PAPVMTLNPA GTEVTGTAEP NSTITITSNN
QPIATGKADG NGNFVIPLSP AQIDGETIRV VATDEAGNTS LPATTTAPDN IAPAMPANLA
VAAGGNSVTG TTEPHCTVTV KAPNGDVIGE ATADGDGHFT VPIFPPHLNG EVLLVLATDT
SSNTSLPGQA DAPDTTKPLA PDNPVVSGDG TKVTGTAEPG STVTIREDGV KIGEGKADDQ
GNFSVTIAPP KLNGEILTAE AADKAGNTGP TANATAPDIT PAQTPTIVSV EDNAANVTGP
VPQSGLTNDS TPTITGTGEP GTLVYIYSGD NQIGTANVLS NGSWSFTPTV HLPEGGHVLT
AVAVDDALNR SETSNSWSIT VDSLAPAAPV ITQVVDDVPG RTGALDINEV TNDNRPTLNG
TGEPGTTISI RLDGTQIGTA LVNDGGAWTY TPTIVFQNGQ HTLTATAIDK AGNVSAASGG
FTFTVDTTAP PPPSITTVTD NTGDVKGILT SGSPTDETHP VMQGTAPAGT TIAIYDGTTL
LGSAVLDGSG GWSFTPPSTL TDGTHVLTAV ATNAAGTSTP SGSFTLVVDT VAPATPDSPD
ITVNPDNAPI GTALNPGEAT RDTSPTLSGT GNVGDTVTIY IDGVKQPGAV IVDDDGKWSW
SPVPPLTNGS YDIELTVTNK DGAGNESAPS QPVTIVIDTV APTTPATPVV TDNVTEITGP
VADNGSTNDP RPVISGTGTP NDVITIYDSV DGAPKSEVGQ VTIGADGNWS WRPDTPLTQT
SHTFTTTATD EAGNVSGTSI AIKVTIDTDA PLPPAITDAG GVSNNGATQD TTPTISGTGV
SGDTILIYNN GVQIGTATVA GGVWSFTPTT ALSEGPHTLT AAQVDAAGNV SPLSPIYTVT
VDTIAPTTPL IDNISSSTLA NGVLYTNDNT PTLTGTGEPR TVITVSIDGT ASTVTATVQP
DGTWSWTSPT ALPDTPHVIT VTSSDAAGNT SGTSTTNVTV DTDAPAAPVV TALAIEGTPI
TGTAEAGSLV IITGPGPGGT TIELGRGIAV GGNFSIALSP AQTNETTLTV RATDAAGNLS
DPTTFNVADA PDLPDVPVIT SIADNNGTDS IEVKGGSSDD TTPVISGTGP ENSTITLYLN
GVEIATIGLG AGQTTWSYTV PAGSALAEGT YNFTATATIG GATSGLSAAA TVTIDLTAPN
IPAIGAVTDD VGPGTGPLTN NQITNDSQPT LTGNATAGDT ISVYSNGTLL GSVLVGPTGA
WSFTPPSALA EGSNVLTIRA TDPAGNQSGA SAPFTIVVDT VSTTPVIVGA EDNVGTTENI
PTNGVTNDTT PTLSGTAEAN SVIAIFEGAT QIGTAIANGS GAWTFTPAAA LSEGSHTFTV
KATDPQGNVS IASNAYTVVI DITPPAVPVL STVNDNVTGG AFGNLTAGQV TNDATPTLSG
TGVTGSTIHI LNNGIEIGTA TVANGTWTFT PPANLPDGAY NIRVNASDAA GNVSANSPVF
SFTVDTTPPA APIVLTVLDD VGPVIGEITS GDSTNDNRPT FNGTGEVGGT ITLLNDGQPF
GTAIVNAQGN WTFTPTAPLS EGTHTITLST TDVAGNTSTT TSTFELTVDT FAPSAPAIIN
ATDNVGSVLT PVTNGKTTDD TTPTLNGTAA GNATVTIYEN GGVVGTVQAN ASGEWSFTPG
SALSNGSHTW TATATDAAGN VSVASPGFTV IVDTIAPLAP VITQAFDDVG TVTGPLSNGQ
TTDDTVPRLI GTSEPNATIN IFEGNTLVGT TTADASGNWA VTLNTTFPEG PHQFVARATD
AAGNTGDPSS PFNLNIDLTP PAIPLLVSVV DDVGTTATIN SGQITNDAQP TLSGTAEAGS
TIKIYDNGVQ IGSVTAADGT WSFTPTPALA DGQHPLTITA TDPSGNTSVA TTPFVLNLDA
TPPNAPIITT IVDDVGPNLG TIAGGTPTND TQPTLNGTAE ANAVVRIYDG GTLVGTVTAD
ANGNWTLPQT STILTNGQHN FTATATDAAG NTSAPSSISS VVVDTIAPNL PTTLAVITNG
THVTGVAEAG STVTITTSGG TVLGTATADG TGSFNVTISP PQTNGESLLA FATDKAGNVG
GNATVVAPFT NLPNAPVIVT IDDNVGTLIG NLTNGKATDD TTPTLTGTAQ PNSTVTLYNN
GVAMGTATAD INGDWSFTTP VLSQGSHAFT ATATNVVGGV GPVSSPSTII VDTVAPNAPT
GTFNADGSVL TGSAEAGSTV TIRLPDNSTF TTVANSSGTY SYTFLNKQTE GNTLQITATD
AAGNTSTPGS VLSPVVALSA STNVEELDIS TTATVTNAQY SDYGFLLVGA LGNVLTLLGN
DTAQVGFVVS DGGNADISIN ANATGVVLSL LNTLEVVVQR WDGVNNTWTT VVDTGLPQFA
NLLTLGASGV TLNMTGLENG QYRVLSYNTN LLATGSYTSL DVDVTETSAG VITGISTQSG
NVIYDLDPTT GSDNAPAGTR ITAVTDAQGN AVNVTADGTI IQGQYGTLTI NLNGSYTYTL
TNTSAAVLGR TESFTYTIGH NGATASAKLV ISLGANTVTN SVTAVDDTAS LTYDTSVHAI
NNGPSSQGGF TVAGVNLGST LGLNLLDDLS NPIIYSVQEG TTRTMTIQAS VGGVALASVF
DLYIYKFNDA TQTFEQMRVQ PGWLRAPLLG GTSGTLTLNL PAGEYLFLLN TAAGITVLTG
YTLNVLEDHV YNVASVGAST TGDVLADDIA PQGTLVTEVN GVAVNATGVT VIQGEYGTLT
INAQGSYTYT LRSGIGADHI KTPDTFVYTI TAPNGDKDTA SLNITPTARA MDAVNDVSAV
MDVTSLHHTT AYSDTTVGTA SWTTALLAPT QGSGSGTFVV DANTALHNVA LHFNVASLLA
LGGLTVSWSI SNGSTVIRSG SFAGGALLGG KIDINLGGLD LDAGTYTLNF TGNVPGLSVG
GILITPSVTG TAYSLSQFDS TSGHTVNGNI FDGSDSQGAT DQLHSVDTRL SITGYNGVTT
TLDPYTASTA TSTIQGHYGT LSIGADGHYT YTLNTGVSLS SITSKEVFNY TLTDAAGKTD
SASLTINMAP QFISSEHNDL ITGTAYADTL IYQVLNNTVG NATAGNSSGD HWTNFSISQG
DKIDIGDLLV GWNGQSSTLG NYVHVTQSGN NTVISIDRDG AGSAYTNTTL VTLDNVQATY
DELVNQHHNI IT