Gene Information |
Locus tag | Haur_4335 |
Symbol | |
ID | 5736195 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 5536715 |
End bp | 5541976 |
Gene Length | 5262 bp |
Protein Length | 1753 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641281496 |
Product | alpha amylase catalytic region |
Protein accession | YP_001547095 |
Protein GI | 159900848 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases |
TIGRFAM ID | |
|
|
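The coordinate and length fields above can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon; both assumptions are consistent with the values in this record. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(5536715, 5541976)
print(length)                     # 5262
print(protein_length_aa(length))  # 1753
```

Both results match the Gene Length and Protein Length rows of this record.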
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.167512 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTGACA CTGCCCATCG TCGGAGCCTG ATCAGCCGTA TGCTGCTGAT CGTTATGCTG TTGTCGTTGA TGATCCCCAC GTTCAGCCAA CAAACACCTG TGGTCGCGGC CCAGCCTGCC ACGTTTGATC TGGCGCTCAA TCAAGCCCCA ACTATTGCAA ATGCTCAAGT CAATTGGTGT TTTGCTGGCG GGTTCCAAAA TTGGGACAAT GCTGCAACAC CTGCCAATGA CGAAGGTCTC AACGGCGATT TAGTCGCTGG CGATGGCATC TACTCTTTAG ATGTCGAGAT CGCAACTGCT GGCCGCTCTG AATGGAAAGC GGTTGTTTGT GGCAGTTGGG AAACTTCAGT TCCCGCTGGC CCTAATGCTT GGATGAACAC CACCCAAGCC AATCAAACCG TCAAATTGAC CCTCGATACC AATAACTACA GCAGCAATGC TGGCAATCAT GGCGTGCCAA GCAACAATAT TATTCACGCC TACGATAGCG ATTTTGCTAG CTGGACAGCG GTTGGTTCGT TCCAAAATCC AGTTTGGACG AACAACGACC CAGCAACCGC AATGACAAGC CTCGGCAATG GTTGGTACTA CTTGGCCTAT ACGGTGCCAA ACGCTGGTAC CTATGAAGGC AAAGTTGTGC ACACTGGCGA TTGGACAACT CAATACACTG GGGTTGGCCG CGCCGCCGAT CAAGGCAACA TCAGCTTTAC CACCACCCAA GCCGGACAAA TGGTGGTATT TTTGCTTGAT ACCAACACGA GCCGCCTGAC GATTCGCCCG CAAACCGCTG GCAATGCAGG CCCATGGTGT GCTCCAGGTA CCTATCAAAC CCCACAATGG CAAGAAACTG GCAGCCCCTT GGTTGATAAC GGAACCCAAG GCGACTTAGT GAGCGGCGAC GGGGTTTTCA GCCTCGATGT TGTTATTCCA GCCGCTGGCA CCTACGAATG GAAGGTCAAT GCCTGTAACT GGGCGACCGC CTTCCCAGCT GCTAACGCTT GGATTTACAC CGACCAACCA AATCAAACGG TAAAATTATT GTTCGATAGC AATAATCACG CTGCTGATAA CGGCTGGGAT TTACTACCAA CTCAAAATGT GGTCAATGCA ATCGATGCAT CAAATGATTT CACGGTGGTT GGTGCATTCC AAGGCTGGAG CAACAATAAT CCAGCTACCA AAATGATCCA AATTGCCCCA AATCAATTTG TGTTGCACTA CACGATTGCA GCACCAGGCG ATTATGCTGC CAAATTCACC CGCACTGGCT CATGGGTCGA GCAATACAAT GCCCAGGGTC GGGTCTTCGA TGCCAACGAT CCCGCACCCA TCGGCTTTAC CACCACCAAT CTCAATGAAA CTGTCGTCTT CTACTTGGAT AATCGCACTG GCCGCGTGGC AATTACGCCG CAACGTGATG GCCAAGTGCC CGATTCTGTC ATTGGCGATG GCTTGATCAA TCGTGATGCA ATTGAACATC ACAGCCGTGA ATCGCTCTAT CGCGTACCGT TTGGCGCTGC ACCGCTCAAC CAAGCGGTTA ACTTGCGTTT GCGCACCGCC GCCCACGATA TCAATCAAGC ACGAATTCGC TTGTATTACA CTGCTAACAA TGGTCAAAGC ATTCAACGCA TGACCAAGGT GGCCAGTGAT GAAATGTACG ATTATTGGGA ATACACCATG CCAGCTCAAA CTGCTTTGGG CGTGGTCTAT TATCGTTTTA TTATCGAAGA TGGCGCGACG ACCCTATTCT ATGAAGATGA TAATCGCTTC GATGGTGGTT TAGGCGAAGT GGTCAACGCC 
AGCGCTGATC GCTCGTGGAA CATTTATATC TACGACCCAG CCTTTACCAC GCCTGAATGG GCCCAAAACG CCACAATTTA TCAAATTTTT GTCGAGCGCT TCCGCAACGG CGATACAAGC AACGACCCAA CTGGGGTGGC AACCGATACC ACCTATCCAG GGCGTGGTTG GTTCTACCCA ACTGAGCGCG GCCATCGCTT CCCAGTTACC CCGTGGAACT CGATCGTGCC CGATCCAGAA CCATTCACCG ACCAAAATAA CCCATGGTGG TCAACCTACA GCAGCACCAT GTATGGCGGC GACTTGAATG GAGTCCAAGA TAAACTCGAT TATTTACAAG ATTTGGGTGT AACTACGCTC TATCTGAACC CGATTTTCGA TAGCCCATCC AACCACAAAT ATGATGGCCG CAACTATCGC ACAGTTGACC CAGCCTTTGG TGGGCAACAA GCCTTTGACG ATTTGGTTGC CGATGCCCAT GGCCGTGGCA TGACGGTGGT GCTCGACGGC GTGCCCAATC ACGTCAGCAG CGATAGCCCC TTCTTCGATC GCTTTGGTCG CCACGCTGAA GTTGGCGCAT GCGAAAGCAC CAGCAGCCCA TATCGCACAT GGTTCTTCTT TGAGCCAGCC GCCGAGCCAG GCACAGGCGT TTGTGCTGGC GATACCAACT ATCGTGGGTG GTTTGGCGTA GCCACCTTGC CCCAAATCAA CACCAATCAT CCCGAAGTGA TGGCCTATTG GTTTGGTACG GCTGGCGGCA ACCCCAACTT ACCAACCAAC ACCGCTAGCT ACTGGGTCGA TGGCACCAAC AAAGCCGATG GCTGGCGCAT CGACGTTGTG CCCGATGTGA TTGGGGTCAA CCCAACCTTC TTTGAAACCT GGCGCGACGT GATGAAAGCC GCCAATCCCG ATGCTGTGCT CTACTCCGAA ACCTGGAGCG AAGGCGATGT GCGTGATCGG GTGCTTGGCG ATGAATTCGA TAGCACCATG AACTATCGCT ATCGCCGCGC AGTTTTGGGC TTCTTACGCG ATACCCGCTG GGTCGATAAC GACGGCGGCC AAGAGATCGA TCCGCTCTTG CCTAGCCAAT TTGTCAACAG TTTTGAAACT ATCCGCGAGG ATTATCCAGA GCCAGCCTAC AACGCGGCTA TGAACTTGAT CGATAGCCAC GATACCAATC GGGCGGTGCA TGTGCTCAAC GAGTTGGGCT TTACTGGCAC TGGCTACGAT CGCCAACCAG TCGATAATTT TGTTGATGCT CGGCATCGTT TGTCGTTGGT TGCTGCCTTG CAAATGACCT TGCCAGGTGC GCCAACCATC TATTATGGCG ATGAAGTTGG CTTAACTGGC TATGGTTTCG ATGTACCACG CGACGACCCC TACAACCGCC AGCCCTACCC ATGGACTGAC CAAGCAGGCT ACAACAGCTT GCCCGAATGG CGCAAAGCCG ACACTAATTT GTTGGCAACC TATCAACGGC TCGGCCAATT GCGCCGCGAT TATAGCTACT TGCGCACAGG CTCGTTCGAT GTGATGACCG CCAACGATGC CAACAAAGTT TTGGCTTATG GCCGTAAAGA TGAAAACGGT GCGGCAATTG TGGTGTTCAA CCGCGATAGC CAAGCCCATA GCATCGAACT AAACTTGGCT GGCTACGTGC CAACGGGCAC AGTGCTGACT CGCACCATGC CATTGACCCA AACTGGCCTA TTGCCAGCGA CCGATGTTAC AACCTATACC TTCAGCGTTG CACCACAAAG CGTTGGCATT TGGCTAACAC CTGATAGCGT TGATATGAGT GCTCCTGCTG 
CGCCAACTAA CTTGCAAGTG ACCAACCAAC TGTCACAAAG CATCGAGCTT GGCTGGAACA GCGTTGCAAC CGCCGAAAGC TACAACATTT ATCGCTCAAT CGTGAGCGGC GGTGGCTACG AGTTGGTCGG CAATACCACC AGCACCAGTT TCAGCAGCGC AGATCTCACG CCAGGTCAAC GCTACTACTA CATTGTCAAA GCAGTACGCA ATGGCTTGGA AAGCACCGCG AGCAACGAAG TTTCGGGCTT GCCAGCTTAT GTAATCAACT GGTCGAACCT GCAATTCCCC GCCACAATCA ACCATACAAT CAGCATTACC AACCCAACCA CTAATATTTA TGGTCAGGTG TATATCGCAG GTGTAACCAG CGAAGTTGGA GCAACGCCAA GCGTTTTGGC CGAAGTTGGC TATGGCCCTG ATGGCTCGCA GCCCAACACT GCTAGCTGGA CATGGTTTGA TGCCAATTTC AACGTACAAA ATGGCAATAA CGATGAATAT GTGGGCAATC TATTGCCAAG CAGCGTCGGA ACCTTCGATG TAGCCTATCG CTATAGCACC GACGGCGGCA CAAGCTGGAT TTACGCCGAT CTTGATGGTT CGGATAATGG CTATTCGCCA GCGCAGGCAG CCAGTTTGGT TGTTGCGGCG AGCAGCGATA CGACAGCACC GACCGCGCCG CTGAATCTTC GCGAAGTGCG ACGCTCAGCC TCGCAAATTG TGATTGGCTG GGATGTTTCA AGCGCTAACG ATACCTATCG CTACGATGTG TATCGCGATG GCAACGCGAT TGCCTCGGTA TTGCACCCAA CAACGATCTT TACTGATACC GAAGTAACCG CAGGCATTAC CTATACCTAT CAATTGAAGG CGCTTGACGG TTCGTTCAAT CAATCAGAAT TCTCGAATAG CGTTTCGATT CGCGCTGAGC AACGAGTGAT TGACGTAACC TTCCGCGTCA AAGTGCCAGT TGAAACCCCA ACCAACGAAA GTGTTTATAT CGCTGGCAAC GATGGCACAG TCTTCAACGG CGCATGGACG GCTAATGGTC AAATCATGCA ACGCGTGCTG ACCGATACAT GGGAATTCAA TAAACAAATT CTCGAAGGCA CAGCCTTGGA GTATAAATAC ACTCGTGGCG CGTGGGAGCG AGTCGAATCG TGGGGCACAA TCGAGGCCTT CACCAATCGT CGCGTAACCA TTGAATATGG CAACAACGGC CATTTCATCG TTGATGATAC TGACACCAAT TGGGGCACTG GCGACGATAA CCGCAAAGCC GTCCAAGCCT GGCGAGACCC CTTGGTCAGC AGTAGTTCAA TTGCCAACAA TGCCGTTGAA GTTGCGCCAA CAGTGCAACC AGCAATTACA TGGTCGCAAG TTGTTACAAC AACAACCCCA ATCAGCCAAG TATTGGTCTT GACCAATGCC CAAAATCAGT CCGTAGCCGG AACAGTGGTC GCTACCAGCC CAACAACCTT CAGCTTCATC CCCGCAGCAC CACTGCCAAA TGGAACCTAC ACGCTAACCG CATTCAATGT ACGGCGGGTG ATTGGTGATC CATCGCCAAT GCAACAACGC TACGTTGTGC GCTTCACTGT TGGCAGGGAC ATTCTCAAGA TCTTCCTACC AATCGTTGGA CGTTCAAACT AA
|
Protein sequence | MRDTAHRRSL ISRMLLIVML LSLMIPTFSQ QTPVVAAQPA TFDLALNQAP TIANAQVNWC FAGGFQNWDN AATPANDEGL NGDLVAGDGI YSLDVEIATA GRSEWKAVVC GSWETSVPAG PNAWMNTTQA NQTVKLTLDT NNYSSNAGNH GVPSNNIIHA YDSDFASWTA VGSFQNPVWT NNDPATAMTS LGNGWYYLAY TVPNAGTYEG KVVHTGDWTT QYTGVGRAAD QGNISFTTTQ AGQMVVFLLD TNTSRLTIRP QTAGNAGPWC APGTYQTPQW QETGSPLVDN GTQGDLVSGD GVFSLDVVIP AAGTYEWKVN ACNWATAFPA ANAWIYTDQP NQTVKLLFDS NNHAADNGWD LLPTQNVVNA IDASNDFTVV GAFQGWSNNN PATKMIQIAP NQFVLHYTIA APGDYAAKFT RTGSWVEQYN AQGRVFDAND PAPIGFTTTN LNETVVFYLD NRTGRVAITP QRDGQVPDSV IGDGLINRDA IEHHSRESLY RVPFGAAPLN QAVNLRLRTA AHDINQARIR LYYTANNGQS IQRMTKVASD EMYDYWEYTM PAQTALGVVY YRFIIEDGAT TLFYEDDNRF DGGLGEVVNA SADRSWNIYI YDPAFTTPEW AQNATIYQIF VERFRNGDTS NDPTGVATDT TYPGRGWFYP TERGHRFPVT PWNSIVPDPE PFTDQNNPWW STYSSTMYGG DLNGVQDKLD YLQDLGVTTL YLNPIFDSPS NHKYDGRNYR TVDPAFGGQQ AFDDLVADAH GRGMTVVLDG VPNHVSSDSP FFDRFGRHAE VGACESTSSP YRTWFFFEPA AEPGTGVCAG DTNYRGWFGV ATLPQINTNH PEVMAYWFGT AGGNPNLPTN TASYWVDGTN KADGWRIDVV PDVIGVNPTF FETWRDVMKA ANPDAVLYSE TWSEGDVRDR VLGDEFDSTM NYRYRRAVLG FLRDTRWVDN DGGQEIDPLL PSQFVNSFET IREDYPEPAY NAAMNLIDSH DTNRAVHVLN ELGFTGTGYD RQPVDNFVDA RHRLSLVAAL QMTLPGAPTI YYGDEVGLTG YGFDVPRDDP YNRQPYPWTD QAGYNSLPEW RKADTNLLAT YQRLGQLRRD YSYLRTGSFD VMTANDANKV LAYGRKDENG AAIVVFNRDS QAHSIELNLA GYVPTGTVLT RTMPLTQTGL LPATDVTTYT FSVAPQSVGI WLTPDSVDMS APAAPTNLQV TNQLSQSIEL GWNSVATAES YNIYRSIVSG GGYELVGNTT STSFSSADLT PGQRYYYIVK AVRNGLESTA SNEVSGLPAY VINWSNLQFP ATINHTISIT NPTTNIYGQV YIAGVTSEVG ATPSVLAEVG YGPDGSQPNT ASWTWFDANF NVQNGNNDEY VGNLLPSSVG TFDVAYRYST DGGTSWIYAD LDGSDNGYSP AQAASLVVAA SSDTTAPTAP LNLREVRRSA SQIVIGWDVS SANDTYRYDV YRDGNAIASV LHPTTIFTDT EVTAGITYTY QLKALDGSFN QSEFSNSVSI RAEQRVIDVT FRVKVPVETP TNESVYIAGN DGTVFNGAWT ANGQIMQRVL TDTWEFNKQI LEGTALEYKY TRGAWERVES WGTIEAFTNR RVTIEYGNNG HFIVDDTDTN WGTGDDNRKA VQAWRDPLVS SSSIANNAVE VAPTVQPAIT WSQVVTTTTP ISQVLVLTNA QNQSVAGTVV ATSPTTFSFI PAAPLPNGTY TLTAFNVRRV IGDPSPMQQR YVVRFTVGRD ILKIFLPIVG RSN
|
| |
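The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It builds the codon table from the compact NCBI layout (bases ordered T, C, A, G); translation table 11 uses the same amino acid assignments as the standard code and differs in which codons may act as starts. This is a minimal illustration only: it assumes the sequence begins with ATG, ignores alternative start codons, and the names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base spacing used above
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence in this record.
print(translate("ATGCGTGACA CTGCCCATCG TCGGAGCCTG"))  # MRDTAHRRSL
print(translate("CGTTCAAACT AA"))                     # RSN
```

The two outputs agree with the first ten and last three residues of the protein sequence listed above.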