Gene Haur_4335 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4335 
Symbol 
ID5736195 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5536715 
End bp5541976 
Gene Length5262 bp 
Protein Length1753 aa 
Translation table11 
GC content52% 
IMG OID641281496 
Productalpha amylase catalytic region 
Protein accessionYP_001547095 
Protein GI159900848 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.167512 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTGACA CTGCCCATCG TCGGAGCCTG ATCAGCCGTA TGCTGCTGAT CGTTATGCTG 
TTGTCGTTGA TGATCCCCAC GTTCAGCCAA CAAACACCTG TGGTCGCGGC CCAGCCTGCC
ACGTTTGATC TGGCGCTCAA TCAAGCCCCA ACTATTGCAA ATGCTCAAGT CAATTGGTGT
TTTGCTGGCG GGTTCCAAAA TTGGGACAAT GCTGCAACAC CTGCCAATGA CGAAGGTCTC
AACGGCGATT TAGTCGCTGG CGATGGCATC TACTCTTTAG ATGTCGAGAT CGCAACTGCT
GGCCGCTCTG AATGGAAAGC GGTTGTTTGT GGCAGTTGGG AAACTTCAGT TCCCGCTGGC
CCTAATGCTT GGATGAACAC CACCCAAGCC AATCAAACCG TCAAATTGAC CCTCGATACC
AATAACTACA GCAGCAATGC TGGCAATCAT GGCGTGCCAA GCAACAATAT TATTCACGCC
TACGATAGCG ATTTTGCTAG CTGGACAGCG GTTGGTTCGT TCCAAAATCC AGTTTGGACG
AACAACGACC CAGCAACCGC AATGACAAGC CTCGGCAATG GTTGGTACTA CTTGGCCTAT
ACGGTGCCAA ACGCTGGTAC CTATGAAGGC AAAGTTGTGC ACACTGGCGA TTGGACAACT
CAATACACTG GGGTTGGCCG CGCCGCCGAT CAAGGCAACA TCAGCTTTAC CACCACCCAA
GCCGGACAAA TGGTGGTATT TTTGCTTGAT ACCAACACGA GCCGCCTGAC GATTCGCCCG
CAAACCGCTG GCAATGCAGG CCCATGGTGT GCTCCAGGTA CCTATCAAAC CCCACAATGG
CAAGAAACTG GCAGCCCCTT GGTTGATAAC GGAACCCAAG GCGACTTAGT GAGCGGCGAC
GGGGTTTTCA GCCTCGATGT TGTTATTCCA GCCGCTGGCA CCTACGAATG GAAGGTCAAT
GCCTGTAACT GGGCGACCGC CTTCCCAGCT GCTAACGCTT GGATTTACAC CGACCAACCA
AATCAAACGG TAAAATTATT GTTCGATAGC AATAATCACG CTGCTGATAA CGGCTGGGAT
TTACTACCAA CTCAAAATGT GGTCAATGCA ATCGATGCAT CAAATGATTT CACGGTGGTT
GGTGCATTCC AAGGCTGGAG CAACAATAAT CCAGCTACCA AAATGATCCA AATTGCCCCA
AATCAATTTG TGTTGCACTA CACGATTGCA GCACCAGGCG ATTATGCTGC CAAATTCACC
CGCACTGGCT CATGGGTCGA GCAATACAAT GCCCAGGGTC GGGTCTTCGA TGCCAACGAT
CCCGCACCCA TCGGCTTTAC CACCACCAAT CTCAATGAAA CTGTCGTCTT CTACTTGGAT
AATCGCACTG GCCGCGTGGC AATTACGCCG CAACGTGATG GCCAAGTGCC CGATTCTGTC
ATTGGCGATG GCTTGATCAA TCGTGATGCA ATTGAACATC ACAGCCGTGA ATCGCTCTAT
CGCGTACCGT TTGGCGCTGC ACCGCTCAAC CAAGCGGTTA ACTTGCGTTT GCGCACCGCC
GCCCACGATA TCAATCAAGC ACGAATTCGC TTGTATTACA CTGCTAACAA TGGTCAAAGC
ATTCAACGCA TGACCAAGGT GGCCAGTGAT GAAATGTACG ATTATTGGGA ATACACCATG
CCAGCTCAAA CTGCTTTGGG CGTGGTCTAT TATCGTTTTA TTATCGAAGA TGGCGCGACG
ACCCTATTCT ATGAAGATGA TAATCGCTTC GATGGTGGTT TAGGCGAAGT GGTCAACGCC
AGCGCTGATC GCTCGTGGAA CATTTATATC TACGACCCAG CCTTTACCAC GCCTGAATGG
GCCCAAAACG CCACAATTTA TCAAATTTTT GTCGAGCGCT TCCGCAACGG CGATACAAGC
AACGACCCAA CTGGGGTGGC AACCGATACC ACCTATCCAG GGCGTGGTTG GTTCTACCCA
ACTGAGCGCG GCCATCGCTT CCCAGTTACC CCGTGGAACT CGATCGTGCC CGATCCAGAA
CCATTCACCG ACCAAAATAA CCCATGGTGG TCAACCTACA GCAGCACCAT GTATGGCGGC
GACTTGAATG GAGTCCAAGA TAAACTCGAT TATTTACAAG ATTTGGGTGT AACTACGCTC
TATCTGAACC CGATTTTCGA TAGCCCATCC AACCACAAAT ATGATGGCCG CAACTATCGC
ACAGTTGACC CAGCCTTTGG TGGGCAACAA GCCTTTGACG ATTTGGTTGC CGATGCCCAT
GGCCGTGGCA TGACGGTGGT GCTCGACGGC GTGCCCAATC ACGTCAGCAG CGATAGCCCC
TTCTTCGATC GCTTTGGTCG CCACGCTGAA GTTGGCGCAT GCGAAAGCAC CAGCAGCCCA
TATCGCACAT GGTTCTTCTT TGAGCCAGCC GCCGAGCCAG GCACAGGCGT TTGTGCTGGC
GATACCAACT ATCGTGGGTG GTTTGGCGTA GCCACCTTGC CCCAAATCAA CACCAATCAT
CCCGAAGTGA TGGCCTATTG GTTTGGTACG GCTGGCGGCA ACCCCAACTT ACCAACCAAC
ACCGCTAGCT ACTGGGTCGA TGGCACCAAC AAAGCCGATG GCTGGCGCAT CGACGTTGTG
CCCGATGTGA TTGGGGTCAA CCCAACCTTC TTTGAAACCT GGCGCGACGT GATGAAAGCC
GCCAATCCCG ATGCTGTGCT CTACTCCGAA ACCTGGAGCG AAGGCGATGT GCGTGATCGG
GTGCTTGGCG ATGAATTCGA TAGCACCATG AACTATCGCT ATCGCCGCGC AGTTTTGGGC
TTCTTACGCG ATACCCGCTG GGTCGATAAC GACGGCGGCC AAGAGATCGA TCCGCTCTTG
CCTAGCCAAT TTGTCAACAG TTTTGAAACT ATCCGCGAGG ATTATCCAGA GCCAGCCTAC
AACGCGGCTA TGAACTTGAT CGATAGCCAC GATACCAATC GGGCGGTGCA TGTGCTCAAC
GAGTTGGGCT TTACTGGCAC TGGCTACGAT CGCCAACCAG TCGATAATTT TGTTGATGCT
CGGCATCGTT TGTCGTTGGT TGCTGCCTTG CAAATGACCT TGCCAGGTGC GCCAACCATC
TATTATGGCG ATGAAGTTGG CTTAACTGGC TATGGTTTCG ATGTACCACG CGACGACCCC
TACAACCGCC AGCCCTACCC ATGGACTGAC CAAGCAGGCT ACAACAGCTT GCCCGAATGG
CGCAAAGCCG ACACTAATTT GTTGGCAACC TATCAACGGC TCGGCCAATT GCGCCGCGAT
TATAGCTACT TGCGCACAGG CTCGTTCGAT GTGATGACCG CCAACGATGC CAACAAAGTT
TTGGCTTATG GCCGTAAAGA TGAAAACGGT GCGGCAATTG TGGTGTTCAA CCGCGATAGC
CAAGCCCATA GCATCGAACT AAACTTGGCT GGCTACGTGC CAACGGGCAC AGTGCTGACT
CGCACCATGC CATTGACCCA AACTGGCCTA TTGCCAGCGA CCGATGTTAC AACCTATACC
TTCAGCGTTG CACCACAAAG CGTTGGCATT TGGCTAACAC CTGATAGCGT TGATATGAGT
GCTCCTGCTG CGCCAACTAA CTTGCAAGTG ACCAACCAAC TGTCACAAAG CATCGAGCTT
GGCTGGAACA GCGTTGCAAC CGCCGAAAGC TACAACATTT ATCGCTCAAT CGTGAGCGGC
GGTGGCTACG AGTTGGTCGG CAATACCACC AGCACCAGTT TCAGCAGCGC AGATCTCACG
CCAGGTCAAC GCTACTACTA CATTGTCAAA GCAGTACGCA ATGGCTTGGA AAGCACCGCG
AGCAACGAAG TTTCGGGCTT GCCAGCTTAT GTAATCAACT GGTCGAACCT GCAATTCCCC
GCCACAATCA ACCATACAAT CAGCATTACC AACCCAACCA CTAATATTTA TGGTCAGGTG
TATATCGCAG GTGTAACCAG CGAAGTTGGA GCAACGCCAA GCGTTTTGGC CGAAGTTGGC
TATGGCCCTG ATGGCTCGCA GCCCAACACT GCTAGCTGGA CATGGTTTGA TGCCAATTTC
AACGTACAAA ATGGCAATAA CGATGAATAT GTGGGCAATC TATTGCCAAG CAGCGTCGGA
ACCTTCGATG TAGCCTATCG CTATAGCACC GACGGCGGCA CAAGCTGGAT TTACGCCGAT
CTTGATGGTT CGGATAATGG CTATTCGCCA GCGCAGGCAG CCAGTTTGGT TGTTGCGGCG
AGCAGCGATA CGACAGCACC GACCGCGCCG CTGAATCTTC GCGAAGTGCG ACGCTCAGCC
TCGCAAATTG TGATTGGCTG GGATGTTTCA AGCGCTAACG ATACCTATCG CTACGATGTG
TATCGCGATG GCAACGCGAT TGCCTCGGTA TTGCACCCAA CAACGATCTT TACTGATACC
GAAGTAACCG CAGGCATTAC CTATACCTAT CAATTGAAGG CGCTTGACGG TTCGTTCAAT
CAATCAGAAT TCTCGAATAG CGTTTCGATT CGCGCTGAGC AACGAGTGAT TGACGTAACC
TTCCGCGTCA AAGTGCCAGT TGAAACCCCA ACCAACGAAA GTGTTTATAT CGCTGGCAAC
GATGGCACAG TCTTCAACGG CGCATGGACG GCTAATGGTC AAATCATGCA ACGCGTGCTG
ACCGATACAT GGGAATTCAA TAAACAAATT CTCGAAGGCA CAGCCTTGGA GTATAAATAC
ACTCGTGGCG CGTGGGAGCG AGTCGAATCG TGGGGCACAA TCGAGGCCTT CACCAATCGT
CGCGTAACCA TTGAATATGG CAACAACGGC CATTTCATCG TTGATGATAC TGACACCAAT
TGGGGCACTG GCGACGATAA CCGCAAAGCC GTCCAAGCCT GGCGAGACCC CTTGGTCAGC
AGTAGTTCAA TTGCCAACAA TGCCGTTGAA GTTGCGCCAA CAGTGCAACC AGCAATTACA
TGGTCGCAAG TTGTTACAAC AACAACCCCA ATCAGCCAAG TATTGGTCTT GACCAATGCC
CAAAATCAGT CCGTAGCCGG AACAGTGGTC GCTACCAGCC CAACAACCTT CAGCTTCATC
CCCGCAGCAC CACTGCCAAA TGGAACCTAC ACGCTAACCG CATTCAATGT ACGGCGGGTG
ATTGGTGATC CATCGCCAAT GCAACAACGC TACGTTGTGC GCTTCACTGT TGGCAGGGAC
ATTCTCAAGA TCTTCCTACC AATCGTTGGA CGTTCAAACT AA
 
Protein sequence
MRDTAHRRSL ISRMLLIVML LSLMIPTFSQ QTPVVAAQPA TFDLALNQAP TIANAQVNWC 
FAGGFQNWDN AATPANDEGL NGDLVAGDGI YSLDVEIATA GRSEWKAVVC GSWETSVPAG
PNAWMNTTQA NQTVKLTLDT NNYSSNAGNH GVPSNNIIHA YDSDFASWTA VGSFQNPVWT
NNDPATAMTS LGNGWYYLAY TVPNAGTYEG KVVHTGDWTT QYTGVGRAAD QGNISFTTTQ
AGQMVVFLLD TNTSRLTIRP QTAGNAGPWC APGTYQTPQW QETGSPLVDN GTQGDLVSGD
GVFSLDVVIP AAGTYEWKVN ACNWATAFPA ANAWIYTDQP NQTVKLLFDS NNHAADNGWD
LLPTQNVVNA IDASNDFTVV GAFQGWSNNN PATKMIQIAP NQFVLHYTIA APGDYAAKFT
RTGSWVEQYN AQGRVFDAND PAPIGFTTTN LNETVVFYLD NRTGRVAITP QRDGQVPDSV
IGDGLINRDA IEHHSRESLY RVPFGAAPLN QAVNLRLRTA AHDINQARIR LYYTANNGQS
IQRMTKVASD EMYDYWEYTM PAQTALGVVY YRFIIEDGAT TLFYEDDNRF DGGLGEVVNA
SADRSWNIYI YDPAFTTPEW AQNATIYQIF VERFRNGDTS NDPTGVATDT TYPGRGWFYP
TERGHRFPVT PWNSIVPDPE PFTDQNNPWW STYSSTMYGG DLNGVQDKLD YLQDLGVTTL
YLNPIFDSPS NHKYDGRNYR TVDPAFGGQQ AFDDLVADAH GRGMTVVLDG VPNHVSSDSP
FFDRFGRHAE VGACESTSSP YRTWFFFEPA AEPGTGVCAG DTNYRGWFGV ATLPQINTNH
PEVMAYWFGT AGGNPNLPTN TASYWVDGTN KADGWRIDVV PDVIGVNPTF FETWRDVMKA
ANPDAVLYSE TWSEGDVRDR VLGDEFDSTM NYRYRRAVLG FLRDTRWVDN DGGQEIDPLL
PSQFVNSFET IREDYPEPAY NAAMNLIDSH DTNRAVHVLN ELGFTGTGYD RQPVDNFVDA
RHRLSLVAAL QMTLPGAPTI YYGDEVGLTG YGFDVPRDDP YNRQPYPWTD QAGYNSLPEW
RKADTNLLAT YQRLGQLRRD YSYLRTGSFD VMTANDANKV LAYGRKDENG AAIVVFNRDS
QAHSIELNLA GYVPTGTVLT RTMPLTQTGL LPATDVTTYT FSVAPQSVGI WLTPDSVDMS
APAAPTNLQV TNQLSQSIEL GWNSVATAES YNIYRSIVSG GGYELVGNTT STSFSSADLT
PGQRYYYIVK AVRNGLESTA SNEVSGLPAY VINWSNLQFP ATINHTISIT NPTTNIYGQV
YIAGVTSEVG ATPSVLAEVG YGPDGSQPNT ASWTWFDANF NVQNGNNDEY VGNLLPSSVG
TFDVAYRYST DGGTSWIYAD LDGSDNGYSP AQAASLVVAA SSDTTAPTAP LNLREVRRSA
SQIVIGWDVS SANDTYRYDV YRDGNAIASV LHPTTIFTDT EVTAGITYTY QLKALDGSFN
QSEFSNSVSI RAEQRVIDVT FRVKVPVETP TNESVYIAGN DGTVFNGAWT ANGQIMQRVL
TDTWEFNKQI LEGTALEYKY TRGAWERVES WGTIEAFTNR RVTIEYGNNG HFIVDDTDTN
WGTGDDNRKA VQAWRDPLVS SSSIANNAVE VAPTVQPAIT WSQVVTTTTP ISQVLVLTNA
QNQSVAGTVV ATSPTTFSFI PAAPLPNGTY TLTAFNVRRV IGDPSPMQQR YVVRFTVGRD
ILKIFLPIVG RSN