Gene Haur_1153 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1153 
Symbol 
ID5733046 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp1323927 
End bp1325900 
Gene Length1974 bp 
Protein Length657 aa 
Translation table11 
GC content51% 
IMG OID641278293 
Productalpha amylase catalytic region 
Protein accessionYP_001543929 
Protein GI159897682 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00213284 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAACCAC TCACAGTTCA AACTATGCAT TTGCCCAATC CAACCACGTT CGATGATCTG 
CTCGATCAAC AGATTGCCAA TTCGCGTGAT CGCGATATTT TTCGCTTGCG CATGCAACGC
CATTTTGGCG ATTGTTTAGA AGCGCTAGGA GCGTTGTATG CCCAGCATCC AGCTTGGCCA
CAGTTGTTGG AGCAATTGCC CGAACGCTTG ATTACTGCCT ATGCCCAGCG CCGCGATGCC
CTGAAAATTC ACGATTTAGC CCGCGAAATC CAGCCCGATT GGTTTGCTGA GGCCACCATG
GTTGGCGGCA TTTACTATGT TGATCGCTTG GCAGGCACAT TGCGCGGGGT GATTGAGCAT
ATTGATTATT TGCAAGAATT GGGTTTGACC TATGTGCATC TGATGCCGCT ATTACAGCCA
CGCCATGGCC CCAACGATGG CGGCTATGCG GTGCTCGATT ATCGCTCGAT TGATCAACGG
CTTGGCAATG TGGCCGATTT TATCGAATTA AGCGATTTGC TCCGTACCAA CGGCATCAGC
TTATGCATTG ATGTGGTGGT GAATCACACG GCCAAAGAGC ATGAATGGGC AGTCAAGGCC
CGTGCTGGTG ATGCCCAATA TTTGGATTAC TATCTGAGTT TTGCCGATCG CAGTTTGCCT
GATGCCTATG AGCAACATTT ACCCGAAGTG TTTCCCGATT TTGCGCCTGG TAATTTTACT
TGGTATGCCG AGTTGAGCGA GCATGGCCGT TGGGTTTGGA CGACCTTCAA CGAATTTCAA
TGGGATTTGA ACTATACCAA CCCCATGGTT TGGCTGGAGA TGCTGGATAT TTTGCTGTAT
CTCGCCAATC TAGGCGTTGA TGTGCTGCGT TTGGATGCCG TGCCGTTTAT GTGGAAACGC
CTCGGCACGA ATTGCCAAAA TCAGCCCGAA GTGCTCGATT TGTTACAAGC TTGGCGAGCA
GCCATGCGGA TCGTCTGTCC GGCGACAATT TTCAAGGCCG AGGCGATTGT TGCCCCCGAC
GATTTGGTGC AATATTTGGG TTTGGGACGG CGCACAGGCA AGCTCTGTGA AATTGCCTAC
CATAATTCGC TGATGGTGTT GTTGTGGAGT GCCTTGGCCT CGCAACGCGC CGATCTGTTT
ACGCAATCGC TGTTGAACAT GCCTGCAACG CCCAGCAATG CCGCTTGGAT TACCTATGTG
CGCTGCCACG ATGATATTGG CTGGGCTGTG ACCGACCACA ATGCAGCTTT GGTTGGCGAA
GATGGGCCAT TGCATCGCCA ATTTTTAAGC GCTTGGTATA GTGGCGAATT TGCTGGTAGT
TTTGCGCGGG GCGAGGTGTT TCAATATAAT CCACTCACCA ACGATCGCCG AATTAGCGGC
ATGACTGCCT CGTTGGCTGG GCTAGAGCAA GCCTTGGAAA CCACCGATCC AGCAGCGATT
GAATTGACAA TTCGCCGGAT TGCGTTGCTG TATGCCGTGA TTTTTAGCTT TGGTGGCATT
CCGTTGATCT ATATGGGCGA TGAATTGGGC ATGCTCAATG ATCACAGCTA CTTGCATGAC
CCTACCAAAG CCAACGATAA CCGCTGGTTG CATCGCCCAG CCATGGATTG GTGCTTAGCG
GCCCAACGCC ATGATCCAAC TACGCTTGCT GGGCGCTTAT GGCAGGTATT GCGCCATTTG
ATTCAGGTGC GCCAACATAC TCCAGCCTTG CATAGCGCAG GCCAAACCTT GCCAATCTGG
ACACAGCAAC GCCATGTTTT AGGGGTGGTT CGAGTTCACC CATTGGGGCG AATTTTAATT
CTTGGAAACC TTTCCGCCAC CCCACAGCGG GTCAGTTTAG CGGTTATTCA ACAAGCAGGG
CTGGTTGGTC GCTTATATAA TTTGTTGGAT AACGATTCAC TTAATATCGA TACACAAAGC
CATGAAATTA TACTCGATGC ATATCAATGT TGTTGGCTCA GCATTCAAGC CTAA
 
Protein sequence
MQPLTVQTMH LPNPTTFDDL LDQQIANSRD RDIFRLRMQR HFGDCLEALG ALYAQHPAWP 
QLLEQLPERL ITAYAQRRDA LKIHDLAREI QPDWFAEATM VGGIYYVDRL AGTLRGVIEH
IDYLQELGLT YVHLMPLLQP RHGPNDGGYA VLDYRSIDQR LGNVADFIEL SDLLRTNGIS
LCIDVVVNHT AKEHEWAVKA RAGDAQYLDY YLSFADRSLP DAYEQHLPEV FPDFAPGNFT
WYAELSEHGR WVWTTFNEFQ WDLNYTNPMV WLEMLDILLY LANLGVDVLR LDAVPFMWKR
LGTNCQNQPE VLDLLQAWRA AMRIVCPATI FKAEAIVAPD DLVQYLGLGR RTGKLCEIAY
HNSLMVLLWS ALASQRADLF TQSLLNMPAT PSNAAWITYV RCHDDIGWAV TDHNAALVGE
DGPLHRQFLS AWYSGEFAGS FARGEVFQYN PLTNDRRISG MTASLAGLEQ ALETTDPAAI
ELTIRRIALL YAVIFSFGGI PLIYMGDELG MLNDHSYLHD PTKANDNRWL HRPAMDWCLA
AQRHDPTTLA GRLWQVLRHL IQVRQHTPAL HSAGQTLPIW TQQRHVLGVV RVHPLGRILI
LGNLSATPQR VSLAVIQQAG LVGRLYNLLD NDSLNIDTQS HEIILDAYQC CWLSIQA