Gene Haur_4336 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4336 
Symbol 
ID5736196 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5542255 
End bp5544999 
Gene Length2745 bp 
Protein Length914 aa 
Translation table11 
GC content53% 
IMG OID641281497 
Productalpha amylase catalytic region 
Protein accessionYP_001547096 
Protein GI159900849 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGAAACA AACGATTTGG CGGATTGCTC GCGTTGGTAC TGTTGGCCTT GGCCTTGTTG 
CCAAACCCGA AGTCGGTTGC CGCTGGCGAT AATAATGTGC ACTGGGATGA GCTGTATCAT
GCTGCTCCCA GCGCCAACCC TCGCACTGAG CTTGTGCCAG GCGAAAGTTT CAGCTTTCAA
CAAGCCGAAG TCAATGGCAC AATTGTTTCA ACCACCAATG TCCAAATTTC AATCTTAGCG
CTGGCTGGCG ACCTAACCAG CGCCCAAATT CGCTACTGGA ACGGAACCGA GCAGATCGTA
GCGATGAGCA AAGTCAAAAC GCTGACTGCC AGTTTCCGCA ACACCGCTAG CACCAGCTAC
GATCTGTGGC GTGGCACAAT TCCAGCGCAT GCTGCTGGCT CAACCGTGTA TTATCGGGTT
CGGGTGTATG ATGGCAGCGT TTTAGCCTTA CTCAAAGCCC AAAATGGCAG CTACACCAAT
CCACGCGGCC AGCATGTACG TGGCGCAAAC TATGATCCCG ATGATTATAG TTTTACGGTG
CAAAGCGGCG GTATGCCCAC AGCTACTCCA ACAATTGCCC CAACCGCCAC CCGCACCGCC
ACTCCCAGCG CTACAGCCAC GCGCACGCCA ACCCCGGTTG GCACTATCAG CGCTACGCCC
ACGCCATCGC GCACCCCAAC TGCGACTGCC ACCAACACCT CAACTCCCAC AGTTTCAGCA
ACACCATCGG GGGCTTGCAG CGGCGCAGCG GTTGGCAACA ATACAATTAT CAGCAGTGCG
GTGTACCACG ATAGCACCAA CAGCGTGTAT CGCGACCCGC TTGGTTCGCT GCAAGCAGGC
CAATCGGCCA GCATTCGTTT GCGCACTTGT AGTAATGATG TTAGCGCTGT GAGTTTATCG
GTTTGGTTAA CTGGCGCACC ATTCAGCCAA CCATCGTTCA GCTATCCATT AACTGTGGTA
AGCAATGATG GAACCTACGC CATGTGGCAA GCCAGCGTAC CAGCCCCCAG CAGCTCAACC
GATCAGTGGT ATCAATTTAA GCTGACCGAT GGCTCGACAA TTGGCTATTA TGTGGTTGCC
AACACCAGCA ACACAGGTCC AGGCGTGTGG AGCGCCACCG CGCTTGATCG TTCGTGGAAG
CTGGGTACAG TTCCCGCGCC ACCGCAAGAT TATGCCGTAC CAACATGGCT GCAAGATGCA
GTGATCTATC AAATTTTCCC TGATCGCTTC CGTGATGGCG ATAGCAGCAA CAATTTCAAT
AATGTGCGGG TCTACGGCCC AAACACCTGC AACGGCTACA GCGGTGCAGG TGCACCCAAC
TGTTTGGCCT CGATCCACAG CAATTGGAAT GAAACGCCAA CCACCCCAGG CTATGGCATC
GATTTTTATG GTGGCGATTT GCAAGGCATT GTCGATAAAA TCAATGCTGG TTATTTTAAC
GACCTTGGCG TTAATGTGCT GTATCTCAAC CCGATTTTTG ATGCTTCATC GAACCATGGC
TACGATACCA ACGATTATTA TGGTATCAAC CCACGCTTTG GCAATTTGGC AAAATTCGAC
GAGATGATTG CCGCCGCCGA TGCCAAAGGC CTCAAAGTGA TTCTTGATGG TGTGTTCAAC
CACGCTGGCA TGGATAGCAT TTATCTTCAA GGTTATCCAG GCTATAAAAC CGACCGCTGG
ACAGGCATCA ACGGCGCTTG TGAATCGGAT TCATCGCCCT ACCGCAGTTG GTTCACCCAA
GGCTCAGCTG GCACCAGCGG CTCGTACCCA TGTGTTGGCG GTTGGGGCTG GAAGGGTTGG
TATGGCTATG AAACCATCCC TGAATTTATC GAAAACGACC CAGTGAAGCA ATTTTTCTAT
CGCGATGGCA GCGCTCAAAG CCCCAATGGC AAATCAGTAA CCCGCTTCTG GCTCGAACGG
GGTATCGCTG GCTGGCGCTT CGATGTAGCC CAAGATATCA CCCACGCTTG GTGGAGCGAT
ATGCGGCCTT ATGTTAAAAA TGGTTATGGC GATAGCGAAA GTTTATTGCT GGGCGAAGTT
ACGGGCGGCT GTGATTGGGG CTTATATCGT GCCTATCTCA ACCAAAACGA GCTTGATTCG
GTGATGAACT ATTGTTTCCG TGATTGGGCA GTGAGCTTCG CCAATGGCAA TGCACCTAGC
TCATTCGACA GTAGCTACAA TGCCTTCCGT GCGCAAATGC CTGCTAGCCC ATGGTTTGGC
ATGATGAACT TAGTCAGTTC GCACGACTCA ACTCGCGCCT TGCGCTTGCT CAACGACGAC
AAAGCCCGCA TGAAATTGAT GGTGTTGTTG CAAATGACCC TACCAGGTGC GCCATCGGTG
TATTATGGCG ACGAAGTTGG GGTAACTGGT GGCGGCGATC CCGACAACCG CCGCACCTAT
CCTTGGGCCG ATAAAGGTGG TAGCCCCGAT ACGGTGATGT ATGCTCATTT CAAGAAATTG
ATCGCGCTAC GCCGTACCTA TCCAGCGCTC AGCAGTGGTG ATGTTGCAAC CTTGTTGGTC
AACGATGCCA GCAAACTTTA TGGCTATCGC CGCTGGAAAG GCACGCAAGA GGCAGTTGTA
GTGCTCAATA ACGGCACGGC CAACCAAACT GCGACAGTTA ATGTGAGCCA TTTAGCCAAT
GGCACAGTCT TGACCGATGT CTTGAATGGT GGCAGCTACA CCGTTAGCAA CGGCCAATTG
ACCTTGCCAG TCGCAGCCCA ATCGGGCGTG GTGCTGGTGA AGTAA
 
Protein sequence
MRNKRFGGLL ALVLLALALL PNPKSVAAGD NNVHWDELYH AAPSANPRTE LVPGESFSFQ 
QAEVNGTIVS TTNVQISILA LAGDLTSAQI RYWNGTEQIV AMSKVKTLTA SFRNTASTSY
DLWRGTIPAH AAGSTVYYRV RVYDGSVLAL LKAQNGSYTN PRGQHVRGAN YDPDDYSFTV
QSGGMPTATP TIAPTATRTA TPSATATRTP TPVGTISATP TPSRTPTATA TNTSTPTVSA
TPSGACSGAA VGNNTIISSA VYHDSTNSVY RDPLGSLQAG QSASIRLRTC SNDVSAVSLS
VWLTGAPFSQ PSFSYPLTVV SNDGTYAMWQ ASVPAPSSST DQWYQFKLTD GSTIGYYVVA
NTSNTGPGVW SATALDRSWK LGTVPAPPQD YAVPTWLQDA VIYQIFPDRF RDGDSSNNFN
NVRVYGPNTC NGYSGAGAPN CLASIHSNWN ETPTTPGYGI DFYGGDLQGI VDKINAGYFN
DLGVNVLYLN PIFDASSNHG YDTNDYYGIN PRFGNLAKFD EMIAAADAKG LKVILDGVFN
HAGMDSIYLQ GYPGYKTDRW TGINGACESD SSPYRSWFTQ GSAGTSGSYP CVGGWGWKGW
YGYETIPEFI ENDPVKQFFY RDGSAQSPNG KSVTRFWLER GIAGWRFDVA QDITHAWWSD
MRPYVKNGYG DSESLLLGEV TGGCDWGLYR AYLNQNELDS VMNYCFRDWA VSFANGNAPS
SFDSSYNAFR AQMPASPWFG MMNLVSSHDS TRALRLLNDD KARMKLMVLL QMTLPGAPSV
YYGDEVGVTG GGDPDNRRTY PWADKGGSPD TVMYAHFKKL IALRRTYPAL SSGDVATLLV
NDASKLYGYR RWKGTQEAVV VLNNGTANQT ATVNVSHLAN GTVLTDVLNG GSYTVSNGQL
TLPVAAQSGV VLVK