Gene Haur_4065 details


Gene Information

Locus tag: Haur_4065
Symbol:
ID: 5735923
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 5190483
End bp: 5192273
Gene Length: 1791 bp
Protein Length: 596 aa
Translation table: 11
GC content: 53%
IMG OID: 641281216
Product: Alpha-amylase
Protein accession: YP_001546825
Protein GI: 159900578
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0366] Glycosidases
TIGRFAM ID:
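
The coordinates and lengths listed above are mutually consistent, which a short sketch can check. It assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes a terminal stop codon that is not counted in the protein length. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon.
    return gene_length_bp // 3 - 1


length_bp = gene_length(5190483, 5192273)
print(length_bp)                  # prints 1791
print(protein_length(length_bp))  # prints 596
```

Both results match the Gene Length and Protein Length fields of this record.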


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.540437
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCACGGA CCACTCGCGC AATGGCGCGG CTGACACTGT TCGTCTTGGT TGTGATCATC 
TTCTCGGTTG GCTTTTTTGG AACCGCTCCA CGGGCAACGC AAGCCCAAAC TACTCCACGC
ACGGCCTTCG TCCATTTATT CGAATGGAAA TGGACTGACA TCGCCAAAGA ATGCGAAAAT
TGGCTCGGCC CCAAGGGCTT CGCTGCTGTT CAAGTCTCGC CACCCCAAGA GCATATCCAA
GGCTCGCAAT GGTGGACGCG CTATCAACCA GTTAGCTATC AGATTCAAAG CCGCTCGGGC
ACTCGCGCCG AGTTCGCCAA CATGGTTTCA CGCTGTAAAG CTGTTGGGGT TGATATTTAT
GTCGATGCCG TGATCAACCA CATGACCGGC GTGGGCAGTG GCACGGGCGT AGCTGGCTCA
AGCTACACCA GCTACAATTA CCCCGGTAAT TATCAAACTC AAGATTTCCA CCACTGTGGC
CGCAATGGCA ACGACGATAT CAGCAACTAC CAAGATCGCT GGGAAGTTCA AAATTGTGAG
TTGGTTAACC TCGCCGATCT CAAAACTGAA TCAGATTATG TTCGCGGCAA ATTAGCTGCC
TATTTGAATG ATCTGCGCAG TTTGGGCGTA GCTGGCTTCC GCATTGATGC TGCCAAGCAT
ATGCCCGCCG CTGATATTGC CAACATCATG AGCCGCGCCA GCAATCCTTA CATCTATCAA
GAAGTGATTG ACCAAGGCGG CGAGCCAATT ACCTCAGGCG AATATACGGG CAACGGCGAT
GTGACTGAGT TCAAATACAG CACCAACATT GGCCGCATGT TCAAAACCGA CAAGCTTGCC
AACATGAGCA ACTTCGGCAC AGCTTGGGGC TTTATCGCCA GCGATAGTGC GGTGGTTTTC
ACCGATAACC ACGACAACCA ACGCGGCCAT GGCGGCGCTG GCAATGTCGT TACCTTCAAA
GATGGCAAAC TCTACGAACT TGCCAACGTC TTCGCTCTAG CTTGGCCCTA TGGCTATCCC
CAAGTCATGT CGAGCTACAA CTTCAGCAAC GGCGACCAAG GCCCACCCAG CAGCAATGTC
TACAATGGCA ACACCGCCGA TTGCGGTGGC AGCAACTGGG TTTGTGAACA TCGCTGGCGC
GGCATCGCCA ATATGGTTGG CTTCCGCAAC TACACTAGCA CAGCCTTCAG CACCAGCAAC
TGGTGGTCGA ATGGCAATAA TCAAATTTCG TTCAGCCGTG GCAGCTTGGG CTTCGTAGCA
ATCAACCGCG AAGGCAGCAG CTTGAGCCGC ACCTTTGCTA CGGGCTTGCC CGCCGGAACC
TACTGCGATG TAATTCACGG CGATTTCAAC AATGGCTCGT GCTCTGGCCC AACCATCAGC
GTCAACAGCA GTGGCCAAGC AACAATCACG GTCGCCGCAA TGGATTCAGT GGCAATTCAT
GGTGGCGCAA AAATCAATGG CACTAACCCA ACGCCAGTGC CAACCACCCC ACCAAGCGGC
AGCATCGCTG TCACCTTCAA CGAAAATGCC ACCACGGTTT GGGGCCAAAA TGTCTATGTG
ATTGGCAATG TCTCGGCACT TGGTAGCTGG AACACCGCCA ATGCTGTGTT GCTCTCATCA
GCAAGCTACC CAGTTTGGAG CAAGACAATC AACTTGCCAG CCAGCACCGC CATCGAATAC
AAATACATCA AGAAAGATGG TTCGGGCAAT GTGACCTGGG AAAGCGGTAG TAACCGTACA
TTTACCACGC CAAGCAGCGG CACGGTCACC CGCAACGATA CCTGGAAATA G
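
The gene sequence above is printed in blocks of 10 bases separated by spaces, so it has to be cleaned before any calculation. The following is a minimal sketch of how the spacing could be removed and a GC fraction computed. The helper names are hypothetical, and the example uses only the first line of the sequence rather than the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Drop the spaces and line breaks used to group the sequence into blocks."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence as printed in this record.
first_line = "ATGTCACGGA CCACTCGCGC AATGGCGCGG CTGACACTGT TCGTCTTGGT TGTGATCATC"
seq = clean_sequence(first_line)
print(len(seq))                        # prints 60
print(f"{gc_fraction(seq) * 100:.0f}%")
```

Applying the same two functions to the complete sequence text would give a value to compare with the GC content field of the record.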
 
Protein sequence
MSRTTRAMAR LTLFVLVVII FSVGFFGTAP RATQAQTTPR TAFVHLFEWK WTDIAKECEN 
WLGPKGFAAV QVSPPQEHIQ GSQWWTRYQP VSYQIQSRSG TRAEFANMVS RCKAVGVDIY
VDAVINHMTG VGSGTGVAGS SYTSYNYPGN YQTQDFHHCG RNGNDDISNY QDRWEVQNCE
LVNLADLKTE SDYVRGKLAA YLNDLRSLGV AGFRIDAAKH MPAADIANIM SRASNPYIYQ
EVIDQGGEPI TSGEYTGNGD VTEFKYSTNI GRMFKTDKLA NMSNFGTAWG FIASDSAVVF
TDNHDNQRGH GGAGNVVTFK DGKLYELANV FALAWPYGYP QVMSSYNFSN GDQGPPSSNV
YNGNTADCGG SNWVCEHRWR GIANMVGFRN YTSTAFSTSN WWSNGNNQIS FSRGSLGFVA
INREGSSLSR TFATGLPAGT YCDVIHGDFN NGSCSGPTIS VNSSGQATIT VAAMDSVAIH
GGAKINGTNP TPVPTTPPSG SIAVTFNENA TTVWGQNVYV IGNVSALGSW NTANAVLLSS
ASYPVWSKTI NLPASTAIEY KYIKKDGSGN VTWESGSNRT FTTPSSGTVT RNDTWK
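
The protein sequence above corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this under stated assumptions: translation table 11 assigns the same amino acid to each codon as the standard code and differs only in the start codons it permits, and this gene starts with ATG, so no special handling of the first codon is included. The codon table is built by hand and `translate` is a hypothetical helper, not a library function.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(raw_dna: str) -> str:
    """Translate DNA in frame 1 and stop at the first stop codon.

    Whitespace is ignored; any character other than A, C, G or T
    raises KeyError.
    """
    dna = "".join(raw_dna.split()).upper()
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence in this record.
print(translate("ATGTCACGGA CCACTCGCGC AATGGCGCGG"))  # prints MSRTTRAMAR
```

The printed residues match the first block of the protein sequence, and the last four codons of the gene (ACC TGG AAA TAG) translate to TWK followed by a stop, matching its final residues.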