Gene Haur_2435 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2435 
Symbol 
ID5734316 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp3118970 
End bp3120325 
Gene Length1356 bp 
Protein Length451 aa 
Translation table11 
GC content49% 
IMG OID641279576 
Productalpha amylase catalytic region 
Protein accessionYP_001545203 
Protein GI159898956 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCATCCAT GGGCCAACGA CGTACAATGT TATCAAATTT ATCCCTTAGG GCTTTGCGGC 
GCACCAATTC GCAATGATCA AACCAGCGAA CCGATTGCGC GGCTCAAGCA ATTGCACACA
TGGATTGAGC ACTTACAACA TTTAGGCAGC AACTTGCTCT ACCTTGGGCC AGTGTTTGAA
TCGACAGCTC ATGGCTACGA CACAATTGAT TATTTCACGG TTGATCGGCG GCTAGGTAGC
AACAACGATT TACAACAGTT GATTGCAGCA TTCCACGCGG CGGGCATTCG TGTGTTACTC
GATGGCGTGT TTAACCATGT TGGCCGCGAT TTTTGGGCCT TCCGCGATGT ACAAAGCCAT
GGGCAGGCTT CGAGCTATAG CGATTGGTTT GCGGGCTTGG ATTTCAATCA ACACAGCCCA
TATGGCGATC AGTTTAGCTA TCAAGGTTGG CATGGGCACT ACGATTTAGT CAAATTGAAT
TTACACAATC CGGCAGTGCG CGAACATCTT TTCCAAGCAG TCAGCCAATG GATTGCACAA
TTTGGCATCG ATGGCTTACG GCTTGATGCT GCGGATCAGA TCGATCATGA TTTTTTGGCG
GCGCTGGCAG CTCATTGCAA AAGCCTGCGC AGTGATTTTC TATTGATTGG CGAAGTGGTG
CATGGCGATT ATCGCCAATG GGCCAACCCA ACGATGCTCG ATAGCGTGAC CAACTATGAA
GCCTACAAAG GCCTGTATTC CAGCCTGAAT GATCGCAACT ATTTTGAAAT TGCTTACAGT
CTCAATCGTC AATTTGGCGC TGGTGGTATC TATCGGGCTA TGCCGTTGTA TAACTTTGTT
GATAATCATG ATGTCGATCG AATCGCCAGC ACCTTGCACA ATCCAGCCCA TCTCTACCCT
TTACATCTAT TGCTTTACAC CATGCCAGGA ATGCCTGCGC TGTACTATGG CAGCGAATGG
GGCTTGCTCG GCCAGAAAAC CGCCACCAGC GACCAAGCCT TGCGACCAGC CTTGCCTCAA
CCCGACCAAA TTCAGGCCCA ACAACCAGAT TTACTGGCAG CGATTCGCCA ATTGAGCCAG
CTACGCCACG AATATGCTGC CTTACGTTAT GGCGAATATG CCCAAATATA TGTACAGGCT
GAACAGCTTG TGTTTGTGCG TTGGACTAAG CAGCAAACAA TTGTGGTCGC ATTGAATGCC
GCCCCAACCG CCCAAACAAT TCGATTTGTT GTGCCATTTG GTGAAGGCTC ACTGCTTGAA
GATCGCTTGA ATGGCGGGGA AATCAAGGTG CAGCATGGCC AATTACAGCT GACAATTCCC
GCAACCTGGG GCCAAATCTG GCAGCTGATT GATTGA
 
Protein sequence
MHPWANDVQC YQIYPLGLCG APIRNDQTSE PIARLKQLHT WIEHLQHLGS NLLYLGPVFE 
STAHGYDTID YFTVDRRLGS NNDLQQLIAA FHAAGIRVLL DGVFNHVGRD FWAFRDVQSH
GQASSYSDWF AGLDFNQHSP YGDQFSYQGW HGHYDLVKLN LHNPAVREHL FQAVSQWIAQ
FGIDGLRLDA ADQIDHDFLA ALAAHCKSLR SDFLLIGEVV HGDYRQWANP TMLDSVTNYE
AYKGLYSSLN DRNYFEIAYS LNRQFGAGGI YRAMPLYNFV DNHDVDRIAS TLHNPAHLYP
LHLLLYTMPG MPALYYGSEW GLLGQKTATS DQALRPALPQ PDQIQAQQPD LLAAIRQLSQ
LRHEYAALRY GEYAQIYVQA EQLVFVRWTK QQTIVVALNA APTAQTIRFV VPFGEGSLLE
DRLNGGEIKV QHGQLQLTIP ATWGQIWQLI D