Gene Haur_2969 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2969 
Symbol 
ID5734841 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp3745721 
End bp3747295 
Gene Length1575 bp 
Protein Length524 aa 
Translation table11 
GC content50% 
IMG OID641280113 
Productalpha amylase catalytic region 
Protein accessionYP_001545735 
Protein GI159899488 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAGC TTTTGAGTGG CCTGCTTTTA GCCGGGGTGA TTGTGGGCTG TGGCGGCCAA 
GCAACTCCAA CCGCTGTTCC GGCAACAGTG ACCAGCCAAT CAACCCCAAC CAGCCAAGCC
TTGAATCCAA CCGCTACTGC CGCTACTGTG GTTGATAACT CAATTACGCC AACGCCATTG
CCAACCAAAC CAACAGCCCC AATCTTTACC GCTGACGATG AGCGTTGGGC TGGCCGTTCG
ATCTACTTTA TTATGATCGA TCGCTTTGCC AATGGCGACC CAAGCAACGA CAACGCCGAT
GGCTTTGGGG CAGATCGTAG CGATCCACGG CGTTGGCATG GCGGCGATTT TCGCGGCATT
ATCGAGCGGC TCGATTACAT CAAAGGCATG GGCTTTGGCG GCATTTGGAT CACGCCAGTC
AGTAAGCAAA ATTCAACCAA TGCCTACCAT GGCTACTGGC AATACGACCC CTACCAAATT
GACCCGCATT TTGGCACGCT GGAAGAATTG CGCGAATTGG TCAGCGAAGC CCACAAACGC
GATATATTGG TGATGCTCGA TGTTGTGCCC AATCATATGG GCGATTTCTT GCCTGGCTCG
AAAGCTGCCC CGCCATTCGA TGACCCAACC TGGTATCACA ACAAGGGCAA CATTCAAAAT
TATGGCAATC AACAAGAGGT TGAAGATGGC GATTTGCTCG GGCTTGATGA TTTAGATCAG
GATAATCCTG CTACCCGTGC TGAATTACTC AAATGGATTG CTTGGCTTAA AACCGAAACT
GGGCTTGATG GCTTGCGAGT TGATACGGCC AAACATTTGC CCAAAGATTT TCTCCGTGAG
TTTGATCAAG CGGCCAATAC GTTTTCGCTG GCTGAGGTAT TTAGCAGCGA TGCGGGCTAT
GTTGCGCCCT ACACCGAATT TAACGACGCA ATTTTGGATT ACCCCTTGCA CAGCGCCTTT
AAAGAAAGTT TAGTCGGTGG TCGCACGTTG TTGGTGATTC AGCGCGTGCT CGAAAATGCC
GATCAACAGT ATCGCAATGT CCATGTCAAC GGCACATTTC TCGATAATCA CGATAACGAG
CGCTTTTTAT GCTTGGCAAC TGGCGGCCCC AACGCCGATA AAACCACTCA ATTGCGGCAA
GCTTTGGCGG TGCTCTATAG TTTGCGTGGC ATTCCGATTG TCTATTATGG CACCGAGCAA
GAACTCAACG GCTGCAAAGA TCCCTTCAAC CGCGAAGATG CCTTTGAATT GAATGCGACT
GATGTACCAG TCTATCAATG GATCAGCCAA CTCAACCAGA TTCGCCAAGC CCATCCAGCC
TTGCAACGTG GCACACTCGA AAGCCGCACA ACTCCTAGCG ATGCATGGGC CTTTCAACGC
ACGGCGGGCA ACGATACGGT CGTAGTTTGC ATCAATAACA CATGGAAATC GCTCGACTTG
GCAGTAACTG GTTTGACTGA AATTGCTGAT GGTGAGGTGT TGACTGATGC GCTTGGTAGC
GGTCAAATGA GCGTTAAAAA TGGCGAAATG AATTGTGCTC TACAACCAAA ACAGGTGCTG
ATCTATACCC GTTAA
 
Protein sequence
MKKLLSGLLL AGVIVGCGGQ ATPTAVPATV TSQSTPTSQA LNPTATAATV VDNSITPTPL 
PTKPTAPIFT ADDERWAGRS IYFIMIDRFA NGDPSNDNAD GFGADRSDPR RWHGGDFRGI
IERLDYIKGM GFGGIWITPV SKQNSTNAYH GYWQYDPYQI DPHFGTLEEL RELVSEAHKR
DILVMLDVVP NHMGDFLPGS KAAPPFDDPT WYHNKGNIQN YGNQQEVEDG DLLGLDDLDQ
DNPATRAELL KWIAWLKTET GLDGLRVDTA KHLPKDFLRE FDQAANTFSL AEVFSSDAGY
VAPYTEFNDA ILDYPLHSAF KESLVGGRTL LVIQRVLENA DQQYRNVHVN GTFLDNHDNE
RFLCLATGGP NADKTTQLRQ ALAVLYSLRG IPIVYYGTEQ ELNGCKDPFN REDAFELNAT
DVPVYQWISQ LNQIRQAHPA LQRGTLESRT TPSDAWAFQR TAGNDTVVVC INNTWKSLDL
AVTGLTEIAD GEVLTDALGS GQMSVKNGEM NCALQPKQVL IYTR