Gene Haur_3463 details

Gene Information

Locus tag: Haur_3463
Symbol:
ID: 5735324
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 4359817
End bp: 4361982
Gene Length: 2166 bp
Protein Length: 721 aa
Translation table: 11
GC content: 53%
IMG OID: 641280610
Product: glycoside hydrolase family 3 protein
Protein accession: YP_001546227
Protein GI: 159899980
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1472] Beta-glucosidase-related glycosidases
TIGRFAM ID:
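
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the reported gene length) and that the final codon is a stop codon that adds no residue to the protein. The variable names are illustrative only.

```python
# Values copied from the record above; the variable names are illustrative.
start_bp = 4359817
end_bp = 4361982
gene_length_bp = 2166
protein_length_aa = 721

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
span_bp = end_bp - start_bp + 1

# A complete CDS is a whole number of codons; the last codon is the stop
# codon and contributes no amino acid.
codons, remainder = divmod(span_bp, 3)
expected_protein_aa = codons - 1

print(span_bp, codons, expected_protein_aa)  # prints: 2166 722 721
```

Under these assumptions the span matches the listed gene length, and 722 codons minus the stop codon gives the listed 721 aa.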


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGCAGTATG AGCAACAAAT CGAAGCATTG TTGGCCCAAA TGACTCTTGC CGAGAAGATT 
GGCCAAATGC GCCAGCTTCA TGGCACTGGC GAAACCCAGC AGCAACTGGT GCGCGAAGGC
AACTTGGGGT CGGTTCTAAA TGTGATTGAT GCTGATGCCC ACGAGATTCA GCGCATTGCC
GTTGAAGAAT CACGCTTGGG CATTCCGCTG TTAATTGGCC GCGATGTGAT CCACGGTTTT
CGCACAATCT TCCCAATTCC ACTTGGCCAG GCTGCTTCGT TTAATCCTCA GCTTGTGCGC
GAAGCCGCGC GGATTGCCGC CCGCGAAGCC TCGGCCTCTG GGATCAACTG GACATTTGCC
CCGATGATCG ATATTTCACG CGACCCACGG TGGGGGCGGA TCGCCGAAAG CTGTGGCGAA
GATGCCTATC TTTCAAGTTT GATGGGTGTG GCGATGGTCG AAGGCTTTCA AGGCGACGAT
TTGACCGCCC CCGATGCGAT TGCTGCTTGT GCCAAACATT ATGTGGGCTA TGGCGCTAGC
GAAAATGGCC GCGATTACAA CACTGCTTGG ATTCCCGAAG TGCTCTTACG TGATGTTTAT
TTAGCACCAT TCAAAGCTGC CGCCGATGCT GGCGTGGCCA CCATGATGAG CGCCTTCCAC
GATTTGAATG GTGTGCCAAC CTCAGGCAAC GAATTTACGC TGCGTCAAAT TTTAAAAGGC
GAGTGGAATT ACGATGGTAT GGTGGTCAGC GATTGGGCCT CGGTTGCCGA AATGATCGCC
CATGGCTATG CCGCTGATTT GCGCGATGCT GCCTTGAAAG GTGTAACGGC TGGGGTCGAT
ATGGAAATGG CCAGCACCAG CTACGCCGAA TATCTGGCTG CGTTGGTTGA AAGTGGCGCA
CTCAGCCTCG ATTTAATTGA TGATGCTGTG CGGCGGGTGT TGCGCATCAA GTTCCGTTTG
GGTTTGTTCG ATCAACCGTA TGCTAACGCT GCGGCGGCTG ATTCAGTCGT TGCGCCTGAT
CATTTGGCTT TGGCTCGCCA AATTGCCAAA GAAAGTTGTG TGCTATTGAG CAATCAGCAA
ACTTTGCCGC TCAACCCACA ACAAACGCGG GTGGCAATTG TTGGGCCGCT CGCCAACCAT
GCCGCCGATC AACTTGGCTG CTGGGTATTC GATGGCAAGC CCGAAGATAG CCAAACTCCA
TTACAAGCGA TTCGCGAATT GCTTGGTGAC GAGCGGGTGC AATTTGCCCA AGGCTTGCCC
GAAGCCCGCA GCTTAGATCA AAGTCTATTT GGCGAGGCAG TCGCGGCGGC TCAAACTGCT
GATGTGGTTA TTGCCTTCCT TGGTGAAGAT GCTGGCTTGA GTGGCGAAGC CCATAGCCGC
GCATTCATCG ATTTACCTGG CGCACAACTG GCCTTAGTCG ATGCCTTGGT GGCAACCGGC
AAACCAGTGG TTGCGGTTGT GATGGCTGGA CGCTCGTTGG TGTTGGGCGA ATTGCAGGAT
AAAGTGCAGG CGATTTTATA TGCTTGGCAT CCTGGCACCA TGGCTGGCCC AGCGCTCGCC
GATTTGCTGT TTGGCTTGGA TAACCCTTCA GGCCGCTTGC CAATTAGCTT CCCGCGCACC
GTCGGCCAAG TGCCAATTTA TTACAATCGC AAAAACACTG GTCGCCCACC AAGCGAAGAT
GCACCGAGTA TTCCCACGGG CACGCCGCTT GATCCGAGTG GTTTTACCTC AAGCTACCTC
GATGTTGATC ATCGGCCCTT GTTTGCTTTT GGTTATGGCT TGAGCTACAG CACATTTAGC
TATAGCAATT TGCGTTTATC TAGCCAAAAA CTGGCGGTTG GCGACACACT TAGCATCACC
ACTACGGTGA CCAACACTGG CAAGTATGCT GGCGCAGAAG TGGTGCAATT GTATATTCGC
GATCTGGTTG GCTGTATGAC TCGCCCAATC AAAGAACTCA AAGGCTTCCA ACGAATTCAT
TTGGAGCCAG GCCAAAGCCA AACTGTAACA TTTGAACTTA GCAGCGCTGA CTTGAGTTTC
CATAACAACG CCATGCAACG GATCGTCGAG CCAGGCGAAT TTAATCTCTG GGTTGCGCCA
AGCAGCATTG GTGGTTTGCA GGCAAGCTTT GAATTAGTAG CCAAGAGCAA AGAACATCGA
GCATAA
 
Protein sequence
MQYEQQIEAL LAQMTLAEKI GQMRQLHGTG ETQQQLVREG NLGSVLNVID ADAHEIQRIA 
VEESRLGIPL LIGRDVIHGF RTIFPIPLGQ AASFNPQLVR EAARIAAREA SASGINWTFA
PMIDISRDPR WGRIAESCGE DAYLSSLMGV AMVEGFQGDD LTAPDAIAAC AKHYVGYGAS
ENGRDYNTAW IPEVLLRDVY LAPFKAAADA GVATMMSAFH DLNGVPTSGN EFTLRQILKG
EWNYDGMVVS DWASVAEMIA HGYAADLRDA ALKGVTAGVD MEMASTSYAE YLAALVESGA
LSLDLIDDAV RRVLRIKFRL GLFDQPYANA AAADSVVAPD HLALARQIAK ESCVLLSNQQ
TLPLNPQQTR VAIVGPLANH AADQLGCWVF DGKPEDSQTP LQAIRELLGD ERVQFAQGLP
EARSLDQSLF GEAVAAAQTA DVVIAFLGED AGLSGEAHSR AFIDLPGAQL ALVDALVATG
KPVVAVVMAG RSLVLGELQD KVQAILYAWH PGTMAGPALA DLLFGLDNPS GRLPISFPRT
VGQVPIYYNR KNTGRPPSED APSIPTGTPL DPSGFTSSYL DVDHRPLFAF GYGLSYSTFS
YSNLRLSSQK LAVGDTLSIT TTVTNTGKYA GAEVVQLYIR DLVGCMTRPI KELKGFQRIH
LEPGQSQTVT FELSSADLSF HNNAMQRIVE PGEFNLWVAP SSIGGLQASF ELVAKSKEHR
A
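
The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before reuse. The following is a minimal sketch, not the database's own tooling: it strips the whitespace, computes a GC fraction, and translates a coding sequence. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in which start codons are permitted), and it does not handle alternative start codons or ambiguous bases.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments,
# which table 11 shares); "*" marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First row of the gene sequence as printed above.
first_row = "ATGCAGTATG AGCAACAAAT CGAAGCATTG TTGGCCCAAA TGACTCTTGC CGAGAAGATT"
print(translate(first_row))  # MQYEQQIEALLAQMTLAEKI
```

The translated first row matches the first twenty residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` and `translate` would allow the GC content and protein length fields to be checked in the same way.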