Gene Haur_4334 details


Gene Information

Locus tag: Haur_4334
Symbol:
ID: 5736194
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 5534064
End bp: 5535914
Gene Length: 1851 bp
Protein Length: 616 aa
Translation table: 11
GC content: 51%
IMG OID: 641281495
Product: glycoside hydrolase family protein
Protein accession: YP_001547094
Protein GI: 159900847
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1501] Alpha-glucosidases, family 31 of glycosyl hydrolases
TIGRFAM ID:
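
The coordinate and length fields above are related by simple arithmetic, and a short check can confirm they agree. The following is a minimal sketch, assuming the start and end coordinates are inclusive and that the final codon of the gene is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def coordinate_span(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of amino acids, assuming the last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the record above.
print(coordinate_span(5534064, 5535914))   # 1851
print(expected_protein_length(1851))       # 616
```

Both results match the Gene Length and Protein Length fields of the record.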


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.638626
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCATACA CCCTCGAATC AACAGGCTTA TTGGTATTTC AACGTGATGA CGAAGAAGTT 
TTGACGATTC AGCCCTACTT GCAGGCTTAT GGCTCGCAGT TTGCCGCATT AGGTGTACAA
ACCAAGCCTG ATACGGTGCA ATACACTAGT TTCGATAACC AGACCTTTGA GCTTAATCTT
GAAACGACGG CAACCGGCTA TCAGTTGGTG TTTCACGTGA AACATCAGGT TGCTCAAATT
GGTTTAGCAA TCGGCGTACA ACCAGCAGGC TCGTGGTATG GTATGGGCGA ACGGGTGATT
CAAAGTTGGC CGCTCAACTT GGCTGGTGTG CAAAGCCAAC CATTTATGAC CTATGATCAC
GCCAATGATG GCACACTCAA TATTGTTACG CCAGCCTGGA TTGGCGCGAA TGGCGTGGCT
TTTATCGTGG CCGAAGATAC AGGCCCGTTG CATGTCACAA TCGATAGCGA TCCAGCAGGT
GTGATTCGGC TGGTGCAGTT TCCTTCGCCA ACCCCATTTG GCGCGGGCTT GGATGGCAGC
GAAACCCATT ATGAAGGCAC GCGCTTGGTG CTCGATCTAC TGATTGCTGA AAATGTCAGC
GTTGCCGCTC AACACGTCAT TCAACAATTG GGCTACCCCA AGGCGGCTCC ACCCTTGGCA
ATGTTTAGCA AGCCAATTTG GACAACCTGG GCACGCTATA AAATGGATAT CGATCAAGCT
CAAACGCTGG CGTTTGCCCA AGAAATTATC GACCAACAGT ATCCATATTC GGTGCTAGAG
ATCGACGATC GCTGGCAAAC TGCTTATGGC GATTTAGAAT TCGATCGGCG CAAGTTTCCC
GATCCCAAGG CTATGGTCGA TCAATTGCAT CAGCTTGGCT ATAAAGTAAC CTTGTGGATT
CCACCATTTT TCGATCCAAA GAGCGCGGCT TTTGCTGAAG CTACTGCTAA TGGCTATTTG
GTCAAACATC CCGCCAACGA TCAACCGTAT TTGACCCGTT GGTGGCAAGG TTGGGGCGGT
TTGTTGGATG TCTCGAATCC TGCTGCCTTG GCTTGGTGGC AAGCAGGTTT GGAGCGCTTG
CAAACCTTGT ATGGCATCGA TGGTTTTAAA TTTGACGGCG CTGAAGGGAA TTTTCTACCC
GCCGAAGCCA AAACCCATCT GCCGATGACT CCCAACCAAT ATAGTGATCG CTATGTGGCT
TTTGTGGCCA AATCGTGGCA ATGGACAGAG GTACGAACTG GTTGGCGCTC GCAACAGCAA
CCAATCTTCT TCCGCGAATG GGACAAATGG AGTCGTTGGG GCATGGACAA TGGCTTGCAT
GCGGTCGTTA CCCAAGCGCT TGCGATGAGC GTGATCGGTT ATCCCTATGT GCTGCCCGAT
ATGATCGGCG GCAACGCCTA TAACGGTGAA TTTCCCGAGC GCGAGTTGCT GATTCGTTGG
ACGCAAGTCA CGGCATTATT GCCAGCGATG CAATTTTCAA TCGCGCCATG GCAGTACGAT
GTAGAAACCA GCCAGATTTG CCAGCGCTAT GCTCAATTGC ACGCTGAGCT AGAGCCATAC
ATTGCCGAAT TGGTGCAAGC GACCATCACC GATGGTACGC CTTTAGTTCG GCCTTTGTGG
TGGCACTATC CCGACGATGC CAGCACGCGC TTTATTGGTG ATCAATGGTT GTTTGGCGAG
CAATACTTGG TTGCGCCAAT GCTCCAAGCC AACCACTACC AACGTGACAT TTATTTGCCT
GAAGGTGGCT GGCGCGATTA TTGGACTGGC GAGAAATTCG AGGGTGAAAC CTGGCTCTAC
AATTATCCTG CGCCCTTAGA AACCCTGCCG TTGTTCGAGC GGCTGTGGTA G
 
Protein sequence
MAYTLESTGL LVFQRDDEEV LTIQPYLQAY GSQFAALGVQ TKPDTVQYTS FDNQTFELNL 
ETTATGYQLV FHVKHQVAQI GLAIGVQPAG SWYGMGERVI QSWPLNLAGV QSQPFMTYDH
ANDGTLNIVT PAWIGANGVA FIVAEDTGPL HVTIDSDPAG VIRLVQFPSP TPFGAGLDGS
ETHYEGTRLV LDLLIAENVS VAAQHVIQQL GYPKAAPPLA MFSKPIWTTW ARYKMDIDQA
QTLAFAQEII DQQYPYSVLE IDDRWQTAYG DLEFDRRKFP DPKAMVDQLH QLGYKVTLWI
PPFFDPKSAA FAEATANGYL VKHPANDQPY LTRWWQGWGG LLDVSNPAAL AWWQAGLERL
QTLYGIDGFK FDGAEGNFLP AEAKTHLPMT PNQYSDRYVA FVAKSWQWTE VRTGWRSQQQ
PIFFREWDKW SRWGMDNGLH AVVTQALAMS VIGYPYVLPD MIGGNAYNGE FPERELLIRW
TQVTALLPAM QFSIAPWQYD VETSQICQRY AQLHAELEPY IAELVQATIT DGTPLVRPLW
WHYPDDASTR FIGDQWLFGE QYLVAPMLQA NHYQRDIYLP EGGWRDYWTG EKFEGETWLY
NYPAPLETLP LFERLW
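
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The following is a minimal sketch in plain Python, assuming the space-separated 10-base blocks shown above and using the standard amino acid assignments, which NCBI translation table 11 shares (the two tables differ only in which codons may act as start codons). The helper names `clean`, `translate`, and `gc_percent` are hypothetical. `gc_percent` illustrates how a GC content figure could be computed; this sketch does not claim to reproduce the database's exact rounding.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above.
first_line = "ATGGCATACA CCCTCGAATC AACAGGCTTA TTGGTATTTC AACGTGATGA CGAAGAAGTT"
print(translate(first_line))  # MAYTLESTGLLVFQRDDEEV
```

The printed residues match the first two blocks of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` allows the same comparison against the whole protein and the reported GC content.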