Gene Haur_2518 details


Gene Information

Locus tag: Haur_2518
Symbol:
ID: 5734396
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 3215545
End bp: 3217095
Gene Length: 1551 bp
Protein Length: 516 aa
Translation table: 11
GC content: 51%
IMG OID: 641279658
Product: glycoside hydrolase family protein
Protein accession: YP_001545284
Protein GI: 159899037
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3507] Beta-xylosidase
TIGRFAM ID:
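
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function names are hypothetical and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_length: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by a CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default,
    that the last codon is a stop codon that is not translated.
    """
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(3215545, 3217095)
print(length)                              # 1551
print(expected_protein_length_aa(length))  # 516
```

Both values agree with the Gene Length and Protein Length fields of this record.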


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAACGCC GCTTTCACAA TCCAATCATG ACGGGCTTTT ATCCAGATCC AGCAATTTGT 
CGGGTAGGCG AAGATTATTA TCTGATTCAC TCGACCTTTG AATATTTTCC AGGTGTGCCG
ATTCATCATA GCCGCGATTT AGTGCATTGG CAGCAAATTG GCCATATTTT AGACCGCCCC
TCGCAGCTCA ACCTCGACGA AATTCATCCA TCAGCAGGAA TTTTTGCCCC AACTATCAGC
TACCATGATG GCACGTTTTA CATGATTACC ACCTTGATTG CTGGCAAAGA GCGCCATGGC
AACTTTATCG TGACTGCTCA ATCGCCAGCT GGCCCATGGT CAGACCCCTA TTGGCTTGAT
GCTGATGGGA TCGATCCCTC GCTCTTTTTC GAGGATGGTC GGGCTTGGTA TGTGGGCAAT
CGTGGCAAAG CCAACCCCGA ATACGAAGGC CAATGCGAAA TTTGGCTGCA AGAGCTTGAT
TTGACCACGA TGCAATTGAT TGGTGAGCAA GCGGTGCTGT GGCGAGGAGC GCTCAATGGC
GTGATTTGGA CGGAAGGGCC ACATCTTTAC AAAATTGATG GTTGGTATTA TTTGCTGATT
GCCGAGGCTG GCACGGAATA CAATCATGCG GTAACGATTG CCCGCAGTAG CGAATTGACC
CAAGGCTATA TAGGCTATCC CGCCAATCCA ATTCTGACCC ATCGCCAGCT TGGCCGTGAT
TATCCAGTGA TGGGCACGGG CCATTCTGAT CTGGTGCAAA CCCAAAATGG CGAATGGTGG
CTGGTGTTGC TGGCAATGCG CCCATATGGC GGCGGTTTCT ATAATCTTGG GCGTGAAACC
TTCTTAGCTC CGGTTCAGTG GGAACAAGGT TGGCCGTTGA TTAGCCCTGG CACTGGCAAA
GTTGAATTGA GCTACCCTGC GCCCGATTTG CCCTTGCAAC GCTGGCCAGT TCAAGCTGCC
TGCGATCATT TTGATGGCGA CAATTTAGCC ATGCATTGGA TGTTTTTGCG CACGCCGCGT
TCGCAATGGT GGAGCTTGAG CGAACGAGTT GGTTGGTTGC GCATGCAACT GCGGCCTGAG
CAAATCAACC AAATGGTCAA CCCTAGCTTT GTTGGGCGAC GGCAGCAACA TATGAACTTC
TTAGCGCAAA CTATGCTGGA GTTTCAGCCG CAACAGCCGC AAGAAGTAGC GGGTATGGTG
CTGATTCAAA ATCATAACTA TCAGGTGCAA TTTGTAATTA CTGGCGAGCA GCAAGCCAGC
CTGATTGTGT GTCGCAATGG TGAGCAAGAA TGTTTGGCGC AAGTGCCAAT CGCCAGCCAG
CGCAACTATT TACGAATTGT GGCTTATGGG CAGGAATATA GCTTTTTTGT GGCCGAGCAG
CCCGATGCAT GGCGGCCAGT CTTTGAAAAT CTTGATGGCC GCTTTTTGAG CACTCCGGTT
GCTGGTGGTT TTGTGGGCAC AGTGATTGGT TTGTATGCCA GTAGCCAAGG CCAAACCAGC
CAAACTGTGG CCGATTTCGA TTGGTTTGAA TATCGCGAAA TCGCCGAGTA A
 
Protein sequence
MQRRFHNPIM TGFYPDPAIC RVGEDYYLIH STFEYFPGVP IHHSRDLVHW QQIGHILDRP 
SQLNLDEIHP SAGIFAPTIS YHDGTFYMIT TLIAGKERHG NFIVTAQSPA GPWSDPYWLD
ADGIDPSLFF EDGRAWYVGN RGKANPEYEG QCEIWLQELD LTTMQLIGEQ AVLWRGALNG
VIWTEGPHLY KIDGWYYLLI AEAGTEYNHA VTIARSSELT QGYIGYPANP ILTHRQLGRD
YPVMGTGHSD LVQTQNGEWW LVLLAMRPYG GGFYNLGRET FLAPVQWEQG WPLISPGTGK
VELSYPAPDL PLQRWPVQAA CDHFDGDNLA MHWMFLRTPR SQWWSLSERV GWLRMQLRPE
QINQMVNPSF VGRRQQHMNF LAQTMLEFQP QQPQEVAGMV LIQNHNYQVQ FVITGEQQAS
LIVCRNGEQE CLAQVPIASQ RNYLRIVAYG QEYSFFVAEQ PDAWRPVFEN LDGRFLSTPV
AGGFVGTVIG LYASSQGQTS QTVADFDWFE YREIAE
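
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the sequence is read in frame from its first base, and it uses the standard codon assignments, which translation table 11 shares with table 1 (the two tables differ in permitted start codons, not in amino acid assignments). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
# Codon table in NCBI order: bases T, C, A, G at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase the result."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence, in the 10-base display blocks used above.
print(translate("ATGCAACGCC GCTTTCACAA TCCAATCATG"))  # MQRRFHNPIM
```

Applied to the first 30 bases, the sketch reproduces the first ten residues of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.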