Gene Haur_4805 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4805 
Symbol 
ID5736650 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp6130186 
End bp6132072 
Gene Length1887 bp 
Protein Length628 aa 
Translation table11 
GC content50% 
IMG OID641281971 
Productglycoside hydrolase family protein 
Protein accessionYP_001547564 
Protein GI159901317 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3693] Beta-1,4-xylanase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACAGCC GACGACGTTA CTGGTGGATA GTGCTGATGT TAGGATTAAT CGCGGGACAA 
TTCAGTTGGT GGCAATCGAG CAGCGGTCAA ACTAGCCCGC AAACCTTGCG GGCTGCGGCT
GGCACGTTTT TAATTGGCAG CGCTGCTGAT AGCGATTTTT GGAACTTCAG CGATCGCGCT
CAATATGAGG CAATTTTGGG TGGCCAGTTT AATATTTATA CCCCAGGTAA CCAATTAAAG
TGGGATGCCG TGCATCCTCA ACGTACAACC TACAATTTTG CTCCGGTTGA TCGGCATATT
CAAATTGCCA AAAGCTATGG TCAGCAAATC CACGGCCATA CCTTGCTCTG GCATCAACAA
AACCCCGGTT GGGTCGCCAA CCAACCCTGG ACAGCCAGCG AACTGACCAG CATTTTGTAT
GATCATATTG ATACAGTGGT TGGTCGCTAC AAAAACGATA TTGCAATTTG GGATGTAGCC
AACGAAGTAT TTGATGATAG CGGCGTGTAT CGCCGTTCGT TTTGGTATAA CACCATCGGC
CAAAGCTATG TTGAGTTGGG CTTCAGACGC GCTCGCCAAG CCGATTCGGA TGCGGTGCTG
ATTTACAACG ATTACAATAT CGAAGAAGTT AACGCCAAAT CGAATGCAGT CTATGCCATG
GTCAGCGACT TTTTGGCGCG GGGTGTACCA ATCGATGGGA TTGGCTTTCA AATGCACTTG
CTCGGTTCTG GCATCAATTA CAACAGCTTT GCCCAAAATA TGCAGCGCTT TGCTGATTTG
GGCTTGAAAA TTTATGTAAC TGAAGCCGAT GTACGCTTGC AATTGCCCGC GACCAGCACA
AGTTTGGCTC AACAAGCGAC GGTCTATCAA AATGTGCTTG ATCGCTGTTT GCGCCAGCCA
GCATGTCAAG CTTTTCAATT TTGGGGCTTT ACCGATAAAT ATTCGTGGGT TCCGAATACC
TTCCCAGGTT ATGGCGCGGC CTTGATCTAC GATGAACAAT ACAATCCCAA ACCAGCCTAT
ACCGCCATTT ACAATCGTTT ATTGCAAGGT CGGGGCGGCA CGCCAACTCC AACCAGTGCC
CCAACTGCTA CGCGCACCCC AACCGCCCAA CCAACTGCGA CCAATACGGC GGTTCCAACT
AACACGCCGA CCAGTAGCCC AACACCCACC CAAACAGCGC CCAGACCAGC GCTGATTTTG
AGTGCGCCCA GCCAAGTAAC GTTGGGTCAA GTCTTTACCT TGACGATTCA ATATGTCAAT
ATTGGCTTGC AATACACCAC CGTAACTAGC AGCCCAGCTG GCTTGGTGCA GCTTGATCCA
CCGTTGAGTA TGCCCTGTAA ATATAACCAA CACCCAACGC AATGCAAAAA TATCACCTTT
AAAGCAACCG CTTTGGGTAC GGTACAATTG AATGCCAGCG CAACTGGCGA AGTGCCAATT
GCTGGTGGTG GTTGGGCTTG GGGTTCAGCG TTTGCCCAAA ATCCAGTCAG CGTCACCATT
GTCGATAGTG TTCCAACCAA CACCCCAACT CCAACGCTCA CGCCAAGCCC AACCCCAATT
AGTGGTGGTT GCCGCGTCAA CTATGCGATC AACAGTCAAT GGGGCAATGG ATTTGTGGCA
AACGTGACGG TGACCAACGC CAGCAACACG CCGATCAATG GCTGGGCGCT GAATTGGAGT
TTTGCTGGCA ACCAGCAAAT TAGCAATGCT TGGAATACCA GCCTAACCCA AACTGGCAAT
GCTGTAGTGG CTCGTAACGC TGGCTGGAAC AACCTTATTG CTGCGCAAGG CTCGGCCTCG
TTTGGCTTCC AAGCCAGTTA CAGCGGCTCA AATGCGATTC CCAGCAGCTT TAGTTTGAAT
GGTGTGGCTT GTAGCATCGT GCCATAA
 
Protein sequence
MNSRRRYWWI VLMLGLIAGQ FSWWQSSSGQ TSPQTLRAAA GTFLIGSAAD SDFWNFSDRA 
QYEAILGGQF NIYTPGNQLK WDAVHPQRTT YNFAPVDRHI QIAKSYGQQI HGHTLLWHQQ
NPGWVANQPW TASELTSILY DHIDTVVGRY KNDIAIWDVA NEVFDDSGVY RRSFWYNTIG
QSYVELGFRR ARQADSDAVL IYNDYNIEEV NAKSNAVYAM VSDFLARGVP IDGIGFQMHL
LGSGINYNSF AQNMQRFADL GLKIYVTEAD VRLQLPATST SLAQQATVYQ NVLDRCLRQP
ACQAFQFWGF TDKYSWVPNT FPGYGAALIY DEQYNPKPAY TAIYNRLLQG RGGTPTPTSA
PTATRTPTAQ PTATNTAVPT NTPTSSPTPT QTAPRPALIL SAPSQVTLGQ VFTLTIQYVN
IGLQYTTVTS SPAGLVQLDP PLSMPCKYNQ HPTQCKNITF KATALGTVQL NASATGEVPI
AGGGWAWGSA FAQNPVSVTI VDSVPTNTPT PTLTPSPTPI SGGCRVNYAI NSQWGNGFVA
NVTVTNASNT PINGWALNWS FAGNQQISNA WNTSLTQTGN AVVARNAGWN NLIAAQGSAS
FGFQASYSGS NAIPSSFSLN GVACSIVP