Gene Haur_0080 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_0080 
Symbol 
ID5731973 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp104516 
End bp105655 
Gene Length1140 bp 
Protein Length379 aa 
Translation table11 
GC content52% 
IMG OID641277202 
Productpolysaccharide deacetylase 
Protein accessionYP_001542860 
Protein GI159896613 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0726] Predicted xylanase/chitin deacetylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTCGAC GTTGGATTCG TGGACTGTGG GGTTTGACGC TGTGTGGGGT GTTGTTGAGT 
TGTGGTCAGG GCACTGCCAC GCAAGCACCA CAAACCCAAG CTCCCACTGC AACCACAACT
GTTGCTGCTA GCCCAACCAC TGCCGCCTCG CCAACGACGG CTGCTTCACC AACCTTGGCT
GCAACAGCAA CTTCCGAAGC TTCACCAACC CCCGCTGGTC TCAGCGATGC CGATCTCGCG
AAGTATGCAC CCAATGAGAT TGGCTGGGTT TTGGTCTTAG AATATCACTT AATTGAATCT
CCCGATGCAG ACTATAGCCG TTCGCCCGAA AATTTGCGCA AGGATTTAGA GTGGTTGTAT
GCCAATAATT TTTACCCAAT GACCTTGCGC GATTTGGTTG ATAACAACAT TAGCGTGCCG
CTGGGCAAAT CGCCAGTTGT TTTGACCTTT GACGATTCAT CGGATGGTCA GTTCCGCTAT
CTTGAAGATG GTACGCTTGA CCCAACTTCG GCCATGGGCA TTTTGCAAGC GTTTGCCGCC
GAGCACCCCG ATTTTCCGGC GATTGCGGTG TTCTTCCCGT TGATCGATGT TGATGTAAAA
GAGCGGGTTT TGTTTGGTCA GCCTGAGTTT GCCACCCAAA AACTTCAAGA AATTGTGGCC
TTGGGCGGCG AGGTTGGCAC TCATACCTAC ACCCATCAAC GCCTCGACGA GGCTGATGCC
GAGCAAATTC AATGGCAATT AGCCTTCTCG ATCAAAGAAC TTGAGGAGCG TATTGGCGAT
GGTTATCAAG TCACGAGCCT GAGCTATCCC TTGGGTATGT TTCCCGAAGA TGAGAGTTTG
GTCCGCGAAG GCGAATCTGA AGGTGAGAGC TATACATTAA GCGCAGCGGT TGATGTCACT
GGTGGCGCTA GCCCTTCGCC CTATTCGCAA AACTTCGATC CCTACCATAT TCGCCGAACG
CAAGCGGTTG ATTCGCAATT GGAATATTGG TATCAGTTGT TTGAAGAACG ACCTGATCTC
AAATTTATCT CCGATGGCGA CCCCGATACA ATTACCGTGC CCAGCGAGGA AACGCTTGGC
GAGGAGCAAC AAGGCCGTTT GCGCCCCGAC TTGGAAGTGC GCCGCTACGA ACGGAAGTAG
 
Protein sequence
MIRRWIRGLW GLTLCGVLLS CGQGTATQAP QTQAPTATTT VAASPTTAAS PTTAASPTLA 
ATATSEASPT PAGLSDADLA KYAPNEIGWV LVLEYHLIES PDADYSRSPE NLRKDLEWLY
ANNFYPMTLR DLVDNNISVP LGKSPVVLTF DDSSDGQFRY LEDGTLDPTS AMGILQAFAA
EHPDFPAIAV FFPLIDVDVK ERVLFGQPEF ATQKLQEIVA LGGEVGTHTY THQRLDEADA
EQIQWQLAFS IKELEERIGD GYQVTSLSYP LGMFPEDESL VREGESEGES YTLSAAVDVT
GGASPSPYSQ NFDPYHIRRT QAVDSQLEYW YQLFEERPDL KFISDGDPDT ITVPSEETLG
EEQQGRLRPD LEVRRYERK