Gene Haur_3131 details

Gene Information

Locus tag: Haur_3131
Symbol:
ID: 5735003
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 3956856
End bp: 3958646
Gene Length: 1791 bp
Protein Length: 596 aa
Translation table: 11
GC content: 54%
IMG OID: 641280274
Product: polysaccharide deacetylase
Protein accession: YP_001545896
Protein GI: 159899649
COG category:
COG ID:
TIGRFAM ID:
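
The coordinate and length fields above are related by simple arithmetic, and a short check makes that relationship explicit. The sketch below assumes 1-based inclusive coordinates and a CDS length that includes the stop codon; neither convention is stated in the record itself. The function names are hypothetical and used only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming the stop codon is part of the CDS."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # subtract the stop codon


length_bp = gene_length(3956856, 3958646)
protein_aa = expected_protein_length(length_bp)
print(length_bp, protein_aa)  # prints: 1791 596
```

Under those assumptions the result agrees with the Gene Length (1791 bp) and Protein Length (596 aa) fields.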


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.927219
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAGCGGT TTCGCCAATC TTCGATCATT CTGCTTCTGA CCATGGCTTT GTTGGGCCTT 
GGCAGCCTGT TGATCGCTCA GTGGTTACGC CCCGATCGTT CGCCACGCAT CTGGACAGGC
GGCAATATTT TGGCTAATGC CGATTGGCAG AGCGCCGCCG ATAACGGGAT TCCCGATGGT
TGGTCGGGCA ATGGCATCAA ACGCGCCGAT ACCACCAATG GCTATGTGCT CGATGATCAA
TATAGCTTGC AGTTGTATGG GGTCAATAGT TTTGCCCGTT CGCCACGCTT AACTGCCCAA
GCAGGCCAGC GCTATCGACT GGGCTTTCAA GCCCTGATCG ATCCCGGCAC GCAGCGAAGC
AGTTTGGGCG CACAAATTCA GGTTTGGGTG CATTGGGTCG ATACTGCTGG CGATGATATT
CGGCTCGATA AGCAAGCTCC AGTTTTGCTG GGCTTCGATA GCCAAGGCGC AGCAACCTGG
ACTCCAGTGT TGGTTGAAAC TGAACCCAGC CCCAATCAAG CCGCTTGGCT AGCAATTTCA
ATCCACGCGC TTTCTGATGA CCCAATTTAT CTTGATAATT TGAGTATGGC AGCGGCTGGC
ATTTACATCG AACCCTATCC GCAGGGCGCG ATTGCTGCTG TGAGTTTTAG TGTCGATTGG
GAAACAGCCA TGGGCGGGGC GATTCACTCG CTCAGTTTGT CAACCGATGC GATCAGCAGC
GCCACGACCT TGGGTTTACA AGCTCGCCAA GGCACATACA ATTTGCTGGA CTTGTTTGCG
CCGCATCACA TTCGCGGCAC ATGGTTTGGC AATGGCTATA ATTTTTTATT CGGCAACCAA
GAGCGCCGCA CCTGGATGGA CGACCCAACC TTTGCTTGGG CTGCCAGTAC GCCGCGCCGT
TGGCAAACCA TCGATTGGTC ACAAACGCCG TGGTTCAGCC ACGATCCCTA TGGCACGGTC
GAGAGCGATC CAGCCTGGTA TTTTGGCGAT TTACTTGAAC CGCTACGAAA CGCTCAGCAA
ACGATCGAAA GTCATACATT TAGCCATATG TACATTGGCT TTGCCCATGC CAAAGAGTTA
GCTAGCGATA GCAATGAGTG GCAGGCAGTG ACAAGCAGCC AAGGCATTGC GCCCGCCAGC
AGTTTAGCTT TCCCGTATGG CGGCAGCGAT GGCATGACCG AAGCCCATTG GCAAACCTTG
CAAGCCGCAG GTATTCGCAC CGTCGTGCGC ACCCGCGTGC TTGATGCCAC CAACGTGCAA
CGTGGTGTGA ATGATCGCCA TTTGTTGATC GATCGGCGTT GGTGGCAACC GCGCCAATTG
CCAGGCCACG ATCTGGTTGC CTTGCCCGAT GTTTACCTGA CCCCGCAAAC CGCTATGACT
GCAACCAATT ACTTGCAGCA AGCCATCGCT AGCGGCGGTG TGATCGATAT CTATGCCCAT
AATTACGAAA TTTACAACCC TGAGCAAATT GGCGTTTGGC AAACGGCAAT TCAACAGGCA
GTTGAAGCCC AAGCGTGGAT CGCCACAGTG CCGGAAATTG CCGACCGCTG GCGAGCACGC
CAAACGCTCC AACTCACCAT CGAACAAACC CCTGATCAAC TTCGGCTTCA GCTCGCCAAT
CCCAGTCGTT TCGATTTAGC CCAGCTTAGC TTGCGGCTTC CGGCTGAGAG CACTGGCAGC
GATAAAGGCA ATTTTGATCA GGCTAACCAT CGGCTGATTC TTGATTTACC AGCAAACTCC
GCCGAGGAGA TTACGATATG GCTCAAACCA TCAACGCCGC ACGTACCATA A
 
Protein sequence
MQRFRQSSII LLLTMALLGL GSLLIAQWLR PDRSPRIWTG GNILANADWQ SAADNGIPDG 
WSGNGIKRAD TTNGYVLDDQ YSLQLYGVNS FARSPRLTAQ AGQRYRLGFQ ALIDPGTQRS
SLGAQIQVWV HWVDTAGDDI RLDKQAPVLL GFDSQGAATW TPVLVETEPS PNQAAWLAIS
IHALSDDPIY LDNLSMAAAG IYIEPYPQGA IAAVSFSVDW ETAMGGAIHS LSLSTDAISS
ATTLGLQARQ GTYNLLDLFA PHHIRGTWFG NGYNFLFGNQ ERRTWMDDPT FAWAASTPRR
WQTIDWSQTP WFSHDPYGTV ESDPAWYFGD LLEPLRNAQQ TIESHTFSHM YIGFAHAKEL
ASDSNEWQAV TSSQGIAPAS SLAFPYGGSD GMTEAHWQTL QAAGIRTVVR TRVLDATNVQ
RGVNDRHLLI DRRWWQPRQL PGHDLVALPD VYLTPQTAMT ATNYLQQAIA SGGVIDIYAH
NYEIYNPEQI GVWQTAIQQA VEAQAWIATV PEIADRWRAR QTLQLTIEQT PDQLRLQLAN
PSRFDLAQLS LRLPAESTGS DKGNFDQANH RLILDLPANS AEEITIWLKP STPHVP
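
The sequences above are laid out in blocks of 10 characters, 60 per line, so the spaces and line breaks need to be stripped before any downstream use. The following is a minimal sketch of that cleanup, together with a GC-content calculation comparable to the "GC content" field. The helper names clean_sequence and gc_percent are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First line of the gene sequence from this record, as sample input.
first_line = "ATGCAGCGGT TTCGCCAATC TTCGATCATT CTGCTTCTGA CCATGGCTTT GTTGGGCCTT"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Applying the same two functions to the full gene sequence should give a length that matches the Gene Length field and a GC percentage close to the reported value, subject to how the source database rounds.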