Gene Haur_1167 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1167 
Symbol 
ID5733060 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp1339259 
End bp1340515 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content50% 
IMG OID641278307 
Productglycoside hydrolase family protein 
Protein accessionYP_001543943 
Protein GI159897696 
COG category[R] General function prediction only 
COG ID[COG3858] Predicted glycosyl hydrolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000303009 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTTCGTC ATCTTTTTCG CGGGGTTCTG GCAATTGGGA TTTTGGTCTG GATCGTGTTT 
CTCTGGCAGT TTATTGATTT ACGGATCAAG CACAGTCGCG CCGCCGATGC CGCCGCCCAA
GCTCTCTTGC CCCGAGCAAC CTCAACCGCC TTGCCCGTTC CTACCTTTGT TGCGCCAACC
TTGCAGCCGC TGCCCAACAC GGTTGCCGAT GGTCAAGGCG GCTCAGAGCT ACCTGCTAGC
GCCAATGATG TGCGCGGGGT ACACCCCAAA ACAGGCCGCT ACGTCGCTGC TTGGCTGCCA
ACCTCGTTTG ATGCCGAGGC GGCCCGTGCA ACCTTTGAAG CCAACAAAGA TATTCTCGAT
GAGGTCAGCC CGTTTTGGTA TGGCGTGCGA CCTGATGGCA CGTTAATCGC CGACGTTGGC
TCACGCGATG CCGAATTGGT GCAAATTGCC AAAGAAAATA ATGTGCTGAT TATTCCAACT
GTGCATAATA TTGAAGATTT GGAAGCAGCT TCGGTGGTGT TGGCAACGCC CGAAAGCCGC
ACAAACCATA TTAATATTAT TATGGATGAG GTTCGGACCT ACGGCTACGA TGGCATCGAC
ATCGATTATG AATCGCTTGC GCTTGATTAT GAAGATGAAT TTACCGCCTT TATGACCGAA
TTGGGTGCTG CGTTGCATGC TGAAGATAAA TTATTAACCG TTGCAGTGCA TGCCCACACT
GGTCGCCCCG ATTACCAAAA TTATGCCGAT TTGGGCAAAG TGGTTGATCG GCTGCGGATT
ATGACCTACG ATTATAGCTG GCGTGGCTCG GAGCCAGGCC CAATTGCTCC GATGTTTTGG
GTCAAAGCGG TGGCCGAATA TGCCAAAACC CAAGTTGACC CCAGCAAAAT TCAAATTGGC
ATTTCGTTTT ATGCCTACGA TTGGCCAGGT AATGGCGGCT TTGGGGTTGC TCGCACCTAT
ACTGAAGTTG AAGAAATTAA AGCCACCTAT CAGCCCCAAA TTCGCTTAGT TGAGGAAGAT
GGCGGCCAGC AAATTCAGGA ATCAACTTTT AACTATGCTG GACGCACGGT TTGGTTCTCC
AATTATCGTT CGCTAACCGC CAAAATGGAG ATGGTGCGCG AAAACGATTT AGCAGGCATT
GCAATTTGGC GCTTGGGCAG CGAAGATCCC CAAAACTGGA CCTATATTCG CGAATCGTTG
AAACAAGATC CCTTGATCGT CCAACGCTCA ATCAATCGCT ATCTTCCTGG CCATTAA
 
Protein sequence
MFRHLFRGVL AIGILVWIVF LWQFIDLRIK HSRAADAAAQ ALLPRATSTA LPVPTFVAPT 
LQPLPNTVAD GQGGSELPAS ANDVRGVHPK TGRYVAAWLP TSFDAEAARA TFEANKDILD
EVSPFWYGVR PDGTLIADVG SRDAELVQIA KENNVLIIPT VHNIEDLEAA SVVLATPESR
TNHINIIMDE VRTYGYDGID IDYESLALDY EDEFTAFMTE LGAALHAEDK LLTVAVHAHT
GRPDYQNYAD LGKVVDRLRI MTYDYSWRGS EPGPIAPMFW VKAVAEYAKT QVDPSKIQIG
ISFYAYDWPG NGGFGVARTY TEVEEIKATY QPQIRLVEED GGQQIQESTF NYAGRTVWFS
NYRSLTAKME MVRENDLAGI AIWRLGSEDP QNWTYIRESL KQDPLIVQRS INRYLPGH