Gene Haur_1431 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1431 
Symbol 
ID5733339 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp1657174 
End bp1659423 
Gene Length2250 bp 
Protein Length749 aa 
Translation table11 
GC content51% 
IMG OID641278569 
Productglycoside hydrolase family 3 protein 
Protein accessionYP_001544203 
Protein GI159897956 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1472] Beta-glucosidase-related glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0291974 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCGCGA GCGATCAACA AATCAACGAC TTGTTGACTC AAATGACGTT GGAAGAAAAA 
ATTTCGCTGA CGATCGGCCA AGATATGTGG AGCACCCACC CCGTCGAACG CTTGGGGCTT
GGCTCGATTA ACATGAACGA TGGCCCACAT GGCTTGCGCA AACCCCCCGA AAATTCCTCA
ATTGGCATTA TCGATGCGAT ACCGGCAACC TGTTTTCCCA CCGCTGCTGC TGTTGCCTCA
ACGTGGGATG TTGATTTGAC CAAAGCGATT GGCGAGGCGA TTGCCCAAGA ATGCTTAGCC
AACAATGTGC AAATTGTACT TGGGCCTGGC ATCAACCTCA AACGCACGCC CTTGGGTGGC
CGCAATTTCG AATATTATTC CGAAGATCCG GTTTTGGCTG GCGAGTTGGG CACGGCATTT
GTCGAAGGTG TGCAAAGCCA TGGCGTTGGC ACATCGCTCA AACATTATGC CTGCAACAAC
CAAGAATTCG AGCGCATGAC GATTAGCTCG GAAGTCGATC AACGTACTTT GCGCGAGTTG
TATTTAGCAG CCTTTGAACG TGTGGTCAAA CGAGCGCAAC CTTGGACGAT CATGGCGGCC
TATAACAAAA TCAATGGAAT CTATGCGACC GAACATCGCC AACTGCTAAC TGAAATTCTG
CGCGAAGAAT GGGGCTTTGA AGGAATTGTC GTTTCCGACT GGGGCGCAGT TAACGATAAG
GCTGCCGCAT TAACTGCTGG CCTCGATTTG GAAATGCCTG GCCCAGCGCT TAATCATGTT
GAATTTTTGG CTGGTTTGGT ACGCAAGGGA GCGCTCTCAG AAACCGTTAT CGATACTGCT
GCCAGTCGCA TGCTCAAGAT TATTCTGCGT GGCATTGCCC AACGCCAGCC CGCAGCCAGC
TACGACAAAG CTGCCCATCA TGCTTTGGCT CGCCGTGCTG CCAGCGAATC GATGGTGCTG
CTCAAGAACG ATGGTATTTT GCCGTTGCAG CCAACTGCTG GGAGCACAGT CGCGGTGATT
GGCAATTTTG CCCAAAAGCC ACGCTATCAG GGTGCTGGTA GCTCGGAAAT TAATGCAACT
CAGGTTGATA CGCCACTCGA GGCCTTGCAA ACGTGGCTAA AAAACCAATC GGTTGAAGTT
AATTTTGCTG CTGGCTACGA TCACGATGGC AATACCAACG ATCAATTAAT TGCTGAAGCG
GTGGCAGCGG CCAAAAACGC TAGCCTGAGC TTAGTTTTGG TTGGTCTACC CGATGCCTAC
GAAACTGAAG GCGCTGATCG GGCACACATG AACATGCCAA CTGGACATAA TCAATTGCTT
GAGGCAGTAG CTGCGGTTCA AGCCAACACC GTGGCAATTT TGATCAATGG CTCAGCCGTG
ACGATTCCAT GGCTTGATCA AGTGCGTGCG GTGCTTGAAG CAGGTTTGGC AGGTCAGGCT
GTCGGCAGCG CTTTGGTCGA TGTGCTTTCG GGCGCGGTCA ATCCCAGTGG CAAATTGGCC
GAAACCTTCC CTTACGATCT TGCTGATACT CCAGCCTTTT TGAATTATCC AGGCGAGGCG
GGAGTGGTGC GCTATGGTGA AAGCCTGTTT ATTGGCTATC GCTACTACGA TGTGCGCAAG
GTCAAGCCAT TATTCCCGTT TGGCTATGGC TTATCCTACA CCAGTTTCCG CTATGATCAG
ATTGCGCTGA GTGCTGCCAG CATCGATGAA GCTACGCCCT TGACTGTCAG TGTTACCCTG
ACCAATACTG GCGAACGGGT TGGCAAAGAA GTTGTGCAAG TGTACGTCAA ACCGAGCAAT
TCGGCCTATC TGCGTCCAGT TAAAGAACTA CGGGCGTTTG CTAAAGTTGA ATTGGCTGCT
GGCGAGACGA AAACCGTTGA ATTGACCCTC GTTGCCCGCG ATTTCAGCCT GTATGATCAA
CAACGAGCGG CTTGGCGTAT GGAAGGCGGC AGCTATCAGA TTTTGGCTGG TGGTTGCAGC
GCCGATCTGC CATTAGTGGC TGATCTGACG GTGAATGAAG ACCCACGTTC AGCTCGCAAA
GTGCTCACGC GCATGAGTTC CATCAAGGAA TTCTTGGATG ATCCGATTGG CGCTGAAATT
TTGCATGCGA CCGCTGGAGC TTTTATCGAA GGCCAAAGCG CTAGCACTCG CGCGATTTTC
GAGCCAATTC CATTAGCCAA ATTTGTTAAC TTCGGCTTCT TCGAGGCCAG CCAAGTTGAC
GAAATTGTAG CCAAGGTCAA TCAGGGCTAG
 
Protein sequence
MTASDQQIND LLTQMTLEEK ISLTIGQDMW STHPVERLGL GSINMNDGPH GLRKPPENSS 
IGIIDAIPAT CFPTAAAVAS TWDVDLTKAI GEAIAQECLA NNVQIVLGPG INLKRTPLGG
RNFEYYSEDP VLAGELGTAF VEGVQSHGVG TSLKHYACNN QEFERMTISS EVDQRTLREL
YLAAFERVVK RAQPWTIMAA YNKINGIYAT EHRQLLTEIL REEWGFEGIV VSDWGAVNDK
AAALTAGLDL EMPGPALNHV EFLAGLVRKG ALSETVIDTA ASRMLKIILR GIAQRQPAAS
YDKAAHHALA RRAASESMVL LKNDGILPLQ PTAGSTVAVI GNFAQKPRYQ GAGSSEINAT
QVDTPLEALQ TWLKNQSVEV NFAAGYDHDG NTNDQLIAEA VAAAKNASLS LVLVGLPDAY
ETEGADRAHM NMPTGHNQLL EAVAAVQANT VAILINGSAV TIPWLDQVRA VLEAGLAGQA
VGSALVDVLS GAVNPSGKLA ETFPYDLADT PAFLNYPGEA GVVRYGESLF IGYRYYDVRK
VKPLFPFGYG LSYTSFRYDQ IALSAASIDE ATPLTVSVTL TNTGERVGKE VVQVYVKPSN
SAYLRPVKEL RAFAKVELAA GETKTVELTL VARDFSLYDQ QRAAWRMEGG SYQILAGGCS
ADLPLVADLT VNEDPRSARK VLTRMSSIKE FLDDPIGAEI LHATAGAFIE GQSASTRAIF
EPIPLAKFVN FGFFEASQVD EIVAKVNQG