Gene Haur_2431 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2431 
Symbol 
ID5734312 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp3114634 
End bp3116265 
Gene Length1632 bp 
Protein Length543 aa 
Translation table11 
GC content54% 
IMG OID641279572 
Productlicheninase 
Protein accessionYP_001545199 
Protein GI159898952 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3405] Endoglucanase Y 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000786066 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTGAAC AACCACGCTC TTTTTTTGGC TCACGCTTAA TTCAACGAGC TATTTTGATT 
TTGGCGATCT GCGCGGTGAT TGTGCCGTTA TTCGCCAGTA AATCATCATA TGCTGCCGCC
ACGCCACGTC GCCCGTTTCC CCAACATACC CAATATGCCA GCGGCACGAT CAAGCCCAAT
CATCGCAGCC AAGCCCAACT TGATAGCGAT GTTAAGGCAT TTTATGATGT TTGGAAAAGC
CGCTATGTGG TTCGCGCTGG CACGAGCAGT GCTGGCAACC CCTACTATCG GATTAGTTTT
GGCAGCAGTG CGCCCAACGT AACCGTTTCC GAAGGCCAAG GTTATGGCAT GGTGATTATG
GCCTTAATGG CGGGCTATGA TCCCGAAGCT CAAACAATTT TTGATGGTTT ATGGGAGTTT
TCGCGCACCA ATCCCAGCAA TATCGATTCG CGCCTGATGG GTTGGCGCAT TCCTAGCGAT
GGCTCGGGCA ATGATAGTGC TTTCGATGGC GATGCTGATA TCGCTTATGG CCTGATTTTG
GCCGATGCTC AATGGGGTAG CACTGGTCGA ATCAATTATG CCAGCGCGGC AAACACGGTT
TTGGATGGGG TTTTATCATC GACCATTGGG CCAAATAGCC GCTTGCCCAT GTTGGGCGAT
TGGGTTTCGC CGAATGGTAG CCCGCATAGC CAATATACGC CACGGCCCTC AGATTTTATG
CCCAGCCACT TCCGCGATTA CCGAGCCTTT ACTGGCAATG CCACTTGGGA TACGGTGCTG
AGCAAAACCC AAGGTGTGGT TGATAGCATT CAAGCCCAAT ATAGCCCCAA TACTGGCTTG
ATGCCCGATT TTGTGGTGCA AGCCAACACA ACGCCTAAGC CATCGCCTGC CAACTTCTTG
GAAAGCGAAA ACGATGGCAA TTATTACTAT AACTCGGGTC GTGTGCCATG GCGCTTGGGA
GCCGATGCCG TGATTTTTGG CGACGCTGCT TCATTACGTC AAGCTCAAAA AATCTCGCGT
TGGATCGAGC AAGCCACTGG TGGGACAGCA AGCAATATTC GGGCTGGCTA TAGCTTGAAT
GGTACGGCCT TGCCCGATAG TGGCTATTTC AGCACCTTCT TTGCAGCGCC ATTTGGGGTT
GCAGCCATGA CCGTGCCAGC CAGCCAGCAA TGGCTCAATC GAGTTTACGA TGCGGTGCGC
AGTAATCACC AAGATTATTT CGAAGATACC GTAACGCTGC AATGTTTGCT ATTGATGTCG
GGCAATTATT GGTCGCCAAG CCGCAGCAGC ACCAGCCCAA CCGCAACCCC ACGCCCTGCA
ACTGCAACCC CACGCCCTGC GACGGCAACG CCACGACCCG CCACCGCAAC GCCACGTCCA
GCAACTGCGA CCCCGCGCCC AGCCACCGCC ACCCCACGCC CAGCCACCGC CACCCCACGC
CCAGCCACCG CCACCCCACA ACCTGCCACG GCAACCCCAA ACGGCGTTGC AGCGTGGGAT
GGCAATATGC GAGCCTACAA AGTGGGCGAT CGGGTCAGCT ACAACGGGCG CATCTATCGC
TGTTTACAAG CACATACCTC GTTATCAACT TGGACTCCTG AGGCTGTTCC GGCCTTATGG
CAAGCTGAAT AA
 
Protein sequence
MAEQPRSFFG SRLIQRAILI LAICAVIVPL FASKSSYAAA TPRRPFPQHT QYASGTIKPN 
HRSQAQLDSD VKAFYDVWKS RYVVRAGTSS AGNPYYRISF GSSAPNVTVS EGQGYGMVIM
ALMAGYDPEA QTIFDGLWEF SRTNPSNIDS RLMGWRIPSD GSGNDSAFDG DADIAYGLIL
ADAQWGSTGR INYASAANTV LDGVLSSTIG PNSRLPMLGD WVSPNGSPHS QYTPRPSDFM
PSHFRDYRAF TGNATWDTVL SKTQGVVDSI QAQYSPNTGL MPDFVVQANT TPKPSPANFL
ESENDGNYYY NSGRVPWRLG ADAVIFGDAA SLRQAQKISR WIEQATGGTA SNIRAGYSLN
GTALPDSGYF STFFAAPFGV AAMTVPASQQ WLNRVYDAVR SNHQDYFEDT VTLQCLLLMS
GNYWSPSRSS TSPTATPRPA TATPRPATAT PRPATATPRP ATATPRPATA TPRPATATPR
PATATPQPAT ATPNGVAAWD GNMRAYKVGD RVSYNGRIYR CLQAHTSLST WTPEAVPALW
QAE