Gene Haur_3638 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3638 
Symbol 
ID5735499 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4575362 
End bp4577296 
Gene Length1935 bp 
Protein Length644 aa 
Translation table11 
GC content56% 
IMG OID641280787 
ProductBeta-galactosidase 
Protein accessionYP_001546402 
Protein GI159900155 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1874] Beta-galactosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCATTAG GTGTTTGTTA TTACCCCGAG CATTGGCCCC AAACGTGGTG GGCCGATGAT 
GCCAAACAGA TGCAGGCCTT GGGCTTGGAA TATGTGCGGA TTGGCGAGTT TGCTTGGGCT
TTGATGGAGC CAGCCGCTGG CCAGTATGAT TGGGTATGGC TCGACCAAGC AATTGAAACC
TTGGCGAGCC AAGGCTTGAA CATTGTGCTT GGCACGCCCA CCGCTACCCC GCCTGCTTGG
CTAACGCATA ACCAGCCCGA TTTAATGCGG ATCGATGCCC AAGGCCGTCG TTTAGGCCAT
GGTGGTCGTC GCCAAGCTTG TTTGGTCAAT CCTCAGTACA TCGAATATAG CCGCCAGATC
GTCACGGCCA TGGCCGAGCG CTATGGTCAG CATCCAGCGG TTGCCGCTTG GCAAATCGAT
AACGAAATTG GCAATCATGG CTCGGCGCGT TGCTACTGCG AACATTGTGC TGCGGCTTTT
CGCCAATGGC TGATCCAACG CTATGGCGAT TTAGCAGGCC TCAACGAAGC ATGGGGCACG
GCCTTTTGGA GCCAAACCTA CAGCGATTGG CAACAAATTC CCTTGCCAAA TGTACCAGTT
GGCGGCGGCC ATAATCCCTC GTTAGTGCTC GATTATCGCC GCTTCGCCTC GGATCAGCAG
GTAGCATATT GCGCGATGCA GGCAGAAATT TTGCGCCAGC ACTCTCCAAA TCGCACGATT
TTAACCAACA TCGCACCTGG CGACGATGAG ATTAATTGGT TTGATATGGC GCAGCAAGTC
GATACAATTG CTTGGGATAA TTACCCGCAT GGCTTTCCCG ATTGGCAAGC GGTGGCGATG
TATCACGACC ATATTCGTGG CCTCAAGCGT CAGCCATTTT GGGTGATGGA GCAACAGCCA
GGCCAAATCA ATTGGACTCC CACCAATCCA CCAGTGCCAC CCAACCAAGT GCGCTTGTGG
AGCTATCAAG ATGCCGCCCA TGGTGCAGCC AATGTGCTGT ATTTCCGCTG GCGGGCATGT
TGGCTCGGCC AAGAGCAATA TCATAGCGGC CTGCGCGATC ATGCCAATCG GCCAGCGCGT
GGCAGCACCG AAGCGCGGAT TGTTGCCAAC GAATGGCAGC AGCATGGCCA GCCCGAAGCT
GCACCGCGCA AGGTTGCCTT GCTGGTTTCC TACGACGATC ATTGGGCGCA ACAACTCGAT
CCGCATGCTC AAGGCTGGAA TTATTGGCAA TTGCTGCGCA CCATCCATCG CACGCTTACC
AGCTATGGCG TTGGGGTCGA TATTGTGCAG CGTGGCACGC CACTCGCTGC CTATCAACTA
GCGATTGCTG TCGCCCCAAT GCTCGATAAT CCTGCTGAAA CTGCGGGCTG GCGTGAGTGG
GTTCAGGCAG GCGGCACGTT GATCTGCACG CCACGCAGTT TAACCAAACG CCGCGACAAT
CGCACCGCTC CCGATGGCTT CCCCAGCGGC TTGACCGATT TATTTGGGGC TGATGTTGCC
GAGTGGAGCG CCCTCGACCC AGCCAAGCCG TGGGCAGTCA AATTTGGCGA GACGAGCCAC
ACCGCACCAC TTTGGATGGA AGTGCTGAAT GTGAGCCATG CCAATAGCTT AGCAACCTGG
AGCAAAAGCT ACGCAAAGGG TCAGGCTGCA ATCACCGCCG CGACCTATGG CAAAGGCCTA
GCAGTATTGA TGGGCTGCTA TCCCACCGAG GAAATTTTGG GCGATCTGCT GCCACGGCTC
TGGCCCGCTG CCCAACGCTT GCCCAACGAA ATTGAACGCA TCGAGTTGAC CGATGGCGTG
TTGTGGTTCA ACCATGGCGA ACAAGCCCAA AGCGTCAAAC TTCAAGGCAC TTGGCACGAT
CGCTTGAGTG GCGAGCAATG CAGTGGCGAT TGTTCAATCG AAAGTTTAGG TATTCGCTGG
CTCAAACCCC TATAA
 
Protein sequence
MPLGVCYYPE HWPQTWWADD AKQMQALGLE YVRIGEFAWA LMEPAAGQYD WVWLDQAIET 
LASQGLNIVL GTPTATPPAW LTHNQPDLMR IDAQGRRLGH GGRRQACLVN PQYIEYSRQI
VTAMAERYGQ HPAVAAWQID NEIGNHGSAR CYCEHCAAAF RQWLIQRYGD LAGLNEAWGT
AFWSQTYSDW QQIPLPNVPV GGGHNPSLVL DYRRFASDQQ VAYCAMQAEI LRQHSPNRTI
LTNIAPGDDE INWFDMAQQV DTIAWDNYPH GFPDWQAVAM YHDHIRGLKR QPFWVMEQQP
GQINWTPTNP PVPPNQVRLW SYQDAAHGAA NVLYFRWRAC WLGQEQYHSG LRDHANRPAR
GSTEARIVAN EWQQHGQPEA APRKVALLVS YDDHWAQQLD PHAQGWNYWQ LLRTIHRTLT
SYGVGVDIVQ RGTPLAAYQL AIAVAPMLDN PAETAGWREW VQAGGTLICT PRSLTKRRDN
RTAPDGFPSG LTDLFGADVA EWSALDPAKP WAVKFGETSH TAPLWMEVLN VSHANSLATW
SKSYAKGQAA ITAATYGKGL AVLMGCYPTE EILGDLLPRL WPAAQRLPNE IERIELTDGV
LWFNHGEQAQ SVKLQGTWHD RLSGEQCSGD CSIESLGIRW LKPL