Gene Haur_2392 details

Gene Information

Locus tag: Haur_2392
Symbol:
ID: 5734273
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 3047564
End bp: 3049582
Gene Length: 2019 bp
Protein Length: 672 aa
Translation table: 11
GC content: 52%
IMG OID: 641279533
Product: Beta-galactosidase
Protein accession: YP_001545160
Protein GI: 159898913
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1874] Beta-galactosidase
TIGRFAM ID:
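
The length fields in this record are consistent with inclusive, 1-based coordinates. A minimal sketch of that arithmetic follows, assuming inclusive coordinates and a single terminal stop codon that is not counted in the protein length. The function names are illustrative and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive, 1-based start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(3047564, 3049582)
print(length, protein_length_aa(length))  # 2019 672
```

Both results match the Gene Length and Protein Length values listed in the record.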


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCGTGA CGCTGACTGC CAACGGTCTT GCAGTCAATG GTCAGGAAGT GCCGGTTTAT 
TCCGGCACTA TTCACTATTG GCGTTTAGAG CGCGACCGCT GGGACTATAT TCTCGACCAA
GCTAAGGCGC TTGGGTTTTC GATGATCGAA ACCTACATTC CTTGGGGCGT GCATGAAACC
GCTCCCGGTC AATACGATTG GGGCCAAATC GATCCACGCA AAGATCTCGC GGCCTTTATG
CGTTTGTGCC ATGAACGGGG AATCTGGCTC ATTGCGCGGC CTGGGCCATT GATCAATGCT
GAATTGACCG ACTTTGGCTT TCCCCAATGG ATTTTGGACG ATCCAGCGAT GCAAGCTCGC
ACCGTGCTGG ATACCATGCA TATGAGCTTA GCAGCGGGTT TGCATCCACC ACATCAATTT
CCCGTGCCAT CGTATACCAG CCCTGAATAT TTGGCCGCCG TGGGCGTTTG GTTCGACCAC
ATGTGCCAAG TGATTGTGCC GCAACTTGCG CCGCATGGGC CAATTGTCGC GGTGCAAAGC
GATAATGAAA CCTGCTATAT GTTTCATGAG CAAGCTTATG CGACCGATTA CAGCCCTTCG
TCGTTGGCCT TGTATCAAGC AATGTTGGCT GAGCAATATG GCGCAATCGA AGAATTGAAT
CAAGCCTATG CGACCAACTA TGCCAATTTC AGCGAAGTGA TTGCGCCGCG CGATGCCGAT
ATCGCCAGCC GCCCCGATTT AGTGGCTCAC CTCGATTGGG TGCGCTACAA AGAAGTGCAT
GTTAATTGGA TTGTCGCGAC CATGGCCGAT ATGCTGCGTG AGCGTGGCGT GGTTGATGTG
CCATTATTCC ACGATGTAGC CTTCCAGTAT CGCACGCCGC TGGATATTAA CGCCATGGAA
GCCAACGGCC ATGTCGATTG GGTTGGGATC AACTATTATC GCCAACCACA AGGCTTTGAT
GGCGCGATCA CCTTAGTGCG CTATATGGCT GGTACCACGC GCCTGCCGTT TATCCCTGAA
TTTGGCAGCG GCTTGTGGAT TCACCATGCG CTCACGCCAC GGCCTGAAGA AGCAGAATTT
GTAACTTTAG CGGCCTTGAT GTATGGAATT AAAGCCTTCA ATTTCTATAT GTTGGTCGAG
CGTGATCGTT GGTTGGCCTG CCCAATCACT CGTCATGGCG ATTATCGACC GGAATATGCG
CAATTGTTTG AGCGCTTGAT GGGCTTTTTG CAGCAGCAAC AATGGTGGAA CTTCCAGCGC
AAACCTGAAG TTTTGGTGCT GATGAGCTAT GATCTTGGGC GCTATTGGGC CGCCACTTCA
ACCTTGCACT ATGGCCATGT TGATTTACTC GGCCTGCCGC CAGCATTAAA TCGGGTCGAA
TTGGATTTGG GCTTTAGCAC CGATCTCGAA GTTGAAAGCG ACGATTTTCA TCCGCAAAGT
TGGTATGGCT CGTTGCGGAG CACGCTCGAT GCCAACCATA TCGACTACGA TTTGAGCGAT
AGCCATTTGC GCAGCTCACG AATTGCTGAC TATAAATTGG TGTTTGCCCA GAGCGTCGAT
TGGATGAGCC GCGCCGATCA ACAGCGGTTG TTGCAGGCTG CCGAAACTGG TGCAACCGTT
TGGCTTGGCC CGACCTTGCC AACCCTTGAC GAATACTTCC AGCCCTGCAC AATCTTGGCT
GATCAATTGG CGGGCAAACG CCAAGTTGCC TTGGGCAGTG GCCATTTGGG CTTACTGAGC
CAAGCCGAAT TAGGCGCATT TTGTGCCGAA GTTGCAGCGG GTTTGCCTGT ACGGCCCCAA
AACCCTCAAT TGGCGGTTAC CAGCCATCGT CACCAAGGCC GCGAAATTCT GTTTGTAGCC
AACCCAACCG CTGAAACGAT TAACTCAAGC TTGGATTTTG CCCATCCTGT GCGCTTAACC
AGCTTGTGGG GAGCATTCGC TGGCGCTGAT CTTCAACAAC AGCAGCCGAT TAGCCTAGCA
GCCTACACGA TTGCAATTGT CGAGGTGCAG CATGATTGA
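
The gene sequence is printed in space-separated blocks of 10 bases, so it needs cleaning before any analysis. The sketch below shows one way to strip the layout and compute a GC percentage comparable to the GC content field. The helper names are hypothetical, and the database's own rounding rule is not stated in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove whitespace (block spacing, line breaks) and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied as displayed in the record.
first_line = "ATGAGCGTGA CGCTGACTGC CAACGGTCTT GCAGTCAATG GTCAGGAAGT GCCGGTTTAT"
print(len(clean_sequence(first_line)))  # 60
```

Passing the full gene sequence to `gc_percent` gives a value to compare against the reported GC content.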
 
Protein sequence
MSVTLTANGL AVNGQEVPVY SGTIHYWRLE RDRWDYILDQ AKALGFSMIE TYIPWGVHET 
APGQYDWGQI DPRKDLAAFM RLCHERGIWL IARPGPLINA ELTDFGFPQW ILDDPAMQAR
TVLDTMHMSL AAGLHPPHQF PVPSYTSPEY LAAVGVWFDH MCQVIVPQLA PHGPIVAVQS
DNETCYMFHE QAYATDYSPS SLALYQAMLA EQYGAIEELN QAYATNYANF SEVIAPRDAD
IASRPDLVAH LDWVRYKEVH VNWIVATMAD MLRERGVVDV PLFHDVAFQY RTPLDINAME
ANGHVDWVGI NYYRQPQGFD GAITLVRYMA GTTRLPFIPE FGSGLWIHHA LTPRPEEAEF
VTLAALMYGI KAFNFYMLVE RDRWLACPIT RHGDYRPEYA QLFERLMGFL QQQQWWNFQR
KPEVLVLMSY DLGRYWAATS TLHYGHVDLL GLPPALNRVE LDLGFSTDLE VESDDFHPQS
WYGSLRSTLD ANHIDYDLSD SHLRSSRIAD YKLVFAQSVD WMSRADQQRL LQAAETGATV
WLGPTLPTLD EYFQPCTILA DQLAGKRQVA LGSGHLGLLS QAELGAFCAE VAAGLPVRPQ
NPQLAVTSHR HQGREILFVA NPTAETINSS LDFAHPVRLT SLWGAFAGAD LQQQQPISLA
AYTIAIVEVQ HD
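
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is self-contained and assumes translation table 11 assigns the same amino acids as the standard code, differing only in which start codons are permitted. This gene starts with ATG, so alternative start codons are not handled. The `translate` helper is illustrative, not the database's own tooling.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base block spacing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGAGCGTGA CGCTGACTGC CAACGGTCTT"))  # MSVTLTANGL
```

The output matches the first block of the protein sequence above, and the final codons `CAT GAT TGA` translate to `HD` followed by a stop, matching its last two residues.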