Gene Haur_2832 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2832 
Symbol 
ID5734713 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp3599922 
End bp3601127 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content52% 
IMG OID641279975 
ProductAlpha,alpha-trehalase 
Protein accessionYP_001545598 
Protein GI159899351 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1626] Neutral trehalase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.613992 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGACAGC CTGCAATCAC GCAGTATATC GATGCTTATT GGGACAAACT TGTTCGTCAA 
CAGCCGGAAG ATCAAAAAAC CTTAATTGGC TTGCCCCATC AATTTATGGT GCCAACCCAC
GACCCTACCT TTCAAGAGAT GTTTTATTGG GATAGTTTTT TCATTGCGCT GGGGTTGGGT
GGCACGCGCT ACGAAGCGGT AATCGAGGGC ATGGCCGAAA ACATGGCCTA TCTCTACCAG
CGCTTCGGGG TGATTCCTAA CGCTAGCCGT TATTATTTTC TTTCGCGCAG CCAACCACCT
TTCTGGACTC AATTGATTTG GCTGGCCTAC CAAACCAAAC AGGCCGCTGG CGATCCTGAT
AGCGCTGCTT GGTTACAACG CCTGATGGCA CTCGCTGAGC AGGAGCATGC CAGTGTTTGG
CTGGCTACCA CCCATCCGCA TCAGCGCCAA GTGCATCGCG GGTTGTCGCG CTATTTCGAT
ATTAACTATT TGGATACCCT GGCCTGTTGC GAAAGTGGCT GGGATCATTC AACCCGCTGC
AATGGCCAAT GGATGAGCCA TTTGCCAGTT GATTTGAATA GCATTTTGTA TCTGCGTGAG
TGCGATTTTG CCCAAGCCGC CCGTGTGCGC GACGATCACG CGGCTGCCGA GCAATGGCAA
TCCTGCGCCG ATCAACGCGC CGAAACCATG CAAGCAGTTT TTTGGGATGC CGCTAGTGGC
TTTTTCTATG ATTACAATTA TCTCAATGAA GTGGCTGACC TAGATAACCC TTCGTTGGCC
GGATTTTACC CCTTGTGGGC TGGTTGGGCG ACCGAAGTTC AGGCGGCGCA GGTGGTCGAG
CAATGGTTGC CAAGCTTTAT GCGAGTTGGT GGTTTGGTGA CAACGCTCAA AACCCATGCT
AGCTATCAAT GGGCTAGCCC CAACGGCTGG GCACCATTGC AATGGATTGT CGTCGAGGGT
TTGTTACGCT ACGGCTATCA ATCCCAAGCG CGTGAGGTCA TGCAAGCATG GTGTACGCTC
AACGAAACTG TCTTCGAGCG AACCAACGCC ATGTGGGAAA AATATAACGT GGTTGACCCA
ACGGGCGAAG TTGAGGGCGG CAAATATGGC TCGTTGCCAG GCTTTGGCTG GTCGAATGCG
GTTTATCTCG ATTTCAAGCG CCGCTTAGCC CAACCAACTA TCGAACGCTG GAAGCTTGGC
GAATAA
 
Protein sequence
MRQPAITQYI DAYWDKLVRQ QPEDQKTLIG LPHQFMVPTH DPTFQEMFYW DSFFIALGLG 
GTRYEAVIEG MAENMAYLYQ RFGVIPNASR YYFLSRSQPP FWTQLIWLAY QTKQAAGDPD
SAAWLQRLMA LAEQEHASVW LATTHPHQRQ VHRGLSRYFD INYLDTLACC ESGWDHSTRC
NGQWMSHLPV DLNSILYLRE CDFAQAARVR DDHAAAEQWQ SCADQRAETM QAVFWDAASG
FFYDYNYLNE VADLDNPSLA GFYPLWAGWA TEVQAAQVVE QWLPSFMRVG GLVTTLKTHA
SYQWASPNGW APLQWIVVEG LLRYGYQSQA REVMQAWCTL NETVFERTNA MWEKYNVVDP
TGEVEGGKYG SLPGFGWSNA VYLDFKRRLA QPTIERWKLG E