Gene Haur_2828 details

Gene Information

Locus tag: Haur_2828
Symbol:
ID: 5734709
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 3595193
End bp: 3596839
Gene Length: 1647 bp
Protein Length: 548 aa
Translation table: 11
GC content: 45%
IMG OID: 641279971
Product: trehalose synthase
Protein accession: YP_001545594
Protein GI: 159899347
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0366] Glycosidases
TIGRFAM ID: [TIGR02456] trehalose synthase
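
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes the start and end coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon (both assumptions agree with the values reported here).

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(3595193, 3596839)
print(length)                     # 1647
print(protein_length_aa(length))  # 548
```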


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAAAACG ATTGGTATAA AGATGCAATC TTTTACGAAT TGCATGTACG AGCTTTTCAG 
GATAGCAATG GCGATGGCAA TGGCGATTTC CGTGGCTTGA TTTCGCGCTT GGATTATTTA
CAAAGTTTAG GCGTTGATTG TTTATGGTTA TTACCTTTTT ATCCATCGCC TGGCAAAGAT
GATGGCTATG ATGTTGCAGA TTATTGTAAT GTGCATCCTA GTTATGGCAC GATGGAAGAT
GTGGTTACAT TTTTCGAGGC AGCCCATTCA CGTGGCTTGC GGGTAATGAT TGACCTTGTA
GTTAATCACA CCTCGGATCA GCACCCATGG TTTCAAGCGG CACGCCAACC AGACTCGCCC
TATCGCGATT ATTATGTTTG GAGTGATACA AATCAGCGTT ACCGCGATGC ACGAATTATT
TTCACCGATA CCGAACGCTC GAATTGGGCT TGGGATGAAG TTTCCGAATC GTATTACTGG
CATCGTTTCT TCAGTCATCA ACCTGATTTG AATTATGAAA ATCCCGCAGT CTTGGAAGAA
ATGTACAATA TTATGCGTTT TTGGCTCGAT CGCGGGGTTG ATGGGTTCAG AGTTGATGCA
GTGCCTTACT TAATCGAACG TGAGGGCACA AATTGCGAAA ATTTGCCTGA AACTCATGTA
ATTTTGCGTA AAATGCGTGA ATTTGTTGAT CAACACTATC CACATTGTGT ACTGCTGGCT
GAGGCCAACC AATGGCCCGA TGATGTGCGC CATTATTTCG GCAACGATGA TGAATTTCAT
ATGGCCTTTA ATTTTCCAGT GATGCCACGC ATGTTTATGG CTGTACGCAA GGAAGATAGT
ACGCCAATTA TCGATATTGT ACGCCAAACG CCTAAAATTC CTGAAAACTG CCAATGGGCA
ACTTTTCTGC GCAATCACGA TGAATTAACT CTCGAAATGG TCACTGATGA AGAACGTGAT
TATATGTATC GTGAATATGC GGCTGATCCA CGCATGAAAA TCAATATTGG GATTCGACGA
CGTTTAGCAC CCTTGATGGA TAATGCACGT CGTCGTATGG AATTGATGAA TAGCATGCTG
TTGAGTTTGC CAGGCTCGCC AATTATTTAT TATGGCGATG AAATTGGCAT GGGCGATAAT
ATTTATCTTG GCGACCGCAA CGGCGTGCGT ACTCCGATGC AATGGAATGG CGACCGCAAT
GCTGGTTTTT CGGCTGCCGA TTTTGCCCGT TTGTATAGCC CTGTGATTAT TGATCCGGTT
TATGGCTATC AAGCGATCAA TGTTGAGGCG CAGGAGCGCG TGCAATCATC ATTATTAAAT
TGGATGAAGC GCTTAATTCG GGTGCGCAAG CGTTATTCAG TCTTTGGGCG TGGCGATATT
CAAATTCTCG AAACGCAAAA TCGCAAAGTT TTAGCCTATT TGCGCAGTTA CGCCGACCAA
ACGGTGCTGA TTGTCAATAA TCTTTCGCGC TTTATCCAGC CAGTTGAGCT GGATTTGGCT
GAGTTTGCGG GCATGCGCTT GGTGGAATTA ATTGGTGAAA CGCCATTTCC GGCAATTGCG
ACGACACCTT ATTTTCTGTC GCTAGCGCCG CATGGCTTTA TCTGGTTCCG AATCGAAGGA
GTGCGCCTTG CTCTCGACGA CGCTTAA
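
The gene sequence is printed in space-separated blocks of ten bases. A minimal sketch of how a GC fraction could be computed from text in that layout follows. The helper name gc_fraction is hypothetical, and how the database itself derives or rounds its GC content field is not stated in this record.

```python
def gc_fraction(seq_text: str) -> float:
    """Fraction of G and C bases; spaces and line breaks are ignored."""
    bases = "".join(seq_text.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# Paste the full gene sequence here to compare with the record's GC field;
# only the first line is used in this example.
first_line = "ATGCAAAACG ATTGGTATAA AGATGCAATC TTTTACGAAT TGCATGTACG AGCTTTTCAG"
print(f"{gc_fraction(first_line):.0%}")
```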
 
Protein sequence
MQNDWYKDAI FYELHVRAFQ DSNGDGNGDF RGLISRLDYL QSLGVDCLWL LPFYPSPGKD 
DGYDVADYCN VHPSYGTMED VVTFFEAAHS RGLRVMIDLV VNHTSDQHPW FQAARQPDSP
YRDYYVWSDT NQRYRDARII FTDTERSNWA WDEVSESYYW HRFFSHQPDL NYENPAVLEE
MYNIMRFWLD RGVDGFRVDA VPYLIEREGT NCENLPETHV ILRKMREFVD QHYPHCVLLA
EANQWPDDVR HYFGNDDEFH MAFNFPVMPR MFMAVRKEDS TPIIDIVRQT PKIPENCQWA
TFLRNHDELT LEMVTDEERD YMYREYAADP RMKINIGIRR RLAPLMDNAR RRMELMNSML
LSLPGSPIIY YGDEIGMGDN IYLGDRNGVR TPMQWNGDRN AGFSAADFAR LYSPVIIDPV
YGYQAINVEA QERVQSSLLN WMKRLIRVRK RYSVFGRGDI QILETQNRKV LAYLRSYADQ
TVLIVNNLSR FIQPVELDLA EFAGMRLVEL IGETPFPAIA TTPYFLSLAP HGFIWFRIEG
VRLALDDA
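
The protein sequence corresponds to a translation of the gene sequence under translation table 11. The sketch below shows one way to reproduce that translation in plain Python. The names are hypothetical, and the codon-to-amino-acid assignments are those of the standard genetic code, which table 11 shares (the two tables differ only in permitted start codons). Sequences containing ambiguity codes such as N would raise a KeyError in this simple version.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with the standard amino acid string.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq_text: str) -> str:
    """Translate DNA text (whitespace ignored) until the first stop codon."""
    dna = "".join(seq_text.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends the translation
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons of the gene sequence above.
print(translate("ATGCAAAACG ATTGGTATAA A"))  # MQNDWYK
```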