Gene Haur_2678 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2678 
Symbol 
ID5734543 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp3433413 
End bp3435938 
Gene Length2526 bp 
Protein Length841 aa 
Translation table11 
GC content50% 
IMG OID641279820 
Productglycosyl transferase family protein 
Protein accessionYP_001545444 
Protein GI159899197 
COG category[R] General function prediction only 
COG ID[COG1216] Predicted glycosyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATCGGC CCAGCGTTTC GATCATTGTG ATCAATTTCA ATGGTAAAAA ACACCTCGTT 
GATTGTTTAA ATTCGTTGTT TGTTCAGCGC TACCCAAGCA GTGCGCTTGA AATAATTGTC
GTCGATAATG CCTCGCACGA TGGCTCCTGC GACTTACTTC GTCAGCAATT TCCCAAGGTT
CGTTTAATTG AAAATCGTGA GAATCTGGGT TTTGCTCCGG CAGTCAATCA GGCGGTACGG
CTCAGTCAAG CCCAATATGT TGCCTTGATC AATAACGATG CCAAAGCCGA TCCCAACTGG
ATCGAGCATT TAGTTGCCGA TATCGAAGCG CATAAAGCCG AAAAGGTAAT TGCGGTTGGG
GCAAAAATGC TCGATTGGGA AGGCCAACAG ATTGATTTTA TCCAAGCGGC GTTGAATGTC
TTTGGCCATG GCAATCAGCC ATTTACGCGC ATGCCAACTG CTAGCATCGC AGGCCAAGCT
GGCCCACAAC TCTTTGCTTG TGGCGGGGCT ATGCTGGCCG ATCGGGCGTT TTTTTTGGCA
ATTGGCGGCT TCGACGAAAG CTATTTTGCC TATTTTGAAG ATGTCGATTT TGGCTGGCGA
GCATGGTTGT TGGGCTACCA AATTCGTTTT AATCCATCAG CGCTGGTCTA TCATCGCCAA
CATGCCACCG CCAACACTAT GGGCGGACAT CAAATTCGCG CCTTACTTGA GCGCAACGCC
CTACGCACCA TCATCAAACA TTATGCCGAT GAGCAATTGT GGCGCATTCT GCCAGCTGCC
ATTTTGCTGA TTATTCAGCG TAGTTTGCTC GATGGCAGCG GTGGCTTTGA TCGCAAAGAA
TTCGATTTAC GGCTGCGCAA ACAGGGCGAC CAAACCAGCA CCATGCAAGT TCCCAAGATT
ATGTTGAGTT ACATCGCGGC TTTGGGCGAT GTGCTTGATG GCTGGGATAG CTTGTGGGCT
GAGCGTGAAC GCTTGCAAAC GCTTCGCCAA CGTAGCGATG CTGAATTATT TAATTTGTTT
GAACAACCAT TTGGCCTGAT CGATCTTGAT GTGCGTTTGC ATATGCAGCA GCAAACCATG
GTTGAAAGCT TTAAATTGCG AGAACTTATG CCCAACCCAA CCACGAATGT GCTGATTGTC
AGCATTGATC CCTTGCAAGC AGCCTTGGCT GGCCCGGCGA TTCGCAGCGT GCAAATTGCC
AAACAACTGA GCCACTCCTG CAAAGTTGTG CTAGCAGCGC CCGATCAGGC CGATCTTGCC
ATTCCAAATG TTCAAACCAT CGCCTTTCCT AGCAACGATG GCCGCAGCTT GGGTGAGTTG
GCGCTAAATG CCGAGGTCAT TATTGTTCAA GGCTATAGTT TGCAAAAATA TCCCCAATTG
CTGAATGCTG AACGCATTTT GGTGGTCGAT CTCTACGATC CCTTCCATTT TGAAGCCCTT
GAATTAGCCG AACGCCGTGG CCTCAGTTTA GAACGAGCGC TTGAACTGAA TGATGCCAGC
GTGGCAGCCT TGACGCAACA ACTAGCGCTT GGCGATTTCT TTATCTGTGC CAGCGAACGC
CAACGTGATT TGTGGCTGGG AGCCTTGACC GTTAGCAAGC GCTTGACTCC CGAACACTAT
CGCAATGATC CAACCTTACG CAAGTTGATC GATATTGTGC CATTTGGCTT GCCCAGTGAG
CCACCCCAAG CCACTCAGCC AGTGATGCGT GGCGTAATTG AGGGCATTCA GCAAAACGAT
GTAATTGCAT TATGGGGCGG CGGCATCTGG GAATGGCTTG ATCCATTGAC GATCATTCGG
GCCATGGCCG AATTGCAGCA GAGCCACCCC CAATTAAAGC TGGTCTTTAT GGGTGGGCAA
CACCCGAATA CCCAAGATGT TGGGGTGATG CAGCGGTATA GCGAAGCAGT TGAGCTAGCA
AAACAGCTGG GTTTATACGC CAAAACGGTC TTTTTCAATC AAACATGGGT CGCCTATGAT
CAACGGGTCA ACTATTTGCT TGAGGCCGAT TTAGGAGTTA GCGCTCATCA TAATCATACT
GAAACCCGTT TTGCCTTTCG CACGCGATTG CTCGATTACC TCTGGGCCAG CTTGCCAATG
ATCGTTTCGG CAGGCGATAG TTTGGCCGAT TTGGTGCAGC AACAACAGCT TGGTCAGGTT
GTCGCAATCG AAGATGTCCA GGGCTGGGTC GCCGCTTTAA CCCATGCCGC CGATCATCCT
TCCGATCGTC AGCAACGCCA AGCCCAATTT GCCAACATTC AACAAGCCTA TACCTGGGAA
CAAGCTTGTG CGCCGTTGGT CGAGTTTTGT CGCCAACCAC AGTATGCTGC CGATAAACGC
CGCAACGTCA AAGCCCAAGG CCAACAATCA GGCCAAACCA GCATGCGCTA CCGTATGGAT
GAGCTTGATC GGGCGGTTGC CGAGAAAAAT GAGCATATCG CCCAGCTTGA GCAGCATATC
AAAGCGCTAG AAAACGGTAA AGTCATGCGC TTGTTAAAGT GGGTCAATCG GTTGCGAAAA
AGTTAA
 
Protein sequence
MDRPSVSIIV INFNGKKHLV DCLNSLFVQR YPSSALEIIV VDNASHDGSC DLLRQQFPKV 
RLIENRENLG FAPAVNQAVR LSQAQYVALI NNDAKADPNW IEHLVADIEA HKAEKVIAVG
AKMLDWEGQQ IDFIQAALNV FGHGNQPFTR MPTASIAGQA GPQLFACGGA MLADRAFFLA
IGGFDESYFA YFEDVDFGWR AWLLGYQIRF NPSALVYHRQ HATANTMGGH QIRALLERNA
LRTIIKHYAD EQLWRILPAA ILLIIQRSLL DGSGGFDRKE FDLRLRKQGD QTSTMQVPKI
MLSYIAALGD VLDGWDSLWA ERERLQTLRQ RSDAELFNLF EQPFGLIDLD VRLHMQQQTM
VESFKLRELM PNPTTNVLIV SIDPLQAALA GPAIRSVQIA KQLSHSCKVV LAAPDQADLA
IPNVQTIAFP SNDGRSLGEL ALNAEVIIVQ GYSLQKYPQL LNAERILVVD LYDPFHFEAL
ELAERRGLSL ERALELNDAS VAALTQQLAL GDFFICASER QRDLWLGALT VSKRLTPEHY
RNDPTLRKLI DIVPFGLPSE PPQATQPVMR GVIEGIQQND VIALWGGGIW EWLDPLTIIR
AMAELQQSHP QLKLVFMGGQ HPNTQDVGVM QRYSEAVELA KQLGLYAKTV FFNQTWVAYD
QRVNYLLEAD LGVSAHHNHT ETRFAFRTRL LDYLWASLPM IVSAGDSLAD LVQQQQLGQV
VAIEDVQGWV AALTHAADHP SDRQQRQAQF ANIQQAYTWE QACAPLVEFC RQPQYAADKR
RNVKAQGQQS GQTSMRYRMD ELDRAVAEKN EHIAQLEQHI KALENGKVMR LLKWVNRLRK
S