Gene Haur_5098 details

Gene Information

Locus tag: Haur_5098
Symbol:
ID: 5737056
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009973
Strand:
Start bp: 124481
End bp: 125491
Gene Length: 1011 bp
Protein Length: 336 aa
Translation table: 11
GC content: 49%
IMG OID: 641282263
Product: glycosyl transferase family protein
Protein accession: YP_001547854
Protein GI: 159901608
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0463] Glycosyltransferases involved in cell wall biogenesis
TIGRFAM ID:
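
The Start bp, End bp, Gene Length and Protein Length values above are consistent with one another. The short Python sketch below shows the arithmetic. It assumes the coordinates are 1-based and inclusive and that the CDS includes its stop codon. The record does not state either point, and the function name cds_lengths is hypothetical.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumptions (not stated in the record): coordinates are 1-based and
    inclusive, and the last codon of the CDS is a stop codon.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per amino acid, minus the terminal stop codon.
    protein_length = gene_length // 3 - 1
    return gene_length, protein_length


# Coordinates taken from the record above.
print(cds_lengths(124481, 125491))
```

With the values from this record, 125491 - 124481 + 1 gives 1011 bp, and 1011 / 3 - 1 gives 336 aa.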


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGACTGTC ACAATGCAAT TGTTCATCCC ACCGTCGCAC TTATTCCAGC TTTTAATGAA 
TCGCGATTCA TTGGGAGCTT GGTGCTTGCC GCAAAAGCGT ATGTTGATAT TGTGCTCGTG
GTCGATGATG GCTCTACCGA TCATACGGTT GCTATTGCTC AAAAGGCTGG TGCATGTGTT
TTACAGCATG CGGTAAATCA GGGGAAAGCC GCAGCGGTTA ATACGGGCTT TCGGTATATT
GCAACGCTTA ACCCCTTTGC TGTCGTGATG CTTGATGGGG ATGGCCAACA CAAAGTTGAT
GATATTCCAG CACTTTTAGC CCCTATCTGC CAAGGGCATG CCGATGTCGT GATTGGATCG
CGCTATGGTG CGATTCACAG CGATATCCCC CTCTATCGAA AGGTTGGGCA GATGGGATTA
ACCTCCTTAA CCAATTTGAT ATCAGGGGTG CAGGTTAGTG ACTCACAAAG TGGATTTCGC
GCCTTTTCAG CCCATGCCAT CGCCGTGATG TCATTTACGG CGAATGGCGG ATTCTCAATC
GAATCGGAAA TGCAGTTTCA TATTCATGAA CAGGCATTAC GGATCTGTGA GGTTCCCATT
CATGTGTTGT ATGTGGAAAA AGCCAAGCGA AACCCCATTG GCCATGGCAT GCAAGTGGTG
AAAGGTATTT TGGGCATCGC GACGACAATG CGTCCACTGC TCTTTTGGTG TGGCAGCGGC
TTCGCGACGT TAATGATAAG CACCGCGCTG CTGGTCTTCT TGGCGGCCCA TACAACGATG
GCATTGTCAC AGTTTGCCTG GCTGCTGAGC CTGCTCATGA TTGGGATGTT GTTGAGTATT
GGATCGATTG GAACTGGGAT TATCTTACAG CGCCAGCGCG TTATGTTACA ACGAATGGAA
ACGTCCTTAA AACAACAATT GATGCGTGCG CCCTCAGCGG CCTCCAACGA AACACTCTTC
CTGACACCAC GGGAGCGGGT TTATGACGGG GTGAATCAAC CGCTTAATTA A
 
Protein sequence
MDCHNAIVHP TVALIPAFNE SRFIGSLVLA AKAYVDIVLV VDDGSTDHTV AIAQKAGACV 
LQHAVNQGKA AAVNTGFRYI ATLNPFAVVM LDGDGQHKVD DIPALLAPIC QGHADVVIGS
RYGAIHSDIP LYRKVGQMGL TSLTNLISGV QVSDSQSGFR AFSAHAIAVM SFTANGGFSI
ESEMQFHIHE QALRICEVPI HVLYVEKAKR NPIGHGMQVV KGILGIATTM RPLLFWCGSG
FATLMISTAL LVFLAAHTTM ALSQFAWLLS LLMIGMLLSI GSIGTGIILQ RQRVMLQRME
TSLKQQLMRA PSAASNETLF LTPRERVYDG VNQPLN
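
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. As a minimal sketch of that relationship, the Python below translates the first 60 bases of the gene sequence. It relies on the fact that table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons. The helper names translate and gc_percent are hypothetical, and the 10-base spacing of the listing is stripped before processing.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G at each position).
# Table 11 uses the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group the listing."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a DNA sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq):
    """Percentage of G and C bases in a DNA sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence above.
first_line = "ATGGACTGTC ACAATGCAAT TGTTCATCCC ACCGTCGCAC TTATTCCAGC TTTTAATGAA"
print(translate(first_line))
```

The output matches the first 20 residues of the protein sequence (MDCHNAIVHP TVALIPAFNE). Passing the full gene sequence to gc_percent is one way to check the GC content reported in the gene information.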