Gene Haur_1530 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1530 
Symbol 
ID5733417 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp1784091 
End bp1785272 
Gene Length1182 bp 
Protein Length393 aa 
Translation table11 
GC content49% 
IMG OID641278670 
Productglycosyl transferase group 1 
Protein accessionYP_001544302 
Protein GI159898055 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTATACT CGCTGGATAA TCGTTTGTTT GCAGGTTTAG AAATAGAGTT TATGCATTTT 
ACAATTAATG GTCATTTGCT CTCATTTGAT AGCTCTTTTC GGCAGGCTGG GGTTTCTAAT
CACACCCGTT TTTTGATTGA AAACTTAGCC AAGCTCGACC ACGATAACCA ATACACCCTA
TTTGTAGGGC CAAATGTGCG CCAACACCTC AATTTGCCAG CCAACTGGGA GATTGTCGAA
TCGCGCTTGC CCACAATTCA GCCCAAATAT CGCATTCCTT GGGAGCAATT GATTGCACCC
TGGTTATTAG CCAAACGCCG AGTCAATTTG TTTCATGGCT TGTTGAATAT CTCGCCGTTG
CTTTCGCCAG TACCAACCAT CGTCACGATT CATGATTTGG CATTTATGGA TGTGACTGGT
TCGCATCGCA AGGCCAATCG GCGTTATTTG GCGGCAGCAA CTCGCCAAGG TGTGCGTCAA
GCTGCCCATC TATTTGCGGT TTCTGAGTAT ACCAAGGCCG CGATGGTCGA TCGGCTTGGG
CTTGATCCCG CTAAAATTAG CATTGCCTAT AATGCGGCTG GAGCGCAGTA TCACCCGCGT
TCGACCGCTG AAATTCATGC TTGGAAGCAG CAAAAGCAAC TGCCTGAGCA ATTTCTACTG
TATCTTGGCA CCTTAGAGCC ACGCAAAAAT ATTCCCAATT TGCTGCGAGC TTATGCCAAA
GTTAAACACG AAATTGGCAT GCCATTGTTA ATTGGTGGTG GCAAAGGCTG GAATTTTGAC
GAAATTTTCA GTACTTACGA GCAATTACAA TTGCACGATA GTGTCAGTTT CTTGGGCTAT
GTGCCAGGCG AGGAATTACC GTTGTGGTAT AACGCCGCGA CAGCCTTTAT CTATCCATCG
CGCTACGAAG GCTTTGGGAT TCCGCCGCTT GAGGCGATGG CTTCGGGCAC ACCCGTGCTG
ACTACCAATG CCACCAGCAT TCCCGAAGTC GTGGGCGATG CGGCGATTCA AGTTGACCCC
GATAATCTTG AGCAGATGGC CCAAGAATTA GTGCGGATTG CCAACGATGC CAGTTTGCGC
GACGATCTGC GTGAACGCGG TTTGCTGCGT GCCCAAGCCT TCTCGTGGGA GAATTTGGCC
AAAGCCACGC TTGAGGTTTA TCGCAAGGTT GGGGGCGAAT AG
 
Protein sequence
MLYSLDNRLF AGLEIEFMHF TINGHLLSFD SSFRQAGVSN HTRFLIENLA KLDHDNQYTL 
FVGPNVRQHL NLPANWEIVE SRLPTIQPKY RIPWEQLIAP WLLAKRRVNL FHGLLNISPL
LSPVPTIVTI HDLAFMDVTG SHRKANRRYL AAATRQGVRQ AAHLFAVSEY TKAAMVDRLG
LDPAKISIAY NAAGAQYHPR STAEIHAWKQ QKQLPEQFLL YLGTLEPRKN IPNLLRAYAK
VKHEIGMPLL IGGGKGWNFD EIFSTYEQLQ LHDSVSFLGY VPGEELPLWY NAATAFIYPS
RYEGFGIPPL EAMASGTPVL TTNATSIPEV VGDAAIQVDP DNLEQMAQEL VRIANDASLR
DDLRERGLLR AQAFSWENLA KATLEVYRKV GGE