Gene Haur_4438 details


Gene Information

Locus tag: Haur_4438
Symbol:
ID: 5736289
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 5678504
End bp: 5679649
Gene Length: 1146 bp
Protein Length: 381 aa
Translation table: 11
GC content: 54%
IMG OID: 641281601
Product: glycosyl transferase group 1
Protein accession: YP_001547198
Protein GI: 159900951
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
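
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(5678504, 5679649)
print(length)                           # 1146
print(expected_protein_length(length))  # 381
```

Both results agree with the Gene Length and Protein Length fields of this record.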


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCACATAG CAATTGATTA CAATGCAGCG GTACGTCAGG GCGGCGGAAT TGGCCGCTTT 
GTTCGCGAAA TAACTCAGGT CGCCGCGCAG GCAGCGCCTC AGCATCGCTT CTCGCTGTGG
TATGCTGCGC GTGGGCTTGA CCCAAAAAGT GCTCAGATGC AGGCCTTGCA TGAGCTTCAA
CGGCGTTTGC CCAATATCAA GCCGCGCCCA ATTCCAATTA ATGAGCGCTT GTTGACGATT
CTTTGGCAAC GCTTGCGCAT GCCCTTGCCT GTCGAGACGA TTGTTGGGGC GGTTGATGTG
GTGCATGGCA CTGATTTTGT GTTGCCGCCG ACCAAGGCTA AAACGCTGCT CTCGATTCAC
GATTTTGCCT ATATTATTCA CCCTGAAACT GCGCCACCCG AGTTGCGGCG TTATTTGGGT
GGGGTTGTAC CGCGCAATGT GCGCCGCGCC GACCATATTC ACGTTAATTC GCGGGCAACC
AAAGCCGATA TGGAGCGCTT GCTGGGCACA GCACCCTCTA AATCGACAAT CGTTTATTCG
GGTAGTGGCA GCGATTTTTA TCCTCGGCCT GCGGCGGAAA TTGCCGAAAT GCGCCAACGC
TTGGGCTTGC CCGAACGCTA CCTTTTAAAT GTAGGCACGG TGCAGCCGCG CAAAAATGTT
GAGCGTTTGA TCGAAGCCTT TGGTCAATTG CCCGCTGAGT TGCGCAGCCA GCCCTTGGTG
ATCGGCGGCA AACGGGGTTG GTTGGCCGAG CCAATTTATG CAGCGGTACA ACGCCATGGC
CTTGAGCAAG CAGTCATCTT CTTGGATTTT GTCAGCGACA GCGATTTGCC CAAGCTCTAT
AGCGGCGCGA CCGCCATGGT TTATCCCTCG TTGTATGAAG GATTTGGCGT GCCGATTGTT
GAAGCTCAAG CATGTGGCAC ACCCGTGATT ACCTCAACCA TCTCCAGTTT GCCCGAAATT
GCTGGCAACG CGGCCTTGCT GGTCGATCCA CATGATACAG CGGCACTAAC AGCGGCTTTA
CAAAAAATTT TAACTGAGCC TGATGTTTGC CAAAGCTTGG CTGAAGCAGG CCCACGCCAA
GCCGCTAAAT TTACGTGGGA AGGCACTGGT TTGGGCGTTT TGGGGTTATA CGAATTGCTG
GGGTAA
 
Protein sequence
MHIAIDYNAA VRQGGGIGRF VREITQVAAQ AAPQHRFSLW YAARGLDPKS AQMQALHELQ 
RRLPNIKPRP IPINERLLTI LWQRLRMPLP VETIVGAVDV VHGTDFVLPP TKAKTLLSIH
DFAYIIHPET APPELRRYLG GVVPRNVRRA DHIHVNSRAT KADMERLLGT APSKSTIVYS
GSGSDFYPRP AAEIAEMRQR LGLPERYLLN VGTVQPRKNV ERLIEAFGQL PAELRSQPLV
IGGKRGWLAE PIYAAVQRHG LEQAVIFLDF VSDSDLPKLY SGATAMVYPS LYEGFGVPIV
EAQACGTPVI TSTISSLPEI AGNAALLVDP HDTAALTAAL QKILTEPDVC QSLAEAGPRQ
AAKFTWEGTG LGVLGLYELL G
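
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch of that cleanup together with a GC-fraction calculation. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the sketch assumes the input contains only unambiguous bases and whitespace.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA sequence (whitespace is ignored)."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence from this record: six blocks of ten bases.
first_line = "ATGCACATAG CAATTGATTA CAATGCAGCG GTACGTCAGG GCGGCGGAAT TGGCCGCTTT"
print(len(clean_sequence(first_line)))  # 60
```

Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field, and the cleaned gene and protein lengths can be compared with the Gene Length and Protein Length fields.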