Gene Haur_1760 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1760 
Symbol 
ID5733648 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp2046869 
End bp2048155 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content53% 
IMG OID641278903 
Productglycosyl transferase group 1 
Protein accessionYP_001544531 
Protein GI159898284 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACCGC TGCATGTGCT GTTACCAACC GACGTATTTC CACCACGCAC CGGTGGCGCA 
GGCTGGAGTA GTCATGCCTT GGCGTTGGCG TTGCTCGAAC GCGGCCATCA GGTGACGGCA
TTGGTTCCCA AGGCTGGAGT GCGTGGGCTG CATCGGCGGG TTGAAGCAGG TGTTCCGGTG
GTTGAGGTCG GCTATCAACC TGCTCGTTTG CCGTTTGTGG CCAATTGGTC GCGCTTTGAA
CTGTTCTGGC CCCAATTTGC CCAAGCAATT GTCAAGACGA TCGGCAAGCA GCGCGAACAC
GTAATCATTC ATGGACAGCA CGTCCAAGGG ATTGGTGCAG CCGTCTTGGC GGGGCAACAG
CTGAACATTC CGGTTGTGGC AACCGTGCGT GATCATTGGC CAAATCATTA TTTTGGCACA
AATTTACATG GCGATCAATT CCCGCTTGAA GATTTTGATT GGGCGGCGGC GGCCACCGAT
TTGGTTGCCC GCCGCAAACC ATTACTTGGG ATACTTTCGC TCTTGGCCTT ACCCTATGTC
CAAGCGCATA TGCAACGGCG ACGACAATTG TTACAAGCCT GTGATGCGGT TATTTCGCTG
AGTAGCTATA TCACCCAACG GCTTAGCAGC GTAGTTGCAC CCGCCAAATT ATGGCCAATT
CCCAACATGG TCAATGTAGC AGCGATAAGC AAAGTGCTAG CAACGCCTGC GCAAACGACC
ATTAATCAGC CATATATCTT GTTTACTGGC AAATTGGCTC GCAACAAAGG AGCCTATTTA
CTACCCGAAA TTATGGCTAG TTTTCGGGCA GCTGGTGGTG AGGCAACCCT CGTCATCGCT
GGCGGCAAGA ATTCAGAGCT GGTTGCAGCA ATTCAAGCTC AAGGCATTGA GGTGCTAGCC
CTCGATTGGG TTGAGCACGA TGAGGTTTTG CGCTTGATGG CGGGGGCCAA GCTCTTAATT
TTCCCCTCAA CGTGGGGCGA ACCACTCAGT CGGGTTTTGC TTGAAGCTTG TGCTGTGGGC
ATGCCGATTG TGGCAATGGC AACGGGCGGC ACACCGGATC TAATTCAGCA TGGCCTGAAT
GGCTATCTGG CTCGCTCAGC CAAGCAACTA GGTGTATTGG CGGCAGAATT GCTGCACAAC
CCGCAACGAG CCGAGCAATT GGGCCAAGTT GCCTATCAAA CCGCCCAAAC CCGTTTAGCT
AGCACAGTGG TTGCTGAGCA AGTGGAACAA CTCTATTGGA CACTACTTAC CCAACAACCA
CAGCGTGCGC TGACTGGGTA TGATTAA
 
Protein sequence
MKPLHVLLPT DVFPPRTGGA GWSSHALALA LLERGHQVTA LVPKAGVRGL HRRVEAGVPV 
VEVGYQPARL PFVANWSRFE LFWPQFAQAI VKTIGKQREH VIIHGQHVQG IGAAVLAGQQ
LNIPVVATVR DHWPNHYFGT NLHGDQFPLE DFDWAAAATD LVARRKPLLG ILSLLALPYV
QAHMQRRRQL LQACDAVISL SSYITQRLSS VVAPAKLWPI PNMVNVAAIS KVLATPAQTT
INQPYILFTG KLARNKGAYL LPEIMASFRA AGGEATLVIA GGKNSELVAA IQAQGIEVLA
LDWVEHDEVL RLMAGAKLLI FPSTWGEPLS RVLLEACAVG MPIVAMATGG TPDLIQHGLN
GYLARSAKQL GVLAAELLHN PQRAEQLGQV AYQTAQTRLA STVVAEQVEQ LYWTLLTQQP
QRALTGYD