Gene Haur_1761 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1761 
Symbol 
ID5733649 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp2048152 
End bp2049324 
Gene Length1173 bp 
Protein Length390 aa 
Translation table11 
GC content53% 
IMG OID641278904 
Productglycosyl transferase group 1 
Protein accessionYP_001544532 
Protein GI159898285 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAAGTCT TATACATCGC CAGCGGCATC CGCGTGCCTG GTGCATTTGG TGGCGCAATT 
CATACAACCG AAGTTGCCCA AGGCTTGGCT CAACTGGGCG TGGAAATGCA TGTGATAACA
CGACCAGCCC AAGGCCAACG CCGCAAGCCA TGGCAACTGC CCAAGCGCCA AACCGGAGCA
ATTACTTGGT ACGAAGCTGA TTTGCCAAAG CCTTTGAGTT TGCTTGGCTA TTCAGCAATT
GCACGGTTGG TACGCGAATT GCGGCCCGAT GCGGTAATGG AACGCTACTA TAATTTTGCT
GGAGCAGGCA TCTTGGCTGC GGCTCGCCAA GGCATTCCGA CGTTGTTAGA AGTCAATGCC
TTGATCGTTG ATCCGCCCCA AGTACGCAAG CGCCAACTAG ATGATAGCTT GGCGTGGCTA
TTGCCTGGCA AGCATGGCCC GATGCGGCGC TGGGCCGCTT GGCAATGCCG GCATAGCACC
AAAATTGTCA CGCCCTTGCA CACAACCGTG CCACCCGAAA TTGAATGCAG TCGAATCGTC
GAATTACCTT GGGGCGCGAA TGTTCAGGCA TTTAGCCCAC AAACTCAAGC ACCAATCCAA
CCAGTGTTTG TCTTTCTTGG TTCGTTTCGC CATTGGCATG GGGTGACCGA TTTTATTCGC
GCAGCGATTC GTTTGATTCA GCAAGGCAGC CCGGCCCGAT TTTTATTAAT TGGTAGTGGC
CCAGAACAAG CTGAAGCTCA ACGGTTGGCA GCACCGTATG CCGAACGTTT TGAATGGGCA
GGGGCAGTAG CTCACGAACG CGTACCAGCC TTGTTAGCGC AGGCTAGCGT TGGGGTTGCA
CCGTTCAATC CAGCTCGTCA CCCAGCTTTA CAAGCGGCAG GCTTCTTTTG GTCGCCACTG
AAAATTTACG AATACATGGC GGCTGGCTTG CCCGTAGTAA CAGCTAATAT TCCGCCGCTC
GATACAATTA TTCGGCCACA GCAAGAAGGT GGGTTATTTG AAGCTGGCAA TATCAACGAC
CTTGCTCGCG TCATGCAGGC AGTTGCCAAC GACCCGCAAC GCCAACAATG GGGCTTGAAT
GCTCGCCAAC GCGTGGTCGA GTATTATTCG TGGGAGCGCC ATTGCCAAGC CTTATATCAA
TTATTGCAAA CCATGATTAA GGAGCAGCCA TGA
 
Protein sequence
MKVLYIASGI RVPGAFGGAI HTTEVAQGLA QLGVEMHVIT RPAQGQRRKP WQLPKRQTGA 
ITWYEADLPK PLSLLGYSAI ARLVRELRPD AVMERYYNFA GAGILAAARQ GIPTLLEVNA
LIVDPPQVRK RQLDDSLAWL LPGKHGPMRR WAAWQCRHST KIVTPLHTTV PPEIECSRIV
ELPWGANVQA FSPQTQAPIQ PVFVFLGSFR HWHGVTDFIR AAIRLIQQGS PARFLLIGSG
PEQAEAQRLA APYAERFEWA GAVAHERVPA LLAQASVGVA PFNPARHPAL QAAGFFWSPL
KIYEYMAAGL PVVTANIPPL DTIIRPQQEG GLFEAGNIND LARVMQAVAN DPQRQQWGLN
ARQRVVEYYS WERHCQALYQ LLQTMIKEQP