Gene Haur_2166 details


Gene Information

Locus tag: Haur_2166
Symbol:
ID: 5734053
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 2732635
End bp: 2735121
Gene Length: 2487 bp
Protein Length: 828 aa
Translation table: 11
GC content: 50%
IMG OID: 641279307
Product: glycosyl transferase group 1
Protein accession: YP_001544934
Protein GI: 159898687
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
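
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only and do not come from the database.

```python
# Values copied from the Gene Information table.
start_bp = 2732635
end_bp = 2735121

# Assumption: coordinates are 1-based and inclusive on the replicon.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, codon_count, protein_length)
```

Under these assumptions the computed gene length and protein length agree with the 2487 bp and 828 aa reported in the table.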


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTAAAC GTCAGTTAAA TGTCGTTTTC TGGAGTGGTT GTGGTGGCGA GACCCAGCGT 
TATCGCTGTC AGCATGCAAT TGAGCAATTA CAGTATCGTG GGCATAAAGC CCAATTGTTT
AATCAGATTG ATCAAGCGGC GATTGTAGCG GTTGCTGCTG CCGATTTGGT GGTTGTGCAT
CGTCCTAAAG AAACTCACTT TTGGGAAACG ATTCAGCAAG CAGCACAAGG CAAACCCGTG
GTCTATGAAA CCGACGACTT GCTGTTTGAC CCAGCCTTGA TCGATTCGAT GCCAATTGTG
GCTGAGAGCA CGGGCTTTGA ACAGCAATTT TGGCGTGGCT ATGCCCGGGG CAATCCGCCA
GTGTTTGCTC GTTGTGATGC AGCAATTGTC AGCACTACGC CCTTGGCTCA GGCGGCTGAA
GCATTGCAAA AACCGGTTTG GGCGCATCGT AATGTGTTGG GCGACGATTG GATTGCATGG
TGTGAGGCGG CCTATCGCGA GCGCCAAACT CAAGCCCATG TAACAATTGG CTATTTTAGT
GGCACCTTTT CGCACGATGC CGACCTGCGT TTGATTGCCC CAGCTTTGCT AAAACTGTTG
CAACAACAGC CCAAACTACG CTTAATGCTG GGTGGCAAAA TCACGGTGCC TGATATTTTA
GCCCCGGTTG CCAACCAAAT TGAGCAATTG CCGTTTGTGC CGCTTGAGCA ATTGCCGCAA
CTCATGTCCA AAGCCGATAT TATTTTGGCT CCCTTGGATG TGGATAATGC TTTTACTCGC
TGCCGTAGCG AATTGAAGTA CCTCGAAGCC GCCGCCTTGC GCTTGCCCGT GGTGGCTAGC
CCGATTCCGG CCTTTGCCGA GGCAATTCGG CATGGAGAAA CCGGCTTTTT AGCTACTTCC
GAAGCCGAAT GGTATAGCCA ATTAAGCAAT TTGCTGGCCG ATGCCACGTT ACGTCAGCGG
GTTGGGCAAG CTGCCTATAC CCATGTGCTT GGTCATTACA CAATTGCAAC CGCAGCGGCT
GATTATGAAG CGATGCTGTT AGCCATCTTG CAGCAGTTTC CGACCAAACC TGCTCAGCCG
GCATTGCAAC CCCTACTCAG CCAATTTCAG CGTGATTTGA CCTTCCACGA TCGCTCAGTC
CATATGATCA CAGGCTGTGA TATTGGCAAC GCAGGCAATT ACCGCTGCCG CCATCGTCAA
GAGCAACTCG ATTGGTTTGA TATGTATAGC GGCGTAACGA GCCTTTACAA TGAGCCATTT
AAAATTGCCG ATAGCATTAA GTTTGGGATC TTGATTCTCC ATCGGGTTGC GCTTGATTCG
AATATTGCCA CGTTGATTGA TGCGCATCAA GCCTTGGGCC ATCCGGTTAT TTTTGATACC
GATGATTTGG TGTTTCGTAC CGATTTGCTG CATCATATCG ATGCAATTAA AGATTGGCCA
GCTGACGAAG TGGCCCTCTA TCGCGATGGA GTCGAGCGTT ACCTCAAAAC CATGCTGCTG
TGCGATGCAG TGATTGTTTC GACTGAGCCG TTGGCAGAGC AGGTACGAGC ATTCGACCTG
AATGCCTATG TGGTGCGCAA TGCCTTGAGC CAAAACCAAA TCAGCTATGC CGAGCCAATT
GCTGCTCAGC GCCAAGCTAA GCCACTGGCC CAACCGCATG ATCCGGTGTT GATTGGCTAC
TTTAGCGGTA CAGCTACCCA TAATCGCGAT TTTATGCAAG CCGAGCAAGC AATTTTGCAT
ATTTTGGCAA CCTATACCCA TGTGCGCTTG CGCATTGTGG GGCCATTGCA ATTATCGAAG
GCCTTTGATC CATATATTGA TCGGATTGAG CGCCGCGAAC TTGTGCCGCT CGAACAACTG
GCCGACGAAA TTGCTGCTGT TGATTTTGCG CTTGCTCCCT TGGAGCTTGA TAATCCATTT
TGCCAATCCA AGAGCGAAGT TAAATATATG GAAGCTGCTT TGGTTGGTGT GCCCTTGATT
GCAACCCCGA TTGAAGCTTT TCGTTATGCA ATTACCCATG GCATCAACGG TATGTTGGCG
GCAAATGAGC AAGAATGGAT TGAGGCACTT GAAGCTTTGG TAACTGATCC ATCATTGCGC
CAACGCCTAG GCCATGAGGC CTTGGCTGAT GCCCATGCCC GCTATAGCCC AAAAGCCCGT
AGCCGCGAGT TGCACAATGT GCTTCAACAA ATTTGGAGCA TGTATACCCG CCAAATGTCC
TTGATCAAGA GCAATGCCAT GTTGTTGGGT GCTAATAAAT CGCTCTCAAA GGGCAATGAA
GTTTTAGTGA TGCATATCGA TGGCTTGATT ACCAGCAACC AAGCGTTGCA TTATCGAGTG
CAGGAGCTTG AGCGCGACAA TGCTCAAGCC AAGGCTTATG CCCATCAATT AGAGATTCAG
CTGCAACAGA TTGCCAATGG TATCTTCATG CAGTTCAGCG GTAAAGCCAA AGGGCTATTA
CACCGTCTTA TCAACCGTAA AGGATAG
 
Protein sequence
MSKRQLNVVF WSGCGGETQR YRCQHAIEQL QYRGHKAQLF NQIDQAAIVA VAAADLVVVH 
RPKETHFWET IQQAAQGKPV VYETDDLLFD PALIDSMPIV AESTGFEQQF WRGYARGNPP
VFARCDAAIV STTPLAQAAE ALQKPVWAHR NVLGDDWIAW CEAAYRERQT QAHVTIGYFS
GTFSHDADLR LIAPALLKLL QQQPKLRLML GGKITVPDIL APVANQIEQL PFVPLEQLPQ
LMSKADIILA PLDVDNAFTR CRSELKYLEA AALRLPVVAS PIPAFAEAIR HGETGFLATS
EAEWYSQLSN LLADATLRQR VGQAAYTHVL GHYTIATAAA DYEAMLLAIL QQFPTKPAQP
ALQPLLSQFQ RDLTFHDRSV HMITGCDIGN AGNYRCRHRQ EQLDWFDMYS GVTSLYNEPF
KIADSIKFGI LILHRVALDS NIATLIDAHQ ALGHPVIFDT DDLVFRTDLL HHIDAIKDWP
ADEVALYRDG VERYLKTMLL CDAVIVSTEP LAEQVRAFDL NAYVVRNALS QNQISYAEPI
AAQRQAKPLA QPHDPVLIGY FSGTATHNRD FMQAEQAILH ILATYTHVRL RIVGPLQLSK
AFDPYIDRIE RRELVPLEQL ADEIAAVDFA LAPLELDNPF CQSKSEVKYM EAALVGVPLI
ATPIEAFRYA ITHGINGMLA ANEQEWIEAL EALVTDPSLR QRLGHEALAD AHARYSPKAR
SRELHNVLQQ IWSMYTRQMS LIKSNAMLLG ANKSLSKGNE VLVMHIDGLI TSNQALHYRV
QELERDNAQA KAYAHQLEIQ LQQIANGIFM QFSGKAKGLL HRLINRKG
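
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It assumes the sequence is read in frame from its first base, and it uses the amino acid assignments of translation table 11, which are the same as those of the standard genetic code (table 11 differs only in the start codons it permits). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical and exist only for this example. Whitespace is stripped first because the record prints sequences in space-separated blocks of ten.

```python
# Codon table built from the conventional TCAG ordering.
# Assumption: table 11 amino acid assignments equal the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used for display and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence, copied from the record.
first_line = "ATGAGTAAAC GTCAGTTAAA TGTCGTTTTC TGGAGTGGTT GTGGTGGCGA GACCCAGCGT"
print(translate(first_line))  # MSKRQLNVVFWSGCGGETQR
```

The printed residues match the first two blocks of the protein sequence above. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content reported in the Gene Information table.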