Gene Haur_3622 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3622 
Symbol 
ID5735483 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4552399 
End bp4554483 
Gene Length2085 bp 
Protein Length694 aa 
Translation table11 
GC content55% 
IMG OID641280771 
Productglycosyl transferase family protein 
Protein accessionYP_001546386 
Protein GI159900139 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGCAT CAACCACGAC GAGCAATCTG CGCCGCCCAT TGGCTTGGCG GATTGCCCAA 
TTAACCCGTA GCCAAGCGTT GGTCCTGCTG AGTATCATTT TGCTATGTGC TGGCTGGCTG
CGTTTGCAGA ATTTCGCTGC TGTAGCTGAA GGCAATACCT ACTACACCGC CGCCACCGTT
GCCATGACCC AATCGTGGCA TAACTTCTTC TTTGCAGCAG CCGAGCCTGG TGGCTCAGTC
ACAATTGACA AGCCAGCGCT AGGGTTATGG ATTCAGGCGA TTTTTGGCAA GTTGTTTGGG
GTTAGTGGCA CAGTGGTGGT GCTGCCGCAA GTTTTGGCTG GGCTAGCCAC AATTGGCCTG
CTATATTGGA TTGTGGCGCG GCGCTGGGGG CGGTCGGCAG GCTTATTGGC GGCAGCGATT
CAGGCGATTA GCCCAATTAG CATCGCCGTC GAACGTACTA ACAATCTTGA TGCGTTGTTG
ATCGTGACTT TGGTTGGGGC GATGGCCTTG TTTTTGGTGG CAACCGAACG TGCCAGCACT
AAATATTTGT TATTGGCAGG TGCAGTGGTT GGCCTCGGCT TTAATATCAA AATGCTACAG
GCCTTTTTGC CCTTGCCAGC CTTTTATGCG ATGTATTTTT TTGCTGCCAA AACTGGCTGG
TGGCGTAAGC TTTGGCAACT TGGACTGACC ACACCTGTGC TGTTGGCGGT CAGCTTTTCG
TGGGCAATCG CGGTTGATCT GGTTCCAGCC AGCGAACGGC CATACATCGG TAGTAGCGAT
ACCAACTCGG TTGTCAACTT GATTTTGGGC TACAACGGGG TTGAGCGCTT GACTGGACGC
GAAGGCCAAG CCATGGGCGG CGCAATTCCA ACAACTGATG ATCGCCAGCG ACCCAATGCC
ACCGATGGTA ATGCTCAAAT GCCTGCAATG CCCAATGGCA ACGCACAAAC TCCGCCCAAC
GGTATGACCG ATGAAGGCAT GCGCGGCCAA TTTGGCGGCC AAACTGGCCG CACTGGTGGC
GGTGGCCCAG GCATGGATAG CGGCGAATCG GGTTTATTCC GAATGTTCAG TTCACCGATG
AACACTAACA TTGGTTGGTT GCTAGGAGCG GCACTCTTCG CCGTGGTTGG CTTGGCTGCC
CACTACATCA AGCAACGCCG CTGGCCCGAT GCCGATGTTT GGGGCTGGGC TGGTTGGCTG
GTTACTGCCT TTGTCGTGCT GAGTTTTGCT GGTTTTTCGC ATGGCTACTA TAGCGCTACG
ATTGCTCCGG CGATTGCCGG CACACTGGCA ATTGGCATCA CGGTTTGGCG ACGCAGCGCC
AGCAAGATCG TTGGTTTATG GCTCATTGGT TTGGTAGCAG CGGCCTTGGT AGTGCAAGTG
ATTGCTGCGC AGCCCAGCGT CAGCGGCTGG CTGATTCCCT CAGTGGCACT TGGGTTGGCG
CTGGTGGCGG CTGGCTTGGC ATTCCGAGCT TCGTGGCGGG TGGCAGCAAC GGCGGTTGGC
ATCGCCGCGA TTCTGCTCAT CCCTAGTGAA TGGGCCTACA AAACGTCGGC GATGGAGCAA
ATGAATACAA CCTTGCCCAG CGCCGCTGCC CCAACCGATA ATGCCACTGG CTTTGCGGCG
GGCTTTGCTG GCAATCGTAA CCGCTCGGAT AGCAGCAGCC CCAGCGCTTT GGCAACCTAT
CTACAAGAGC GTACTAGCGA TACTTACTAT ATGCTCGCCG TGCCAAGCTC GATGATGGGT
TCGTCGTTGG TGATTGAAAC TGGGCGGCCT GTTTTGATGA TGGGCGGTTT CTCCGGCAGT
GATCCGGTGA TTGATGCCGC TGATATTGCC CAGTTGGTAG CCGACGGCAA ATTGCGCTAC
ATCATGACTG GCGGTATGGG CGGCGCTGGC CGTGGTGGCA GTTCAACGGT GCAAACTTGG
GTCCAGCAAA ACTGTACGGC AGTCACTGAT GCACCGAACA GTCAAGCAGG CTTTGATTTG
CCAAATGGCC AAATGCCAAA TGCTCAAGCA GCCCCCACCA ATGGCAACAC TGGTGGCGCA
CAATTTGCTC AAAACAATTC ATCATTGTAT CGTTGTGGCG AATAA
 
Protein sequence
MSASTTTSNL RRPLAWRIAQ LTRSQALVLL SIILLCAGWL RLQNFAAVAE GNTYYTAATV 
AMTQSWHNFF FAAAEPGGSV TIDKPALGLW IQAIFGKLFG VSGTVVVLPQ VLAGLATIGL
LYWIVARRWG RSAGLLAAAI QAISPISIAV ERTNNLDALL IVTLVGAMAL FLVATERAST
KYLLLAGAVV GLGFNIKMLQ AFLPLPAFYA MYFFAAKTGW WRKLWQLGLT TPVLLAVSFS
WAIAVDLVPA SERPYIGSSD TNSVVNLILG YNGVERLTGR EGQAMGGAIP TTDDRQRPNA
TDGNAQMPAM PNGNAQTPPN GMTDEGMRGQ FGGQTGRTGG GGPGMDSGES GLFRMFSSPM
NTNIGWLLGA ALFAVVGLAA HYIKQRRWPD ADVWGWAGWL VTAFVVLSFA GFSHGYYSAT
IAPAIAGTLA IGITVWRRSA SKIVGLWLIG LVAAALVVQV IAAQPSVSGW LIPSVALGLA
LVAAGLAFRA SWRVAATAVG IAAILLIPSE WAYKTSAMEQ MNTTLPSAAA PTDNATGFAA
GFAGNRNRSD SSSPSALATY LQERTSDTYY MLAVPSSMMG SSLVIETGRP VLMMGGFSGS
DPVIDAADIA QLVADGKLRY IMTGGMGGAG RGGSSTVQTW VQQNCTAVTD APNSQAGFDL
PNGQMPNAQA APTNGNTGGA QFAQNNSSLY RCGE