Gene Haur_3630 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3630 
Symbol 
ID5735491 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4562845 
End bp4564887 
Gene Length2043 bp 
Protein Length680 aa 
Translation table11 
GC content53% 
IMG OID641280779 
Productglycosyl transferase family protein 
Protein accessionYP_001546394 
Protein GI159900147 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.253169 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATGAAC GTTTTCGTTG GTGGCCGACC ATTCGGCGCT ATCGACTGAT ATTAATGCTT 
TTGCTGGTTG GGGCGGCAGC GGTTCGCTTA GGGCTATGGC TATTGCCAAG CCACCAACCA
GCCAATGATG AAGTTGAATA CCTCGCGGTT GCCCGCGATT TGTTGGCAGG CCAAGGTTGG
CGTTTTTATG ATGCGTACCC ATGGCTACGT GCTCCGTTGT ACCCGCTCTA TCTGGCGGCA
ACCCTCTGGT TAAGCAACAA CGATCCCCAA CATGCCTTGC TCTTTAATAT CGGTTTGAGC
CTGATTCACC TGTGGTTGCT GTGGTTACTT GGGCGCGATT GGGCTGGCGA TCATCCTCAT
GCTGAAAAAG TTGGCCTGTG GACAGCTGGC TTGGCTAGTG GTTTGTTGAC CTTCGCAACT
TTTGCCAATT TATGGATGAG CGAAACGCTC TGGAATGTGC TTTGGGCTAG CAGTTTGCTG
CTGATCCTGC GTTGGCAGCG GACGAAACAA CTACGTTTTG CCTTGGCAGC AGGCTTAACG
ATTGGCCTGA CGATTCTGAC GCGCTCGTTG CCCTTAGCAT TTGTGCCAGT ATTAATTGGC
TGGATGTGGT GGAATTGGCG AGGTTGGCGG AGTGTGCAGC ATGGCTTGGC CTTGGGCTTA
GTCGCCTGTG CGGTGGTTGC ACCATGGTCG TTACGCAATT GGCTGGCTTA TGGCCGTTTG
ATTACGGTTG AAACTGGCTT TGCCTATAAT CTATGGGCAT TTAGTGAACC AGCACTTGAG
CTAAGCGAGA TTAATACTAT TCTCGCGGCG ATACCTAATC CAGCGGAGCG AGCTGATTAT
GCCTCAGCCA AAGGCCGCGA GTTGTTAGCC GCCGATCCGA GCATTTTGGC GCGTAAGCCT
TGGACGAATA CGGTGTATCT CTGGCGGATC AAGCCGATTG AGGATCGTTT TATTCAGGCC
AATTATTATA GTGATGTGGC TTTACCATAT TTAATTGCTG CCTTGATTTT CGATGATATG
TGGTATGTGC TGTGTGTCGT GTTGGCACTA TGGGGCTTTG CGCGTGCTCC GCGTGATGCC
CGCTGGTGGC TGGCCTTGCT CTGGGTGGGT TATATTGTTG GTAGCACGAT GTTCACCCAC
GGCGAGGCCC GTTATCGCCA GTTTTTCTGG CCAATCATGC TGATGTATGC GGCGTTTAGT
TTGGCTAAAT TACGGGTCAG CACGCCAATT TGGCAACGCT TGGGAGCAAG CGCACTCGGC
CTAATCATTG GCTGGGTGAT TATTGATCAT TCGCCTTGGC AATTGCTGCA ACAGCAAATA
AGCCGTGGTT GGTGGCGCTG GCAAGCCGAC CAAGCCCTAG CGGCTGGCGA TTTGGCCCAA
GCTGAGGCAG CAAGTTTACG CGCACTCAAT CGTTTGCCTT CTGCTGATGG CTGGTTGGCC
TTGGCCAACA TTAAAGAGCA ATTTGGTCAA GCTGATGCTG CACTCGAAGC CTATATTGCT
GCGGCCAATC ACAACCGCGA TTATCCCTTG GCTAGCTATC GCTTAGGCAA TTATTTACTC
AAACGCGGGG ATCTGGCAGC GGCGCGTGAG GCCTGGGCTA ACCCCTATGT TGATCGCACC
AGCCTGCTGG ATTGGGCTTG GCGGTCAGCC GATCGCTCAG CCGAAACACA GATCGATCTT
GGGGCTGGCT TGGATATTGG CTTGATCGAG GGATTTTATC CGGCTGAAAA CCTGAATGCG
AGCACCGCCC GTTGGTCAAC CGCTCATGCT CGTTTGCGCC TGCCTGCGGG CGAGGCTGGC
GTTGTCCGTT TGCGGATTGC CAATCCACGG CCAGGCGATG CGCCAGCGGC AAACCTGCAA
ATCTGTGTTA GCGAGCAGCA CTGTATCCAT GCCGAACTTG GGGCTGAGTG GCGGGTGCTG
CATCTGCCAC TTGCTGAGAG TTCAAACGAA CGTCAGATTG AACTATTGGC TCAGCCATGG
CAAGCCCCAA GCGACCAACG TCAGCTGGGC ATTGTGGTGG ATTGGGTTGA ATTAGCGAGG
TAA
 
Protein sequence
MDERFRWWPT IRRYRLILML LLVGAAAVRL GLWLLPSHQP ANDEVEYLAV ARDLLAGQGW 
RFYDAYPWLR APLYPLYLAA TLWLSNNDPQ HALLFNIGLS LIHLWLLWLL GRDWAGDHPH
AEKVGLWTAG LASGLLTFAT FANLWMSETL WNVLWASSLL LILRWQRTKQ LRFALAAGLT
IGLTILTRSL PLAFVPVLIG WMWWNWRGWR SVQHGLALGL VACAVVAPWS LRNWLAYGRL
ITVETGFAYN LWAFSEPALE LSEINTILAA IPNPAERADY ASAKGRELLA ADPSILARKP
WTNTVYLWRI KPIEDRFIQA NYYSDVALPY LIAALIFDDM WYVLCVVLAL WGFARAPRDA
RWWLALLWVG YIVGSTMFTH GEARYRQFFW PIMLMYAAFS LAKLRVSTPI WQRLGASALG
LIIGWVIIDH SPWQLLQQQI SRGWWRWQAD QALAAGDLAQ AEAASLRALN RLPSADGWLA
LANIKEQFGQ ADAALEAYIA AANHNRDYPL ASYRLGNYLL KRGDLAAARE AWANPYVDRT
SLLDWAWRSA DRSAETQIDL GAGLDIGLIE GFYPAENLNA STARWSTAHA RLRLPAGEAG
VVRLRIANPR PGDAPAANLQ ICVSEQHCIH AELGAEWRVL HLPLAESSNE RQIELLAQPW
QAPSDQRQLG IVVDWVELAR