Gene Haur_0636 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_0636 
Symbol 
ID5732534 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp733732 
End bp734901 
Gene Length1170 bp 
Protein Length389 aa 
Translation table11 
GC content51% 
IMG OID641277763 
Productglycosyl transferase group 1 
Protein accessionYP_001543412 
Protein GI159897165 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0521552 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAACCTG CAAGCCTCAG TGTACTAACC CCCACTCCAG TGTATCCAGC CCATGCTGGT 
TCAAAAAACT ATAGTTTGAA TGCCGTACAG CAATTAAGTC ATTATTATAC TGTCGATAGT
TATTGTTTAG CCACCCAACC TGAAGCCGTC GATTGGGGGC CATTGCCGCA ATGGTGTCGT
GATCTACGGG CGTTTACCCC AACCAAGCCA GCGCGAAAAG GCATCGATCC ACCAGCGGTG
CATTTGGAAT TTTCGCAACC CATGTGCGAT TATTTACAAC AACGCTGGAT GCGCAATCTG
CCTGATTTGT TGCAACTTGA AGGCACAACC ATGGCTCAGT ATGCGCCATT TGCCCGCCGT
TTGGGGCTAA AAGCAATTAT CTGTACCATA CATCAAGTTG GGTTTGTGGC ACAATGGCGA
CGTTTACAAC GTGAACACCA TTGGAAACTA CGTGCCCGCC GCTTAGCTGG TTTACTCAGC
TTATGGCTAT ATGAGCAGCG AGCCTTGCGC CAGTGCGATT TGCTGGTTAC CTTGAGCACA
ACCGATCAAC AAACCTTGAA TCGTTGGCAA CCAAAACTCA ACGTTGTCAC TGTGCCAGCA
GGGGTTGATT TAAGCCAATG GCCATTATGT CGCCAGTCCC AAGCACCACA GCAGGTGCTG
TTTGTCGGTA ACTATTTTCA CCCGCCAAAT GTTGAAGGAG CCTTGTGGTT AGCACGGGAG
GTTTGGCCAT TGGTGCAAGC TCAACTGCCT GAAGCACGCT TGATGCTAGC AGGGCGCAGC
CCAACCCCTG AAATTCAACA ACTTGCGAGT GCTACAATTC AAGTGCCTGG CACAATTGAT
GATTTACAGG CCGTGTATCG GCAAAGCCAG GTGGTGGCAG CGCCAATTTT TTGGGGCAGC
GGCGTGCGCA TCAAAATTTT AGAGGGCTTG GCGACAGGCT TGCCCTTAGT CACCACAACG
TTGGCGGCTG AAGGCTTGCC GCTGAAACAC GAAGAGCATG CGCTGTTTGC CGAAACGCCG
CAAACCTTTG CCGCAGCGCT TGTGCGCATT TTGAACTCGC CACGTTTGGC CGAACAACTC
GGCGAAGCAG GCCGGCAATT GATCGCCCAA CAGTATGATT GGCAGGCAAT TGGGCGACAA
TTAGCCCAGC ACTATCAGAA GTTACGCTAA
 
Protein sequence
MQPASLSVLT PTPVYPAHAG SKNYSLNAVQ QLSHYYTVDS YCLATQPEAV DWGPLPQWCR 
DLRAFTPTKP ARKGIDPPAV HLEFSQPMCD YLQQRWMRNL PDLLQLEGTT MAQYAPFARR
LGLKAIICTI HQVGFVAQWR RLQREHHWKL RARRLAGLLS LWLYEQRALR QCDLLVTLST
TDQQTLNRWQ PKLNVVTVPA GVDLSQWPLC RQSQAPQQVL FVGNYFHPPN VEGALWLARE
VWPLVQAQLP EARLMLAGRS PTPEIQQLAS ATIQVPGTID DLQAVYRQSQ VVAAPIFWGS
GVRIKILEGL ATGLPLVTTT LAAEGLPLKH EEHALFAETP QTFAAALVRI LNSPRLAEQL
GEAGRQLIAQ QYDWQAIGRQ LAQHYQKLR