Gene Haur_1027 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1027 
Symbol 
ID5732931 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp1172859 
End bp1173992 
Gene Length1134 bp 
Protein Length377 aa 
Translation table11 
GC content54% 
IMG OID641278162 
Productglycosyl transferase group 1 
Protein accessionYP_001543803 
Protein GI159897556 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACGGC GACTCTGGTT CGTCACCGAT AGCACAGCCC TTGGGGGAGC CGAGGGCTAT 
TTAGAAACCC TGTTGCTCAA TGCCGACCAA CAGCAATTTG AACTTGGCTT GTTGCTGCCG
CCGCGCCCAG CCACCCAACC CTTGATCGAT CGAGCCAAAG CCCATGGAGC GAGCATTGCA
ACCTTGGATG TGGTGCACGA GCATGGGTTA TCGCTGAAGG CAGTCAATCA AGCGGCACGA
CTTTTTCGCC AATTACAGCC TGATATTGTG CATTTCGTTG TGCCTTCACC GCGCCGCGCC
GCAGAATTGG TGCTTGGGGC GGCCTTGGCG CGAGTGCCAC GCCGAGTTAT CACCTTCCAG
TTGGTCACGC CAATTCCCCG CTTCAATTGG CTTTCCCATC ATCTGCGGCT GCTCAATCGG
CGTTGGCAAT ACGCCACCTT ACACGCTGGC ATCGCGGTTT CGCAAGGCAA TGCTCAATTG
TTATTAGAGC AATTTGGCTT TCCCAAGCGG CGTTTGCATA CCATTTATAA TGCGGTTGAT
AGCCAACGCT GGCAGCCGCA ACCGCGCGAT CCTGCAACTC GTGCAGCGTG GCAAATTCCC
GCCGATGTGC CACTTTTAGG CGTGGTTGGG CGTTTGAGCC GCCAAAAAGG CCACCAAATT
TTATTCGAGG CCTTACCAAC GTTGTGGCAA GCGCAGCCGA ATTTGCATGT CGCATTAATC
GGCGAGGGCG ATTTAGCTGA CGAATTACGT CAAGCTGCCC AACAACTACC CAAGCCAAAT
CAAGTGCATT TTGTCGGCCA GCAAACTAAT ATGCCTGCGG CTTTGGCCGC ACTTGATGTT
TTTGTCTTGC CATCGCTGTA CGAAGGCTTA TCGTTTGCCT TGCTCGAAGC CATGGCCAGT
GGGCAAGCAA TTGTTGCCAG CAGCACCGAT GGTACACGCG AAGCAATCAG CGATGGAATC
CAAGGTCTAT TGGTTGAGCC AGGCCAAAGT GCTGCGCTGG CGCAGGCAAT CGGGCGCATG
CTCAGCGATC AATCATTAAA CCAAGCCTGT CGCCAAGCCG CCCGCCAACG CATTCAACAA
CAATTTGAGT TGCAAACGAT GTTGCAACGC ACGTTTGATT TGTATCGAGC ATAG
 
Protein sequence
MKRRLWFVTD STALGGAEGY LETLLLNADQ QQFELGLLLP PRPATQPLID RAKAHGASIA 
TLDVVHEHGL SLKAVNQAAR LFRQLQPDIV HFVVPSPRRA AELVLGAALA RVPRRVITFQ
LVTPIPRFNW LSHHLRLLNR RWQYATLHAG IAVSQGNAQL LLEQFGFPKR RLHTIYNAVD
SQRWQPQPRD PATRAAWQIP ADVPLLGVVG RLSRQKGHQI LFEALPTLWQ AQPNLHVALI
GEGDLADELR QAAQQLPKPN QVHFVGQQTN MPAALAALDV FVLPSLYEGL SFALLEAMAS
GQAIVASSTD GTREAISDGI QGLLVEPGQS AALAQAIGRM LSDQSLNQAC RQAARQRIQQ
QFELQTMLQR TFDLYRA