Gene Haur_1038 details


Gene Information

Locus tag: Haur_1038
Symbol:
ID: 5732942
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 1184626
End bp: 1185663
Gene Length: 1038 bp
Protein Length: 345 aa
Translation table: 11
GC content: 54%
IMG OID: 641278173
Product: galactose-1-phosphate uridylyltransferase
Protein accession: YP_001543814
Protein GI: 159897567
COG category: [C] Energy production and conversion
COG ID: [COG1085] Galactose-1-phosphate uridylyltransferase
TIGRFAM ID: [TIGR00209] galactose-1-phosphate uridylyltransferase, family 1
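
The coordinate and length fields in this record agree with each other. The short sketch below checks that arithmetic. It assumes the coordinates are 1-based and inclusive, and that the CDS ends in a single untranslated stop codon; neither is stated in the record. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1184626
end_bp = 1185663

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS includes one stop codon that is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")    # 1038 bp
print(protein_length, "aa")  # 345 aa
```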


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATCTCA GTGATACGCC CCATCGCCGC TACAACCCGC TGACTGACGA ATGGGTCTTG 
GTTTCGCCAC ATCGAACTGC CCGCCCATGG CAAGGCCAGG TCGAAAAAAC CATTCCCGAC
CAGCGCCCAG CCTTTGATCC GCAATGCTAC CTTTGCCCTG GAGTAACGCG GGCCAATGGC
GAGATCAACC CAGTTTATGC TAGCACCTTT GTGTTCCCTA ATGATTTTGC CGCCTTGCTG
CCCGATAGCC CGAATGCAGC GCTTGATGAT GGCCTGTTTC AAGCCCATAG CGAACGTGGT
ATTTGTCGGG TGATTTGTTT CTCGCCACGT CACGATTTGA CCTTGGCCGA AATGGAGATT
CCCGATATTC GTTTGGTCAT CGATTTGTGG GCTAGTCAAT TTAGCGAGCT AGCTGCGATC
GATTGGATCA AGCATGTCGA GATTTTTGAA AATCGCGGCG CAGCGATGGG GGCAAGCAAT
CCACACCCGC ATGGCCAAAT TTGGGCCAAC GAAAGCATTC CAACGCTGGT CGCCACCGAA
CAGCGTAGCC AAAAGGCTTA CTTTGCTCAA CACCAACGCC CATTACTGAT CGATTACGTG
GAGCAAGAAT TAGCGCGGGG CGAACGGGTG GTCTATGCCA ACGATTACTG GGCAGCGGTC
GTGCCATTTT GGGCAGTTTG GCCTTATGAA ACCATGCTTT TGCCACGTCG GGCGGTGAGC
ACCTTGGCCG AATTAAGCGA AGCTGAACGT GATGGCCTCG CCGATTTACT CAGCCATACC
CTGATTCGCT ACGATAATTT GTTCCAAACC TCGTTTCCCT ATACGTTTGG CTGGCACAAT
GCCCCCTGCG ATGGCGAACA ATATCCCCAT CATGTGGTGC ATGCCCATAT TTACCCGCCA
TTGTTACGCT CGGCCACGGT GCGCAAATTT ATGGTTGGCT ACGAAATGTT GGCCCAACCA
CAGCGCGACT TAACCGCCGA AACCGCCGCG CAACGGCTAC GCGATTTACC CAGCCTGCAT
TGGACTAAAG CTGAGTAG
 
Protein sequence
MNLSDTPHRR YNPLTDEWVL VSPHRTARPW QGQVEKTIPD QRPAFDPQCY LCPGVTRANG 
EINPVYASTF VFPNDFAALL PDSPNAALDD GLFQAHSERG ICRVICFSPR HDLTLAEMEI
PDIRLVIDLW ASQFSELAAI DWIKHVEIFE NRGAAMGASN PHPHGQIWAN ESIPTLVATE
QRSQKAYFAQ HQRPLLIDYV EQELARGERV VYANDYWAAV VPFWAVWPYE TMLLPRRAVS
TLAELSEAER DGLADLLSHT LIRYDNLFQT SFPYTFGWHN APCDGEQYPH HVVHAHIYPP
LLRSATVRKF MVGYEMLAQP QRDLTAETAA QRLRDLPSLH WTKAE
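
The protein sequence can be cross-checked against the gene sequence by translating codons. The sketch below is a minimal illustration, not the database's own method. It uses the standard codon assignments, which translation table 11 shares for amino acids (table 11 differs only in which codons may act as starts). The names `CODON_TABLE` and `translate` are hypothetical helpers. Only the first and last rows of the gene sequence are translated, to keep the example short.

```python
# Build the codon table in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First and last rows of the gene sequence, copied from the record.
first_row = "ATGAATCTCA GTGATACGCC CCATCGCCGC TACAACCCGC TGACTGACGA ATGGGTCTTG"
last_row = "TGGACTAAAG CTGAGTAG"

print(translate(first_row))  # MNLSDTPHRRYNPLTDEWVL
print(translate(last_row))   # WTKAE
```

Both outputs match the start and end of the protein sequence listed above. The last row is in frame because the 1020 bases before it are a multiple of three.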