Gene Haur_5029 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_5029 
Symbol 
ID5736988 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009973 
Strand
Start bp38668 
End bp39894 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content52% 
IMG OID641282196 
Productglycosyl transferase family protein 
Protein accessionYP_001547787 
Protein GI159901541 
COG category[C] Energy production and conversion
[G] Carbohydrate transport and metabolism 
COG ID[COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase 
TIGRFAM ID[TIGR01426] glycosyltransferase, MGT family 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.153009 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCACCA TTATCTATCT GCTTCCACCA GCCCACGGTC ATGTAAATCC TACCCTGCCA 
GTCATCCAAG AATTAGTGAC CCGTGGTGAA ACCATCATTT GCTACAACAC GGCGGAGTTT
CGTGTGCAGA TTGAACAGAC TGGTGCCCAC TTTCGGGCCT ATCCTCCCAT GGAGATGACC
CCGGTCGCGC TCTCGAGACT CCTCCAGGAC GGCAATCTCG CCAGGATAAC GGGGTTAATC
CTCCGCACCA CTGAACACCT GTTGCCCTTC TTGCTTGATG CGTTTGCGCA TGAAAAACCT
GATCTGATCG TCTTTGATTC GATTGCGCTC TGGGGGAAAA TGGCAGCAAC CATCTTAGGG
GTGCATGCCG TAGCGTCGAT TAGTCATTTC GTCATGGATG AACATCAGTT ACCATTTCTC
GATATCGTGC GCCTGTTGGG CCAGGTACTC CCCCAGATGC CAGCGATCCT TTTCGCGCGT
CGTCGCCTGA TGAATACCTA TGGAACCGCG TATCCCTCAG CCCGTCCCTT GTTTCCTATG
CGCGGTGACT TAAACATTGT CTTTACGTCA CAGGAATTAC AGCCCTCCAT CCCATTAATT
GATGCGACAT TCCGGTTTGT TGGTCCTGCG ATCAATCCAC AGACACGCAG CGGTACGTCC
ATGGCCGATG AACTCGGGCA GGAGAAGGTA ATCTATATTT CCTTGGGAAC GATTCACACA
CCACAATCAT CGTTCGTGCG GACGTGTTTT GCAGCCTTTG CTGACTATCC AACCCGCGTC
ATCATGTCCG TGGGATCCCA AGTGGCTAGC AGTGCGATTG GTTCAATCCC CGCAAACTTC
ATCGTGCGGC CATCCGTCCC GCAACTTGAT GTCCTTCAGC AGACGGCGGT TTTTATTACG
CATGGTGGTA TGAATAGTAT CCATGAAGGG TTATACTATG GTGTTCCCCT CATCCTTATC
CCGCACCAAG TCGAGCAATT GCTCAATGCG CGGATCGTGA CAGCCCGCGG GGCAGGATAC
CTTCTTACGC ATCAGCTCAC TCATACGCAG ATCACCGTAC CCATCCTCCG TCAAGCCGTA
GACACTGTGA TGGCTGATCC GCACTATCGC GTAGCGGCAC AGTCTCTCCA GGGTTCCTTG
CGTGCAACGG GTGGCTATTA TCAGGCAGCA GATGCCATTC AGTCCTACAT CAGTGAATCA
AGAATGATAG TCGTAACGCC TGTATAG
 
Protein sequence
MSTIIYLLPP AHGHVNPTLP VIQELVTRGE TIICYNTAEF RVQIEQTGAH FRAYPPMEMT 
PVALSRLLQD GNLARITGLI LRTTEHLLPF LLDAFAHEKP DLIVFDSIAL WGKMAATILG
VHAVASISHF VMDEHQLPFL DIVRLLGQVL PQMPAILFAR RRLMNTYGTA YPSARPLFPM
RGDLNIVFTS QELQPSIPLI DATFRFVGPA INPQTRSGTS MADELGQEKV IYISLGTIHT
PQSSFVRTCF AAFADYPTRV IMSVGSQVAS SAIGSIPANF IVRPSVPQLD VLQQTAVFIT
HGGMNSIHEG LYYGVPLILI PHQVEQLLNA RIVTARGAGY LLTHQLTHTQ ITVPILRQAV
DTVMADPHYR VAAQSLQGSL RATGGYYQAA DAIQSYISES RMIVVTPV