Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_5029 |
Symbol | |
ID | 5736988 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009973 |
Strand | + |
Start bp | 38668 |
End bp | 39894 |
Gene Length | 1227 bp |
Protein Length | 408 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641282196 |
Product | glycosyl transferase family protein |
Protein accession | YP_001547787 |
Protein GI | 159901541 |
COG category | [C] Energy production and conversion [G] Carbohydrate transport and metabolism |
COG ID | [COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase |
TIGRFAM ID | [TIGR01426] glycosyltransferase, MGT family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.153009 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCACCA TTATCTATCT GCTTCCACCA GCCCACGGTC ATGTAAATCC TACCCTGCCA GTCATCCAAG AATTAGTGAC CCGTGGTGAA ACCATCATTT GCTACAACAC GGCGGAGTTT CGTGTGCAGA TTGAACAGAC TGGTGCCCAC TTTCGGGCCT ATCCTCCCAT GGAGATGACC CCGGTCGCGC TCTCGAGACT CCTCCAGGAC GGCAATCTCG CCAGGATAAC GGGGTTAATC CTCCGCACCA CTGAACACCT GTTGCCCTTC TTGCTTGATG CGTTTGCGCA TGAAAAACCT GATCTGATCG TCTTTGATTC GATTGCGCTC TGGGGGAAAA TGGCAGCAAC CATCTTAGGG GTGCATGCCG TAGCGTCGAT TAGTCATTTC GTCATGGATG AACATCAGTT ACCATTTCTC GATATCGTGC GCCTGTTGGG CCAGGTACTC CCCCAGATGC CAGCGATCCT TTTCGCGCGT CGTCGCCTGA TGAATACCTA TGGAACCGCG TATCCCTCAG CCCGTCCCTT GTTTCCTATG CGCGGTGACT TAAACATTGT CTTTACGTCA CAGGAATTAC AGCCCTCCAT CCCATTAATT GATGCGACAT TCCGGTTTGT TGGTCCTGCG ATCAATCCAC AGACACGCAG CGGTACGTCC ATGGCCGATG AACTCGGGCA GGAGAAGGTA ATCTATATTT CCTTGGGAAC GATTCACACA CCACAATCAT CGTTCGTGCG GACGTGTTTT GCAGCCTTTG CTGACTATCC AACCCGCGTC ATCATGTCCG TGGGATCCCA AGTGGCTAGC AGTGCGATTG GTTCAATCCC CGCAAACTTC ATCGTGCGGC CATCCGTCCC GCAACTTGAT GTCCTTCAGC AGACGGCGGT TTTTATTACG CATGGTGGTA TGAATAGTAT CCATGAAGGG TTATACTATG GTGTTCCCCT CATCCTTATC CCGCACCAAG TCGAGCAATT GCTCAATGCG CGGATCGTGA CAGCCCGCGG GGCAGGATAC CTTCTTACGC ATCAGCTCAC TCATACGCAG ATCACCGTAC CCATCCTCCG TCAAGCCGTA GACACTGTGA TGGCTGATCC GCACTATCGC GTAGCGGCAC AGTCTCTCCA GGGTTCCTTG CGTGCAACGG GTGGCTATTA TCAGGCAGCA GATGCCATTC AGTCCTACAT CAGTGAATCA AGAATGATAG TCGTAACGCC TGTATAG
|
Protein sequence | MSTIIYLLPP AHGHVNPTLP VIQELVTRGE TIICYNTAEF RVQIEQTGAH FRAYPPMEMT PVALSRLLQD GNLARITGLI LRTTEHLLPF LLDAFAHEKP DLIVFDSIAL WGKMAATILG VHAVASISHF VMDEHQLPFL DIVRLLGQVL PQMPAILFAR RRLMNTYGTA YPSARPLFPM RGDLNIVFTS QELQPSIPLI DATFRFVGPA INPQTRSGTS MADELGQEKV IYISLGTIHT PQSSFVRTCF AAFADYPTRV IMSVGSQVAS SAIGSIPANF IVRPSVPQLD VLQQTAVFIT HGGMNSIHEG LYYGVPLILI PHQVEQLLNA RIVTARGAGY LLTHQLTHTQ ITVPILRQAV DTVMADPHYR VAAQSLQGSL RATGGYYQAA DAIQSYISES RMIVVTPV
|
| |