Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4279 |
Symbol | |
ID | 5736138 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 5463593 |
End bp | 5464834 |
Gene Length | 1242 bp |
Protein Length | 413 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641281439 |
Product | glycosyl transferase group 1 |
Protein accession | YP_001547039 |
Protein GI | 159900792 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00024185 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCATGC TCTACTTCAC CACTGCCTAC AACGCTGCCT TGCTTGATCG GGTGCATGAA GAATTTTTGT TGCGCTGGCA GGCGCTTGGC CATGAAACCA GCATTCTTGT GCCCGACTCC AGCCGCGACC GCCAAAGCCG TTGGTTGGTC GAAGATGGTG CAATTCAGGT CTATCGGCCA GCAGTGAGCA TGCAGCGCAG CGATCGGCTA TTGAATAATG TAGGCCGACG CTTGACTGAA TATAGCTATT TTTTCAGTTT GCTGCGCGAT TATTTAAGCT TTTTGCGCCA ACACCCTGAG ATCGAGATTA TTCACGTTGA GTCGGTTTAT CCATTGGGGG CGATTGCCGC CGTGGCTTCG TTGATTGATC GACGGCCATT TGTACCAACC ATCCGTGGTG GCGACTTAAT TGCTGATGAT TCGATCAGCT ATGGTTTTGC TCGCTACAAA CGAGTCCGTG CTTTGCTCAA ATTGACCTTT GCGCGAGCTG CGGCAATTCG TTCGGTTTCA CCAAGTGCCA GTGCGATGGC CGAGCAATTT GGCTGCCCAA CGCAGAAAAT CATCACGATT GGTCGGAATA TTCGCGACGA ATATTTCGAG CGCGATCAAG CGGCCTTTCG GGCAGAAAGT CGAGCTTGGT TGCGCCAAAC CTACCCTGCA ATTGCTGGGC GCAACGTGAT CGTCGCGGCA GGACGTTTAT TGCCAGTCAA AGGCTTTGAT GATTTGATTC AGGCCTTGGT GGGTTTACCA CAGGCTGTAG CACTGATTTG CGGGCCAAAT CGAGTTGACG AAAAACTTGG CGATTATGGC GAATATTTAG GCCAATTGGC CCATCGCCAT AGCGTGGCCG ATCGAGTAAT CTTTACAGGA GGCATTCCCC GTGAGCAAAT GCCGCAGTAT TTTGCTGGAG CCGACGTGCT AGCCGTACCT TCGATTATTG AAGGCGGCAA CCGCACCGTT TTAGAAGCAG CAAGCTTGGG AGTGCCCTTC GTGGCGACTC GCAGCGCAGG CACACCCGAA TTTTTTAGTG CTGCTGCCGG CATTAGCATC GCGCCACATC GGCCTGATCA ACTCTGGGCT GGCTTAGCCA CAATTTTAGC TGAAACCTCG GAACAAGCCC AGGCTCGCAG CCAAACCTGT CAACAAGAAG CCCAACAATT TTATTCACCT CAAGTCGCCC AGCGGCTCGC TCGGCTTTAT ACAGCAATTT TAGCCAAGCA GCCGCTATCA GGGAACTTTT AG
|
Protein sequence | MRMLYFTTAY NAALLDRVHE EFLLRWQALG HETSILVPDS SRDRQSRWLV EDGAIQVYRP AVSMQRSDRL LNNVGRRLTE YSYFFSLLRD YLSFLRQHPE IEIIHVESVY PLGAIAAVAS LIDRRPFVPT IRGGDLIADD SISYGFARYK RVRALLKLTF ARAAAIRSVS PSASAMAEQF GCPTQKIITI GRNIRDEYFE RDQAAFRAES RAWLRQTYPA IAGRNVIVAA GRLLPVKGFD DLIQALVGLP QAVALICGPN RVDEKLGDYG EYLGQLAHRH SVADRVIFTG GIPREQMPQY FAGADVLAVP SIIEGGNRTV LEAASLGVPF VATRSAGTPE FFSAAAGISI APHRPDQLWA GLATILAETS EQAQARSQTC QQEAQQFYSP QVAQRLARLY TAILAKQPLS GNF
|
| |