Gene Information |
Locus tag | Haur_4314 |
Symbol | |
ID | 5736173 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 5507373 |
End bp | 5508578 |
Gene Length | 1206 bp |
Protein Length | 401 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641281474 |
Product | glycosyl transferase family protein |
Protein accession | YP_001547074 |
Protein GI | 159900827 |
COG category | [R] General function prediction only |
COG ID | [COG1216] Predicted glycosyltransferases |
TIGRFAM ID | |
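The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions on the replicon and that the CDS length includes the stop codon; both assumptions are consistent with the values listed in this record, but they are not stated by it. The function names (gene_length, protein_length, gc_percent) are illustrative only.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature with 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Amino-acid count for an unspliced CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the final codon is the stop codon


def gc_percent(seq):
    """Percentage of G and C bases; whitespace between 10-base groups is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# Values taken from the record above.
length = gene_length(5507373, 5508578)
print(length)                  # 1206
print(protein_length(length))  # 401
```

Passing the full Gene sequence field to gc_percent would give the value to compare against the listed GC content; the result is not asserted here.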
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | GTGACGATTG ATGCTGCAAT TATTGTTGTA ACCTACAACC ACGCCCGCTA TATTGGCGAT TGCCTCAGCT CGTTGCTGGC GCTTGATCCT GCGCCCAGCG AATTGGTGGT AGTTGACAAT GCCTCGCGTG ATGCAACGGC CAAGATCGTC AAAAATCAAT TTCCTCAAGT GCGCTTTGTG CAAACCGGAG CCAATTTGGG CTTTGCAGGT GGTTGCAATC AAGGCGCTCG ATTAACCTCG GCTGAATATA TTGTGCTGGC TAACCCCGAT TTGATTGTAC AGCCCGATTG GCTTGAGCAG CTTATTGCGC CGTTGGAGCG TTGGCCGAGC GTTGGTGCGG TTGGCGGCAA ATTGTTATAT CGCGATGGCA CGACCATTCA ACATGCAGGC GGAGTTTTGC GCTTGCCATG GGGTTTAGGC CATCATCGTG GGGTTGGCGA GCATGATCAT GGCCAATACA ACGCACTTGA AACTGTCGAT TATGTAACTG GGGCGGCCTT TGCTTGTCGG CGGAGCACTT GGGATGTGCT CGATGGCCTT GATGAGCAAT TTTACCCGGC CTATTACGAA GAAGTTGATT TTTGTACTCG CCTACGGCGT GCTAGCCTTG ATGTGCTATA CACGCCCCGC GCAGTCGCAA CCCATATCGA AGGCTCCAGT GTGGGCCATC GCAGCGCCGT TTATTTGCAA CTGTACCATT TTAATCGGCT ACGCTACCTT TTTAAATACT TCAACAATAC CTGGCTGATG CGTACATGGC TGCCTGCCGA GATGGGCCAT ATTCGGGCTT GTGCCAGCGA TGACGAGATT CAAGCGCTGA AAATCGCCTA CTTGGCCTAT CAATCGGCCT TTTTAAACCA TGAATCCCAG CCCGTCATCA GCGAACTGGA TATTTTTCCC GATGAGACCG CTGATGGTGG CGAGACCGAA TTACAATGGA TTGAACGCCA ACTTCGTGCT AAAGTGAAGG TCGAACCAGC GCCGATTCCA GCACGGCAAC GTTGGTTAGG GGCGATTCGC AATGGCCTGC TACGCTTGGC CACCCGTGAT TTTATTGTGC CGATAGTACA AGCACAAAAT GATACTAACG CCGCCTTGTT AGAATCGATT CAAGCATTGA GCCGCCAACG CCGTGCAGCC GATGCAACCA TCCTACTACA AGGCATGTTA TTGGCCAAAA GTTTGGATCA ACAACCAAAG GCCTAA
Protein sequence | MTIDAAIIVV TYNHARYIGD CLSSLLALDP APSELVVVDN ASRDATAKIV KNQFPQVRFV QTGANLGFAG GCNQGARLTS AEYIVLANPD LIVQPDWLEQ LIAPLERWPS VGAVGGKLLY RDGTTIQHAG GVLRLPWGLG HHRGVGEHDH GQYNALETVD YVTGAAFACR RSTWDVLDGL DEQFYPAYYE EVDFCTRLRR ASLDVLYTPR AVATHIEGSS VGHRSAVYLQ LYHFNRLRYL FKYFNNTWLM RTWLPAEMGH IRACASDDEI QALKIAYLAY QSAFLNHESQ PVISELDIFP DETADGGETE LQWIERQLRA KVKVEPAPIP ARQRWLGAIR NGLLRLATRD FIVPIVQAQN DTNAALLESI QALSRQRRAA DATILLQGML LAKSLDQQPK A
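The gene sequence begins with GTG while the protein sequence begins with M. This is expected under translation table 11, where GTG is one of the alternative initiation codons and is read as methionine at the start of a CDS. The following is a minimal sketch of that translation, not the pipeline that produced this record. The codon table is built from the standard NCBI ordering, the start-codon set is the one listed for table 11, and the function name translate_table11 is hypothetical.

```python
# Codon -> amino acid map in NCBI order (bases T, C, A, G); table 11 uses the
# same amino-acid assignments as the standard code.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: _AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(_BASES)
    for j, b in enumerate(_BASES)
    for k, c in enumerate(_BASES)
}

# Initiation codons listed for NCBI translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_table11(seq):
    """Translate a CDS given 5'->3' on its coding strand; stops at the first stop codon."""
    dna = "".join(seq.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # any valid initiator is read as Met
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence above, followed by its stop codon.
print(translate_table11("GTGACGATTG ATGCTGCAAT TATTGTTGTA TAA"))  # MTIDAAIIVV
```

The sketch assumes the input is already oriented on the coding strand, as the Gene sequence field is here even though the gene lies on the minus strand of the replicon.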