Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4689 |
Symbol | |
ID | 5736536 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 5987165 |
End bp | 5990278 |
Gene Length | 3114 bp |
Protein Length | 1037 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641281853 |
Product | glycosyl transferase family protein |
Protein accession | YP_001547448 |
Protein GI | 159901201 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0744] Membrane carboxypeptidase (penicillin-binding protein) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTCAAA CTCGAAATGT AATCTCACGG CGACAACGCC GCACTACACG ATTTATTCCC TCACGGCTTG GCAATAAGCA AGCTCCACGC CCTATGGGCC GCCGCATTGT GCTGGCCTTT GTGGGCTTGT TGGTGGCTGG GCTCGTGCTC ATGGGCGTGG CTGGGGTGGC CATGGCCGTT ACCTACAACG GTATTGCCGC CAACCTCAAG CCACGGCTTG ATCAAATTCA TACCTATACG GCTTTTCAGC CATCAAAAAT TTACGATCGC AATGGCACGC TGCTATATGA ATTTGTTGGC GAAGGTCGGC GTACACCAGT TAAACTTGAA GAAGTTTCTA AGCATTTGAT TAACGCGACG GTTGCCGCCG AAGATGCCTC GTTCTTCGAA AACTCTGGTG TGAACTATTT CAGTATTGCG CGGGCAACCT ATGCCAACCT TACCCAGCAA AGTGTTGGGG CTGGCGGTGC TTCAACCATT ACCCAGCAGG TTGTGCGCTT GATCGTACTG ACCACCGAAG AGCGTCAAGA TCCCAACGTC TATAGCCGCA AGGTCAAAGA AATTATCTTG GCCCAAGAGT TGAACACGGT TTATAGCAAA AACGAAATTC TCGAACTGTA TCTGAACGAA ATTCCCTATG GCAACTTGTC GTATGGCATT CAGGCCGCCG CCCAGAATTA TTTCGGGGTT GATGCCAAAG ATTTGGATAT TGCTCAATCG TCGTTGCTCG GCGGGATTCC CCAATTGCCC ACGACCTATA ACCCCATGCC GTGGCTCGAC GATAATTTGC TGCTCAAAGG AATTAAATTA CCCAAAGATG TTTGGATTGA TCCGCTCTAC GATTTGAGCA ATGACATCAA AGGCGAGATT GCCCCACCCA AAGGTCGCCA AATCGAAGTG CTGCGCCAAA TGGTCAAAAA CAATTATTTG ACTGAACGCG AAGCGCGAGC AGCAGTGGCT AAAGATTTAC AGTTTGCCAA GCCTGAAGTC AGTTTATTAG CACCACACTT TGTCTTTTAT GTCAAAGATT ATTTGCAACA ACGCTATGGG GCTGAGGTGG TTTCAAATGG TGGTCTCAGC ATCACCACCA CCCTCGATTT GGAAACCCAA AATCTAGCCC AAACCATCGC CTATACTCGT ATTCAAGAAC TTAACGCCGA TAATCGCAAT ATTCACAATG CTGCGGTGGT GGTGATGCAG CCCAACACTG GCCAAATTTT GGGCATGGTT GGCTCGATTG GCTATGATCT TTCCGAAACC ACCACAACCC CCGGCGAAGA GGGCAACGTG CTTGATGGCA AAGTCAATGT CACCACTGCT TTGCGCCAAC CAGGCTCGGC TTTGAAACCA TTCACCTATC TTTCGGGGAT GGAGCAATAT GTGGCGACCG ACGGCGCACG CGGGATCACC CCTGCCAGTG TGTTGTGGGA TGTGCCAACG ATTTTCAACC CACGCGGGGT CAAATACGAA CCACAAAACT TCGATAATCA ATTTCATGGG CCATTACGGG CACGCACTGC AGTTGCCAAC TCGCTGAATA TTCCAGCAGT CAAGGGCTTG AAGGCTGCTG GCATTCCCGA AACCCTTGAT CTATTGCATC GTTTGGGCAT TTCGCCGAAT GTTTTGGCTA ACGACCCAGG CTATTATGGC TTGGCGCTGA CCCTTGGTGG TGGCGAAGTT ACCCCATTGG ATTTGGCGAC AGCCTACAAT ACGGTTGCCA GCGGTGGTCG CTATTTTGCG CCAACCCCAA TTCTCAAAAT TACCGATGCT CGTGGCAAAA CCTTGGAAGA ATTCAAGCCC ACGCCATTGG CCAACCCCGA AAGCGATGCG GTCAGCGATA CTAGCAAGTG TGTGATTCCT GAAGGCGAGG ATTATCAATT GGGTGCGCGA GTTCCCAATG GAACCCAATG TGTTGATGGT CGCTTGAACT ACATCATCAC CAACATGATC AGCGATAACG AAGCACGTCG CCCAATCTTC GGTCTGAATA GCATTTTGAA GCTCTCGCAA CCATCAGCGG TCAAAACTGG GACGACCAAC GACTTCCGCG ATGCATGGGC TTCGGGCTTC ACGCCATTCG TTACCGTTAC AGTCTGGACG GGCAATAACA ATAACGAACA AACCGCCCAA GTTGAAAGTA CCCAAGGCGG TGGCGTGATT TGGGCTCGCA CGATGGAAGC CATTTTTGCC AATGAACAGA TCATGAATCG CTTGGCAGGC TTCTATGGTG GCATCGAAAA TATGCCCCAA AGCTTCGAAA AATCGTATCC CGGGGTCTAT CGCGAGAGCA TTTGTGAAAT TCCCGGGCCA TTCGGTGGTC GCACCGACGA GCTGTTTATT GATGGCTTGG ATGCTGGCGG TAAGTGCGAT CTCTACGAAA AAGTCTCGGT TGTGCGGCTA ACCGTCACCG ATGCTGAGGG CAAAGAAACC ACGACCTACT GCCGCCCAGT CGAAGGTGCT GAGTATCCCG AAGGCGCAAT TTCATCAATT TATGTTTGGA AGTTGCCCGA AAGCAATGAT GACGAACGGA TCGATCTCAG CAAATGGAAG GGCTATACAT CGGATCGTGG CAATAGTGGC AACGACGAAG ATGAGCCAGT GGCGCTTGAT CCGGATAAGT TGCCTGGCTG CGATACGATT GCTCCAACCC CAACTCCAGG CACACCAACG CCTGATCCAT TGACCCCAAC CGTACCAGTG CTGGGGCCAG GTCAAGTTTT GATGCCTAAT TTGGTTGGCT ATGGCGAAAA TCAAGCTCGC CAACAGTTGA TGAGCTTAGG CTTTGCGCCT GATAAGATTG TGGTCGATTA TCAAGGCCGC GATCGACTTG GGCCAGTTTT TGATCAATAT CCAGCCTATG CTGTGGTCAG TAGCCTGCCC GGGGTTGGCT CGGTGGTTGA TCTGAATACT GTGATTATTT TGGGCATTCG CTCGCCTGAT GGCAGCCAGC CAACCACGGC TCCACCAGTA ACAGGCCAAC CAACGACGGC TCCGCCGCCA GTTAACCCAA CTCCGGCATT ACCATTGCCG ACGACGATTA TTATTCAGCC GTCGCCAGTT GAGCCATCGC CTGTGCCTGA ACAACCCCAA CCAACTCCAG TGGTAACACC CTAA
|
Protein sequence | MRQTRNVISR RQRRTTRFIP SRLGNKQAPR PMGRRIVLAF VGLLVAGLVL MGVAGVAMAV TYNGIAANLK PRLDQIHTYT AFQPSKIYDR NGTLLYEFVG EGRRTPVKLE EVSKHLINAT VAAEDASFFE NSGVNYFSIA RATYANLTQQ SVGAGGASTI TQQVVRLIVL TTEERQDPNV YSRKVKEIIL AQELNTVYSK NEILELYLNE IPYGNLSYGI QAAAQNYFGV DAKDLDIAQS SLLGGIPQLP TTYNPMPWLD DNLLLKGIKL PKDVWIDPLY DLSNDIKGEI APPKGRQIEV LRQMVKNNYL TEREARAAVA KDLQFAKPEV SLLAPHFVFY VKDYLQQRYG AEVVSNGGLS ITTTLDLETQ NLAQTIAYTR IQELNADNRN IHNAAVVVMQ PNTGQILGMV GSIGYDLSET TTTPGEEGNV LDGKVNVTTA LRQPGSALKP FTYLSGMEQY VATDGARGIT PASVLWDVPT IFNPRGVKYE PQNFDNQFHG PLRARTAVAN SLNIPAVKGL KAAGIPETLD LLHRLGISPN VLANDPGYYG LALTLGGGEV TPLDLATAYN TVASGGRYFA PTPILKITDA RGKTLEEFKP TPLANPESDA VSDTSKCVIP EGEDYQLGAR VPNGTQCVDG RLNYIITNMI SDNEARRPIF GLNSILKLSQ PSAVKTGTTN DFRDAWASGF TPFVTVTVWT GNNNNEQTAQ VESTQGGGVI WARTMEAIFA NEQIMNRLAG FYGGIENMPQ SFEKSYPGVY RESICEIPGP FGGRTDELFI DGLDAGGKCD LYEKVSVVRL TVTDAEGKET TTYCRPVEGA EYPEGAISSI YVWKLPESND DERIDLSKWK GYTSDRGNSG NDEDEPVALD PDKLPGCDTI APTPTPGTPT PDPLTPTVPV LGPGQVLMPN LVGYGENQAR QQLMSLGFAP DKIVVDYQGR DRLGPVFDQY PAYAVVSSLP GVGSVVDLNT VIILGIRSPD GSQPTTAPPV TGQPTTAPPP VNPTPALPLP TTIIIQPSPV EPSPVPEQPQ PTPVVTP
|
| |