Gene Information |
Locus tag | Haur_2820 |
Symbol | |
ID | 5734701 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 3586898 |
End bp | 3588799 |
Gene Length | 1902 bp |
Protein Length | 633 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 641279963 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001545586 |
Protein GI | 159899339 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2730] Endoglucanase |
TIGRFAM ID | |
|
|
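The coordinate and length fields above can be cross-checked against each other. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the last codon of the CDS is a stop codon that adds no residue to the protein; both assumptions match the reported lengths for this record. The function name `feature_lengths` is illustrative and not part of any database API.

```python
def feature_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS."""
    # Assumption: coordinates are 1-based and inclusive, so add 1.
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Assumption: the final codon is a stop codon and encodes no residue.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


gene_len, protein_len = feature_lengths(3586898, 3588799)
print(gene_len, protein_len)  # 1902 633
```

For Haur_2820 this gives 1902 bp and 633 aa, in agreement with the Gene Length and Protein Length fields.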
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000391085 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGCAGA AACAACGTTC GTTTCGCACC CGCCTAGCCA TGATCGGCGG GCTAGTTACC CTACTCTTGG CTGGTCAGCC CGCTACTACC AAGCCAACCG CTGCCGCCGC TATCTGCGAG GTCACTTACA CAATCTCGAA TCAGTGGTCT ACTGGCTTCA CTGCTAATGT GAGTGTTAAG AATCTTGGGA TCGGTCTCAA TAATTGGCAG GTTGGCTGGA CATTCGCGGG CAATCAGGCA ATTACCAATC TTTGGAATGG GGTGCTCACC CAAACTGGCG CTCAGGTCAG TGTGTCAAAT CCAGCATGGG CCGCCAGTTT ACCCAGCAAT GGTACTGCCA GCTTTGGTTT CCAAGCTTCG TATACTGGTA GTAATGCGAT TCCAAACGCA TTTACCTTGA ATGGCGTTAG TTGCAACGGC GATCAGCCAA GCCCAATGCC TACAAATACT GCTATTCCGA GCATTCCACC TGCCACCAAC ACCCCTAATC CGCCAACCAA TACGCCAATT GCTACCACAA CCGGAACCCC TCGCCCAACC AATACGCCAA CCAGCGTCAT TCCAACGGTC ACGAATACAC CTCGCCCAAC CAATACTCCA GTTCCAACCA CGGTCAATCC AACGGCTACG AGTACCCCAA CTGGTAATAA CAATAATGAT GATTGGCTCC ACACCAATGG CAATCAAATT GTTGATAGCG CAGGTCGCCC AGTTTGGTTA ACTGGAGTCA ATTGGTTTGG CTTCAATGCA ACTGAGCGGG TGTTTCATGG CTTGTGGTCG GCCAATTTGA CCAGCATGAT GCAAAGCATT TCGCAACGTG GATTGAACAT TATTCGCGTA CCAATCTCAA CTGAATTGAT TTTGGAGTGG AAAGCCGGGG TTTTCAAAAC ACCAAATGTC AACACTTACG CCAATCCTGA ATTAGAAGGC TTAACCTCGT TGCAAATATT TGATCGCTTC GTGATGCTTT CAAAGCAATT TGGCATTAAG GTGATGATCG ATGTGCATAG CGCCGAAGCC GATAATTCAG GCCATTATGC GCCACTCTGG TACAAAGGTT CGTTTACCAG CGAGCAGTTT TATCAGGCTT GGGAGTGGAT TACTGATCGT TACAAAAATG ACGATACGGT GATCGCAATG GATATTAAGA ATGAGCCACA CGGCACGGCC CACGATAATC AAACCAGCAG TCAATTTGCC AAATGGGATA ACTCGACCGA TATCAACAAC TGGAAATACG TTTGCGAAAC TGCTAGCAAA CGAATTTTGG CGATTAACCC TAATGTCTTG GTGCTATGCG AAGGCAACGA GGTTTATCCA AAGGCCGGCG CAAGCTATAC CTCAAGCAAC AAAAATGATT ACTACTTTAC CTGGTGGGGC GGAAATTTAC GTGGCGTGCG TGATTATCCG GTCAATCTTG GCAGCAACCA AGATCAATTG GTCTACTCGC CACACGATTA CGGCCCGTTG GTCTTCAATC AATCGTGGTT CTACCCTGGT TTTACCAAAG AAACGCTTTA CAACGATGTT TGGTATCCTA ATTGGTTTTT TATCCATGAA GAAAATATTG CGCCATTGTT TATTGGCGAA TGGGGTGGCT TTTTGGATGG TGGCGCAAAT GAACAATGGA TGAAGGCGTT GCGCGATTTG ATCAAAGAGC ACTATCTACA CCATACCTTC TGGGTACTCA ACCCCAATTC TGGCGACACT GGCGGTTTGC TCGGATACGA TTGGGCCACT TGGGATGAGG CTAAATATGC CTTGCTCAAG CCAGCCTTGT GGGCAGATCG CAATGGTAAA 
TTTGTCAGCC TCGATCATCA AATTCCGCTC GGTGGCACAG CTACTGGCAC AACCATTACC CAATATTATC AACAGGGCAA CCAAGCTCCA AGCAATCCCT AA
|
Protein sequence | MSQKQRSFRT RLAMIGGLVT LLLAGQPATT KPTAAAAICE VTYTISNQWS TGFTANVSVK NLGIGLNNWQ VGWTFAGNQA ITNLWNGVLT QTGAQVSVSN PAWAASLPSN GTASFGFQAS YTGSNAIPNA FTLNGVSCNG DQPSPMPTNT AIPSIPPATN TPNPPTNTPI ATTTGTPRPT NTPTSVIPTV TNTPRPTNTP VPTTVNPTAT STPTGNNNND DWLHTNGNQI VDSAGRPVWL TGVNWFGFNA TERVFHGLWS ANLTSMMQSI SQRGLNIIRV PISTELILEW KAGVFKTPNV NTYANPELEG LTSLQIFDRF VMLSKQFGIK VMIDVHSAEA DNSGHYAPLW YKGSFTSEQF YQAWEWITDR YKNDDTVIAM DIKNEPHGTA HDNQTSSQFA KWDNSTDINN WKYVCETASK RILAINPNVL VLCEGNEVYP KAGASYTSSN KNDYYFTWWG GNLRGVRDYP VNLGSNQDQL VYSPHDYGPL VFNQSWFYPG FTKETLYNDV WYPNWFFIHE ENIAPLFIGE WGGFLDGGAN EQWMKALRDL IKEHYLHHTF WVLNPNSGDT GGLLGYDWAT WDEAKYALLK PALWADRNGK FVSLDHQIPL GGTATGTTIT QYYQQGNQAP SNP
|
| |
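The gene and protein sequences above can be checked against each other by translating the nucleotide sequence. The sketch below is a minimal, dependency-free translator; it assumes that translation table 11 assigns the same amino acid to every codon as the standard code (the two differ in which codons are permitted as starts, which does not matter here because this gene starts with ATG). The helper names `clean`, `translate`, and `gc_percent` are illustrative only.

```python
from itertools import product

# Codon table built in TCAG order (assumed identical to the standard code
# for elongation codons under translation table 11).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq):
    """Remove the spaces and line breaks used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


# First three and last two blocks of the gene sequence, copied from the record.
head = "ATGTCGCAGA AACAACGTTC GTTTCGCACC"
tail = "AGCAATCCCT AA"
print(translate(head))  # MSQKQRSFRT
print(translate(tail))  # SNP
```

The two outputs match the first ten and last three residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein and the GC content field.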