Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1170 |
Symbol | |
ID | 5733063 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 1342627 |
End bp | 1344708 |
Gene Length | 2082 bp |
Protein Length | 693 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641278310 |
Product | carbohydrate-binding CenC domain-containing protein |
Protein accession | YP_001543946 |
Protein GI | 159897699 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3534] Alpha-L-arabinofuranosidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000394833 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGACGTT TGGTTCAACT TTCGCTGCTG ATGGTGTTGG GTGTCGCATG GCTCGGTTCG ACGCGGGCAC ATACCGCAAT TCCGAATGGC ACATCGATTG TATTTGATGA AGCCTTGGCT ACTGGTTGGC AAAATTGGTC GTGGAGCACA GCGGCAGCGT TTGATAGCAC TGCCCAAGCG CATACCGGCA CGAAATCGAC GGCAATAACC TTTGAGGCGT GGGGCGGTTT TTCGCTACAC TCACCTGGGC TGATTCCCGC CGAAGCCTTG ACTGGAATTA GTTTTTGGGT CTATGGCGGT ACTGGTGGCG CACAGTTTAA TGCTGGGGTC ACGACCTCTG ATAGCGGTAA TATCGTTGCT GCTGTGCCAG TTAACGCTCC CGCTGGCCAA TGGACGCAGG TTCAATTGAC AATGGCCCAG TTGGGCAATC CCAGTGCAGT TTCACGGATT AACTTGCAAG AATCCACAGG TACGGCTCAA GCAACCTTCT ACATCGACGA TCTTATCCTC AATATTGTCG ATTTTGCCCC ACCAGGCACA GTTAATGTAA CGATCGATAC CGATCAAACC GTGCTTGACT TTAGTGCCCG CAATGCACTT GGCACCAATT ATGGTATGTG GGGGGTTGAA GCTGCGGCCA ATTCAACTTT CTTGGCCCGC AGTGGCGCGT TGGGTAATAT GACAATTCGT TTGCCAGGTG GCTCGATGAG CCAAGACCAT GGGGCAATCA ACTGCGAATT GGGCACGGGC AACACAATTG CCAACGCATG GCCATGTAAT TATGAATGGG CCATGACCCC AACCGATTTT ATTGAATTTC TGAGCGCAAC CGATCGCGAT GCGATCTGGA CGATCAATAC CAACGTGACT TCGGGCGAAG CTGCCGCGTT GGTTGCCTTC TTCAATGGCG ATGTCAATGA TCAACGGGCA CTTGGCACCG ATCACAAAGG TGTGAATTGG CAGACAGTTG GCCATTGGGC GCAACTGCGG GCCAGCCATG GCAATGCCGC ACCAGTTGGC ATCAAATATT GGGATTTTGG CAACGAAACC TATGGATCAT GTGTCCAAGG CTGGGAAGTT GGCTGGACCT GTGATGGTGC TGAATATATC AACGGCTTGC CAGCACCCAA TCGCCACGAA GGCTATCTCG AAGTTCGTGC AGCAATGAAA GCGGTTGACC CAACGATTAT GGTCGGTGTA ACTGGCTTAG CCAACCCAAC CGAATTCAAC AACTGGACCA CCAAAGTGCT GACCGAGGGC GGCGATGTGA TCGACTACCT CGCAATTCAC CCCTACTCCT TTGTTGATGT GCCAAACAAT GATGCGCTTG GCCGTTATTT GATGCTTTCA CGCCCGCAAA AACAATGGCC TGAAGTTTTT GCTAGCACTC GTCAAGCGAT CGCCGATTTT GCTGATGGTC GCAATATTCC GATTATGATT GGCGAATACA ATTTGAGCGC TGGCTGGCCA TATGATCCGC AAGCGATGAT GACCAAAGCG CTGAATATGT TGTTTATTGC TGATACAACC GGCCAAATTA TTCAAGGCGA TGTGTTGGCT GCCAACCATT GGCTGTTGAA TGGCAATGTG CAAAGCAATG GCACCGATTA TGGTTTGATT GGCGATGTGC CAACCTACAC TCGTAGCCCG CAGTATTATG CCTTCCATCT CTGGCAATTT TTCGGCAATC GCCTACTTGA ATATACCAGC GATGCCGATG CCGAAACCCA AATTAGCATG TATGCTGGCA GCGATGGCCA TACGATCTCG CTGATGGCAA TCAACAAAAC AGGCCAAGCG ACCACCACCA ATGTCCAACT TCGCGCTGGT GGTGTGGTCA TTCCGGCAAT TTCTGGCTCG ATTGATGTGG CCCAAGCTGT TAGCCTCGAT GCAACCACAA TCACGTATAA TGGGCAAACC AATGTGAACG ATGATTTTAG TAATGCACAA ACGGTTGAAT TGACCGAAGT CGGCGAAACC ACCACGGTCA CGCTGCCCGC TTACTCAATC ACGCTCTTAC GGTTTGCAAC CGATGACGGC ATCCATACGC TGTATAATCC GTTGGTGCTG CTCAACAAAT AG
|
Protein sequence | MRRLVQLSLL MVLGVAWLGS TRAHTAIPNG TSIVFDEALA TGWQNWSWST AAAFDSTAQA HTGTKSTAIT FEAWGGFSLH SPGLIPAEAL TGISFWVYGG TGGAQFNAGV TTSDSGNIVA AVPVNAPAGQ WTQVQLTMAQ LGNPSAVSRI NLQESTGTAQ ATFYIDDLIL NIVDFAPPGT VNVTIDTDQT VLDFSARNAL GTNYGMWGVE AAANSTFLAR SGALGNMTIR LPGGSMSQDH GAINCELGTG NTIANAWPCN YEWAMTPTDF IEFLSATDRD AIWTINTNVT SGEAAALVAF FNGDVNDQRA LGTDHKGVNW QTVGHWAQLR ASHGNAAPVG IKYWDFGNET YGSCVQGWEV GWTCDGAEYI NGLPAPNRHE GYLEVRAAMK AVDPTIMVGV TGLANPTEFN NWTTKVLTEG GDVIDYLAIH PYSFVDVPNN DALGRYLMLS RPQKQWPEVF ASTRQAIADF ADGRNIPIMI GEYNLSAGWP YDPQAMMTKA LNMLFIADTT GQIIQGDVLA ANHWLLNGNV QSNGTDYGLI GDVPTYTRSP QYYAFHLWQF FGNRLLEYTS DADAETQISM YAGSDGHTIS LMAINKTGQA TTTNVQLRAG GVVIPAISGS IDVAQAVSLD ATTITYNGQT NVNDDFSNAQ TVELTEVGET TTVTLPAYSI TLLRFATDDG IHTLYNPLVL LNK
|
| |