Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0294 |
Symbol | |
ID | 5732189 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 346850 |
End bp | 348679 |
Gene Length | 1830 bp |
Protein Length | 609 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641277418 |
Product | cellulose-binding family II protein |
Protein accession | YP_001543074 |
Protein GI | 159896827 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3405] Endoglucanase Y |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.558691 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTAAAGC ATCGCAAAAT GTTGCTGATG GTGGCGTTGC TCACGCTACT GACCGTGCCA TGGTGGCAAT CCAAAGCCGC CCAAGGCAAT AGTGTTAGCC CATTATTATT CCCATACAAT CAAGCTTCGG GCATTAACTA CAATGTGACC GATCTCACGC AGGCTTGGAA CGAGTGGAAA AGCAACATGA TCACCGCCAA CAACGCTGGT GGCGGTAATC GGCTACGGGT GATGGGTGGG GTCGATAGCT CATCGACGGT TTCCGAAGGC CAAGGCTATG GCATTTTGTT TGCCTCGCTA TTCGACGACC AAACCACCCT CGATGGGTTG TGGTTGTTCA CCCGCGATCA CCTTGACCCC AACGGCTTGA TGCACTGGCA CATTGGCAAT CCTGGCCAAT TGCGCGGCAG CTACGCCGCT ACCGATGGCG ATGAAGATAT TGCTCTCGGC CTCGTCAATG CCTGTGTCAA AGTACGCAAG GGCGTTTGGC CAAATAGCAG CAATGGCTTA GATTATTGTA GCCTTGCCAC CACCATGATC AACAATATTT ACACCTATGA AGTTGATCAC CCAGGCTCAT CGCCAGTCGC TGGCTTGCCC AGCAACCCAG GCAACGAATT ATTGCCCGGC GATGGTTGGA ATTTGGCCCG CGATTATCCC GAAGGCATCG TCAATCTCTC GTATTTCTCG CCTGGCTATT TCACGGTATT TGGTAAATTC ACGGGCAAGA CCAGCGAATG GGAAGCCGTC AACACCCGCA ATTATGAAAT TACCAACCTC GCTCAATCGC GGCCAGGCAA CTGCTCGAAG CTTGTGCCCA ACTGGAACCA ATATGATGGT GATGCCCAAT TAGTTTCGTG GCAGCCCGAA GAATATGCTT GGTGGAGCTA TGACGCTGCT CGCTTTGCAT GGCGCGTTGC CGTCGATAAA GCTTGGTACA ACACCGCCAG CTCACGCGAA ACCATGAACG AAGTTGGGGG GTTCTTCAGT AGCGTGGGCA TCGAAAATGT GCAGGCTCGC TATCGGATGA ACGGCACATC AGTTGATAAT TATCGTGGTG TGTTCTTCGT CGCCAATGCT GCTGCCGCAA TTTGGGCCGC TCCTGCACCG CAAGCTATCA ATTGTGGCGC GGCAACGGGC ACGCTCAAAA CCAGCCCGCA ACAAGCCTAC AATATGGTGT TGACCACCAA AGATTCGCCA AATTCATATT ATGTCAATGC TTGGCGTTTG ATGAGCATGT TGTTGTTGAC TGGCAATTTC CCCAATATTT ATGAATTGGC CAACGGTGTT ACGCCAACCA ACACGCCTGT GCCAACCAGT GTGCCACCAA CTAACACGGC GGTTCCAACC AACGTGCCAC CAACCAACAC AACCGTGCCG CCAACCAGCA CACCACGGCC AACCAACACC CCTGTCACAA TCACACCGAT TTTGACCCCA ATTCCAACCC TGACCCCGAT TCCAACCACG GTTCCGACCA GCGTGCCACC AACCAATACG CCAATTGCTG GGGCATGCCA AATTACCTAC AGCATCAGCA ACGATTGGGG TAGTGGTTTT ACCGCCGATG TAAGCATTCG CAACAACGGC ACAGCAATCA ACAATTGGAA TGTGCGCTGG AATTTCGCAG GCAATCAACA AATTAACAAT CTCTGGAATG GAACTGTGAG CCAAACTGGC CAAGCAGTCA GCGTCAATAA CGTAGGCTGG AATGGCTATA TTGGTAGTGG CGGCACTGCC AGCTTTGGCT TCCAAGCCAG CTACAATGGC AGCAACCCCA AACCAACCAG TTTTAGCCTG AATGGCACAG CTTGTAGCGT TGCGCCATAG
|
Protein sequence | MLKHRKMLLM VALLTLLTVP WWQSKAAQGN SVSPLLFPYN QASGINYNVT DLTQAWNEWK SNMITANNAG GGNRLRVMGG VDSSSTVSEG QGYGILFASL FDDQTTLDGL WLFTRDHLDP NGLMHWHIGN PGQLRGSYAA TDGDEDIALG LVNACVKVRK GVWPNSSNGL DYCSLATTMI NNIYTYEVDH PGSSPVAGLP SNPGNELLPG DGWNLARDYP EGIVNLSYFS PGYFTVFGKF TGKTSEWEAV NTRNYEITNL AQSRPGNCSK LVPNWNQYDG DAQLVSWQPE EYAWWSYDAA RFAWRVAVDK AWYNTASSRE TMNEVGGFFS SVGIENVQAR YRMNGTSVDN YRGVFFVANA AAAIWAAPAP QAINCGAATG TLKTSPQQAY NMVLTTKDSP NSYYVNAWRL MSMLLLTGNF PNIYELANGV TPTNTPVPTS VPPTNTAVPT NVPPTNTTVP PTSTPRPTNT PVTITPILTP IPTLTPIPTT VPTSVPPTNT PIAGACQITY SISNDWGSGF TADVSIRNNG TAINNWNVRW NFAGNQQINN LWNGTVSQTG QAVSVNNVGW NGYIGSGGTA SFGFQASYNG SNPKPTSFSL NGTACSVAP
|
| |