Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3461 |
Symbol | |
ID | 5735322 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 4355100 |
End bp | 4356560 |
Gene Length | 1461 bp |
Protein Length | 486 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641280608 |
Product | cellulose-binding family II protein |
Protein accession | YP_001546225 |
Protein GI | 159899978 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3866] Pectate lyase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.560967 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAATTAT CAATTGCCCG ACGTTGGGTT ATTGCCACCG GCTTAGCCAC TGCCTTGATT GGCGCAATCC AAAGCCCAAT CACCAACGCC CAAACTGGCC TAAGCTGCCA AGTCAATTAT ACGCTTACTA ACCAATGGGG CAGCGGGTTT CAAGCCGATG TGGTGGTGCG CAACACCGGA ACCAGTGTGA TCAACGGCTG GACGGTTGCT TGGAGTGCTG CCAGCGGCCA GCAAATTGGC CAAATGTGGA ATGCGACGTT TACCCAAAGT GGCAGCCAAG TCAGTGCCAA AAATGTTGAT TGGAACGCCA GCATCGCTGC TGGTGGCAGC CAAAGCTTTG GCTTTACCGC TACCACGACT GGTAGCTTGG CCGTGCCCAG CAGCTTTACG GTCAATGGCG TTGTTTGTGG CGGTAGTGTT AGCCCAACCG CAACCCGCAC GCCAGCCGCG ACTGCCACGC GTACTCCAAT TGCCACTGCA ACCCGCACGC CAGCTGTAAC CGCCACGCGC ACCCCTGTGG CAACTACTAC CCGTACTCCA ATTGCCACGG CTACCGTCGT TCCAACCAAT CCACCAGTCA GCAATGGCTT GATTGGCTGG GCCACGGTTG CTGGTTCGGG CTTAAGCACA ACTACTGGCG GCACTGGTGG TAGCACAGTT ACCGCAGCCA ACTTTACTGA ATTGCAAAAC TACGCCAAAT CGTCATCGCC CATGATTATC AAGTTCTCGG GTACGATGCA AGGCACACTG ACGGTTGCCT CGAACAAAAC GATTATCGGC AGCAATGGAG CCTTGATCCA AGGTAATGTC AAAATCTCAG GCGCTCAAAA TATTATTTTG CAAAATTTTG CGATCAACGG CAATAGCTGC TCAAGCTACG ATAACTGCCG CGCTGGAAGC GATGCCTTGG GGATTAGCAA TTCGCACCAT ATTTGGGCCG ACCACTTGAC GATTACCAAT GGCCAAGATG GCAATTTCGA CATTAACAAT GGCTCTGATT TCATTACGGT TTCGTGGAGC AAATTCGGCT ATACCACCAA CAAAGAGCAT CGTTTCTCGA ACTTGATTGG TAGCTCAGAC GATGCAGCCT CGACCGATAG CGGTAAATTG AACGTGACCT TCCATCATAA CTGGTGGTTT GGTGGGGCAA TGCAGCGCAT GCCACGTACG CGCTTCGGCA AAATTCACGT ATTCAATAAT TTGTACACCA CCACTGGCAA CGATTATTGT GTTAGCTCAG GCTATCAATC CAAAGTGTTG CTCGAAAATA ATGCCTTCAT TGGGGTCAAC ACGCCGCACC GCTTGCACGA TGGCGATCTC AAGGCGGTGG GCAATCTCTA CCAAAACACC AGCGGCGATC AAATTAGTAC TGGCGTTGCC TTCACGCCGC CCTACAGCTA TAGCGCCGAA GCTGCTAGCT CACTCAGCAG TTCAGTCCAA GCTGGCGCAG GAGCGAAGTA G
|
Protein sequence | MKLSIARRWV IATGLATALI GAIQSPITNA QTGLSCQVNY TLTNQWGSGF QADVVVRNTG TSVINGWTVA WSAASGQQIG QMWNATFTQS GSQVSAKNVD WNASIAAGGS QSFGFTATTT GSLAVPSSFT VNGVVCGGSV SPTATRTPAA TATRTPIATA TRTPAVTATR TPVATTTRTP IATATVVPTN PPVSNGLIGW ATVAGSGLST TTGGTGGSTV TAANFTELQN YAKSSSPMII KFSGTMQGTL TVASNKTIIG SNGALIQGNV KISGAQNIIL QNFAINGNSC SSYDNCRAGS DALGISNSHH IWADHLTITN GQDGNFDINN GSDFITVSWS KFGYTTNKEH RFSNLIGSSD DAASTDSGKL NVTFHHNWWF GGAMQRMPRT RFGKIHVFNN LYTTTGNDYC VSSGYQSKVL LENNAFIGVN TPHRLHDGDL KAVGNLYQNT SGDQISTGVA FTPPYSYSAE AASSLSSSVQ AGAGAK
|
| |