Gene Information |
Locus tag | Haur_3462 |
Symbol | |
ID | 5735323 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 4356928 |
End bp | 4359249 |
Gene Length | 2322 bp |
Protein Length | 773 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641280609 |
Product | cellulose-binding family II protein |
Protein accession | YP_001546226 |
Protein GI | 159899979 |
COG category | |
COG ID | |
TIGRFAM ID | |
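
The length fields in this record agree with each other if the coordinates are read as 1-based and inclusive: End bp - Start bp + 1 = 2322 bp, and 2322 / 3 = 774 codons, of which the last is the stop codon (the gene sequence ends in TAA), leaving 773 aa. The sketch below reproduces that arithmetic. The function names are hypothetical and not part of any database API, and the inclusive-coordinate convention is an assumption inferred from the numbers in this record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above (Haur_3462).
length = gene_length_bp(4356928, 4359249)
print(length, protein_length_aa(length))
```

The check only applies as written because the record lists the gene as unspliced and not a pseudogene; a spliced gene would need per-exon coordinates.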
| |
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0281342 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACTATC GCTGGCGAAT ATGCCTGCTA TTAATTGCAA CAATACTGAG TACATTTAGT TACACCAATA CCAATGCCCA AAGTGGCCTA AGTTGCCAAG TTAATTATGC GCTTACCAAC CAATGGGGCG GCGGGTTTCA AGCCGATGTG GTGGTGCGCA ACACCGGAAC CAGCGCGATC AACGGCTGGA CGGTTGCTTG GAGTGCCGCC AGCGGCCAGC AAATTGGCCA AATGTGGAAT GCAACCTTTA CCCAAAGTGG CACGCAAGTT AGTGCCAAAA ATGTTGATTG GAATGCCAAC ATCGCCGCTG GTGGCAGCCA AAGCTTTGGC TTTACCGCTA CCACGACTGG TAGTTTGGCC GTGCCTAGCA GTTTTACGGT CAATGGTGTT GTGTGTGGCG GTAGCGTTAG CCCTACGGCA ACCACTCCAG CAGCAACGGC AACGCGCACG CCAACCAGTG TGCCAACCGC AACCCGTATT GCGACTAGCA TACCAACCGC AACAAATGCG GCAACCAGCG TGCCGACGGC AACGCGTGCA CCAACCAGCG TGCCCACGGC CACCCCAAGC ACAACCACAG GCCGCCAAAT GGAAAAACTC AATCGCGGGA TCATCAGCGT GCGCCAAGGC AGCAATAATT TTGTGAGCTG GCGCATGTTT GGCACTGATC TTAGCGCGAT TGGCTTTAAC CTCTATCGTG GCACCACCAA AGTTAATTCC AGCCCAATTA CCAATGCTAC CAGTTATTTG GATAGCGGCG CGGCGGCCAA TAGCGTTTAC ACCGTGCGGC CTGTGATCGA TGGCGTTGAA CAAACTGCCT CAGAAAACTC GCTCAATTTT GCCAATGGCT ATCTCGATGT GGCCTTGCAA ATTCCGGCTG GTGGCACGAC ACCTGATGGC GTGGCCTATA CCTACACCGC CAACGATGCC AGCGTCGGCG ACCTCGATGG CGATGGGCAA TACGAAATTG TACTGAAATG GGATCCAACC AATTCCAAAG ATAATTCGCA ATCTGGCTAT ACTGGCAATG TTTATCTCGA TGGCTACAAA TTGAACGGAA CCCGTTTATG GCGCATCGAT TTGGGCCGTA ATATTCGGGC TGGGGCGCAT TACACCCAAT TTATGGTCTA CGATTTGGAT GGCGATGGGA AGGCCGAAGT TGCCGCCAAA ACTGCCGATG GCACGCGTGA TAATTCTGGC ACCGTGATTG GCAACGCCAG CGCCGATTAT CGCAATTCCA GCGGCTACAT TCTTTCTGGC CCCGAATATC TGACGGTATT CAATGGCCAA ACGGGCGTGA TTCGCTCGAC CGTCAATTAT GATCCTGCAC GGGGCACGGT TTCGTCGTGG GGCGATAGCT ATGGCAACCG CGTTGATCGC TTTTTGGGTG GAATTGCCTA CCTCGATGGT CAACGCCCGA GCCTGATTAT GAGCCGTGGC TACTACACCC GCAGCGTAAT TGCCGCTTGG GATTTCCGCA ATGGCAGTTT GACCAAGCGT TGGACGTTTG ATAGCAATGT GTCGGGCAGC CAATATGCTG GGCAAGGCAA CCATGGCCTT TCGATCGCCG ATGTTGATCA AGATGGCAAA GATGAGATTA TCTTCGGAGC CATGACGATT AATGATAATG GCCAACCACT GTGGAACACT CGCAATGGTC ATGGCGATGC GATGCACGTC GGCGATCTTG ACCCAAGTCG GGCTGGCTTG GAAGTGTTCA AAGTCAGCGA GGATTCATCA AAGCCTAGCT CGTGGTTTGC CGATGCCCGC ACAGGCCAAA TTTTGTGGCA AACAGCGGCA 
GGTGGCGATA ATGGGCGCGG CGTTTCGGGC GATATTTGGT CGGGCAGCCC GGGCGCTGAA TCGTGGTCAT CGATGGATAG CAATTTGCGT AGCGTCAGCG GGGCAACTCT TGGCCGCAAA CCATCAGCAA CCAACTTCTT GATTTGGTGG GATGGCGATC CAATGCGCGA ATTGCTTGAT GCCACCCGTA TCGACAAATA TGGCACATCA GGCGATACGC GCTTGCTGAC TGGCAGCAAT GTTAGCTCCA ACAACAGCAC TAAATCGACC CCAGCGCTCA GCGGCGATAT TTTGGGCGAT TGGCGCGAAG AGGTGATTTG GCGCACCAGC GATAACACCG CGCTGCGGAT TTATTCAACT AGCACCAGCA CCAACCGCCG CATCTTCACC TTGATGCACG ATGCCCAATA TCGAGTGGCA ATTGCTTGGC AAAACACCGC CTACAATCAA CCACCGCATC CTAGCTTTTT CTTGGGCGAT GGCATGAGCA ATCCACCGCA ACCGAATATC TACTTGCGCT AA
|
Protein sequence | MNYRWRICLL LIATILSTFS YTNTNAQSGL SCQVNYALTN QWGGGFQADV VVRNTGTSAI NGWTVAWSAA SGQQIGQMWN ATFTQSGTQV SAKNVDWNAN IAAGGSQSFG FTATTTGSLA VPSSFTVNGV VCGGSVSPTA TTPAATATRT PTSVPTATRI ATSIPTATNA ATSVPTATRA PTSVPTATPS TTTGRQMEKL NRGIISVRQG SNNFVSWRMF GTDLSAIGFN LYRGTTKVNS SPITNATSYL DSGAAANSVY TVRPVIDGVE QTASENSLNF ANGYLDVALQ IPAGGTTPDG VAYTYTANDA SVGDLDGDGQ YEIVLKWDPT NSKDNSQSGY TGNVYLDGYK LNGTRLWRID LGRNIRAGAH YTQFMVYDLD GDGKAEVAAK TADGTRDNSG TVIGNASADY RNSSGYILSG PEYLTVFNGQ TGVIRSTVNY DPARGTVSSW GDSYGNRVDR FLGGIAYLDG QRPSLIMSRG YYTRSVIAAW DFRNGSLTKR WTFDSNVSGS QYAGQGNHGL SIADVDQDGK DEIIFGAMTI NDNGQPLWNT RNGHGDAMHV GDLDPSRAGL EVFKVSEDSS KPSSWFADAR TGQILWQTAA GGDNGRGVSG DIWSGSPGAE SWSSMDSNLR SVSGATLGRK PSATNFLIWW DGDPMRELLD ATRIDKYGTS GDTRLLTGSN VSSNNSTKST PALSGDILGD WREEVIWRTS DNTALRIYST STSTNRRIFT LMHDAQYRVA IAWQNTAYNQ PPHPSFFLGD GMSNPPQPNI YLR
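
The GC content field can in principle be recomputed from the gene sequence listed above. The sketch below assumes the reported percentage is the share of G and C bases over the whole gene, rounded to an integer; that definition is an assumption, not something the record states. The function name is hypothetical. It strips the spaces that separate the 10-base blocks before counting.

```python
def gc_content_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide string.

    Whitespace (such as the spaces between 10-base blocks) is ignored,
    and the comparison is case-insensitive.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Toy input, not the gene above: 6 of the 8 bases are G or C.
print(gc_content_percent("ATGC GGCC"))
```

To compare against the GC content field, pass the full gene sequence string from this record and round the result.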