Gene Information |
Locus tag | Haur_2647 |
Symbol | |
ID | 5734527 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 3397816 |
End bp | 3398739 |
Gene Length | 924 bp |
Protein Length | 307 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641279789 |
Product | chitin-binding domain-containing protein |
Protein accession | YP_001545413 |
Protein GI | 159899166 |
COG category | [S] Function unknown |
COG ID | [COG3397] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
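The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and not part of the record. It assumes that Start bp and End bp are 1-based and inclusive, and that the CDS length includes a terminal stop codon; the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(3397816, 3398739)
    print(length, protein_length(length))
```

Under these assumptions the values agree with the record: 3398739 - 3397816 + 1 = 924 bp, and 924 / 3 - 1 = 307 aa.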
| |
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0138367 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTACAAC GCCGTATCTT TAGTTTAACC GTCGGTTGTT GGTCGGCATT ATTGCTGGTT TTGGCAATTA GCAGCCAAGT TAACGCTCAC GGAGCGATGC AAACGCCAGT CAGTCGGACA TATGCATGTT TTCTTGAAGG CCCAGAAACT CCTGATACCG CCGCTTGTCG CGCAGCAATT GATATCGGAG GCACGCAGCC ACTCTACGAT TGGAATGAAG TTAATATTGG TAATGCTGCG GGTCAGCATC GTTCGTTAAT TCCCGATGGC AAATTGTGTA GTGCAGGCCG CGCCAAGTAT GCCGGCTTTG ATCTAGCTCG CGCCGATTGG CCCGCCACAC CATTAACATC TGGCAGTTCG ATGAGTTTTC TCTATCGGGC GACCGCACCG CACCCTGGTT CGTTCGAGTT TTATATTACT CGCGACGGTT ATAGCCCGAC CCAAGCCCTC AAATGGTCGG ATCTTGAGGC CACGCCCTTT TTGAAGGTCA CCAATCCGCA ATTAGTTAAT GGCTCATATG TGATCAATGC CCGCATTCCT AACAACAAAA CTGGCCGCCA TTTGATCTAT TCAATTTGGC AGCGCTCGGA TAGTGCCGAA GCCTTTTACA CCTGCTCGGA TGTGACCTTT GGCGGCACCA ACCCAACCAG CGTGCCAACC GCCACACCAC GGCCAAGCGC AGTGCCAACC GCAACCGTGC CACCAACCGG CACACCAATT GCCACCGCCA CACCGCGCCC AACCAACGTA CCAACTGCCA CGCCAACCAA TCCAACTACT CAAGCTTGGC AACCCTATAC CAGCTATGCC ACTGGCGCAG GCGTAGCTTA TGCAGGCAAC ACCTACATCT GTCGCCAAGG CCATACCTCG CTGCCAGGTT GGGAACCCTC AGCAGTGCCA GCGTTATGGC AATTGATCAA TTAA
|
Protein sequence | MVQRRIFSLT VGCWSALLLV LAISSQVNAH GAMQTPVSRT YACFLEGPET PDTAACRAAI DIGGTQPLYD WNEVNIGNAA GQHRSLIPDG KLCSAGRAKY AGFDLARADW PATPLTSGSS MSFLYRATAP HPGSFEFYIT RDGYSPTQAL KWSDLEATPF LKVTNPQLVN GSYVINARIP NNKTGRHLIY SIWQRSDSAE AFYTCSDVTF GGTNPTSVPT ATPRPSAVPT ATVPPTGTPI ATATPRPTNV PTATPTNPTT QAWQPYTSYA TGAGVAYAGN TYICRQGHTS LPGWEPSAVP ALWQLIN
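The sequences above are printed in space-separated blocks of ten characters. As a rough sketch (the helper names are hypothetical and not part of the record), the blocks can be joined and a GC percentage computed as follows. How the database rounds the reported GC content is not stated, so any comparison with the 53% figure should allow for rounding.

```python
def clean_sequence(raw: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


if __name__ == "__main__":
    # First two blocks of the gene sequence, used only as a small demo input.
    demo = "ATGGTACAAC GCCGTATCTT"
    print(len(clean_sequence(demo)), gc_percent(demo))
```

Pasting the full gene sequence in place of the demo string gives the cleaned length and GC percentage for the whole gene.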