Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3384 |
Symbol | |
ID | 5735245 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 4267435 |
End bp | 4268661 |
Gene Length | 1227 bp |
Protein Length | 408 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641280531 |
Product | hypothetical protein |
Protein accession | YP_001546148 |
Protein GI | 159899901 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3693] Beta-1,4-xylanase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTAAAT CACGCTGGTT AAGTGCCGCC ATGGTCTGCT TGCTAGCAGT TAATCTTTTG GCGGCCTGTG GTGGCGATAG TGCGCCCACT ACCCAACCAA CCAATCCTGA AGCAGCCACG GCTACGCCCG AAACTGCTGC TCCAACCACT GATAGCAATC TACCAGTAAC GACGGGCAAC CCGTTGCAAT TGCCCTATTT GCAATATGGT GCAGCCGCGC AACTGTACTA TACTGATCGT AATCGCGCCT TGACCTTGAT GAACAACGCT GGGTTCGATT GGGTGCGCCA ACAGATTCAA TGGAAAGATA TTGAAGGCCC AAAAGGTAAC TTTGGCTGGG GCGAACTCGA TGCAATTGTT GCTGATGCCA ACGCCAAAAA TATCAAAGTG CTGTTGAGCA TTGTACGCTC ACCATCGTGG GCACGGGCCG ATGGAACCAA CGGCATGCCC GATAACATCA AAGATTTTGG CGATTTTGTT GAGGCCTTGG TGGTACGCTA CAAAGGCAAA GTCCAAGCCT ACGAAATTTG GAACGAACAA AATCTTGATC ATGAAAATGG CGGCTCACGT GATTCGATCG ACGCTACCAA ATATGTTGAT CTCTTGGTCG AAGCCTACAA TCGGATCAAA CCGATCGATC CTGAAGCCTT TGTGATTTCA GGAGCATTGA CTTCAACTGG CGATTCACCA GCGGCGATCG ATGATATGAC CTACTTTGAG CAAATGTTTA GCTACAAAGA TGGCATTTTC AAAGATCACA TCGATGGTGT GGGCTTCCAT CCTTCGCCAT CGTACAATCC GCCAGCAACC TTATGGCCCG ACCAGCCCGG CCCAGGCCCA GGTTGGCTCG AAAGCCCAAC CCACTACTTC CGCCATATCG AAAATCTCAA AATCTTGATG GATAAATATG GCATGCAAGA TTATCAAGTG TGGGTGACTG AATTTGGCTG GGCGACCCAA AACACCAGCC CAGGCTATGA GTATGGCAAC GAAATTAGCT TCGAGCAACA AGGTCAATAT GTGCTCGATG CGCTGCAAAT GACCCGCCGC GATTACCCAT GGGTTGCCAC CATGTTTGTG TGGAACCTCA ATTTTGCGGT AACTTCGCCT GATCCGCTTG ATCAAACCGC CTCATTCGGT ATTCTCAACC CCGATTGGAG TCCACGGCCA GTCTTTGAAA AAATTCAAGG CTTTGTTAAC GCCGTCAAAA CCGAGGAAGG TCGCTAA
|
Protein sequence | MPKSRWLSAA MVCLLAVNLL AACGGDSAPT TQPTNPEAAT ATPETAAPTT DSNLPVTTGN PLQLPYLQYG AAAQLYYTDR NRALTLMNNA GFDWVRQQIQ WKDIEGPKGN FGWGELDAIV ADANAKNIKV LLSIVRSPSW ARADGTNGMP DNIKDFGDFV EALVVRYKGK VQAYEIWNEQ NLDHENGGSR DSIDATKYVD LLVEAYNRIK PIDPEAFVIS GALTSTGDSP AAIDDMTYFE QMFSYKDGIF KDHIDGVGFH PSPSYNPPAT LWPDQPGPGP GWLESPTHYF RHIENLKILM DKYGMQDYQV WVTEFGWATQ NTSPGYEYGN EISFEQQGQY VLDALQMTRR DYPWVATMFV WNLNFAVTSP DPLDQTASFG ILNPDWSPRP VFEKIQGFVN AVKTEEGR
|
| |