Gene Information |
Locus tag | Haur_0859 |
Symbol | |
ID | 5732760 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 972259 |
End bp | 973710 |
Gene Length | 1452 bp |
Protein Length | 483 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641277991 |
Product | Ricin B lectin |
Protein accession | YP_001543635 |
Protein GI | 159897388 |
COG category | |
COG ID | |
TIGRFAM ID | |
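The coordinate and length fields above can be cross-checked against each other. The sketch below is illustrative and assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon. Neither assumption is stated in the record, though both are consistent with its numbers. The variable names are hypothetical.

```python
# Values copied from the Gene Information table.
start_bp = 972259
end_bp = 973710
reported_gene_length_bp = 1452
reported_protein_length_aa = 483

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS is complete and ends with one stop codon,
# so the protein has one residue per codon except the last.
expected_protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp == reported_gene_length_bp)
print(expected_protein_length_aa == reported_protein_length_aa)
```

Under these assumptions both comparisons hold for this record; the strand field does not affect the length arithmetic.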
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAAACCCT CACGGTTTAT GTGGATTTTT GCGTTACTTG TTCTTTTTGG TAGTTCTCTT CCCTCAACCC AAGCTCAATC GCGCAAAGAC GGCGCGGTTG AACAGGCTGA CGATACCCTG CCCGTCAACG ACCCCGAACG CGGCATGATC TACGATGGCT TGACTCCAGC GAAAACTGGT CCGTGTGCTG GCATGTATGA AGTTAAAGTT GGCGACCAAA TCTTCTGTAG CCACGGCCCC GACCCAATCG CCAAAGGCAA ATCGGTTAAA GACGAAACCT TGCCGCTCGA AAAGGCCGCA ATCAACCCAA TGATCGTCTG TAACGGCGAT GGCCACGCTG GTAACCGCAC CCAAGTGATG TATGTGCGGG CTTCCGACCG ACCTGATCGT TTCAACCAAT TTGTTGGTTC GATTCGCCAA TGGTCATCGG ATATGGATCG GATCTATCAA ACTAGTGCCC AAGAAACTGG CGGCTTCCGC GCGGTCAACT TCGTCACCAA CAATTGTGAA ATTTACGTGA TGAATGTGGT CGTGCCAGCC GATGGCGACG ATTCAATCGA CAAAACTGCC AATGCTTTGA AAGCCCTCGG CCACAATCGC GCCGACCGCA AATATTTGAC CTTCGTCGAT AACAACATTT TGTGTGGCGT AGCCTTCTAC ATGCTCGATG ATCGCCCCGA CCAAAATAAC GCCAATAACT TCGGCCCTGG CTTTGCACGT ATGGATAACG GCTGCTGGAA CGGCGCAACC GCTGCTCACG AACATATGCA CACGATGGGT GCAGTCCAAC GCTCGGCTCC CAATAGCACC GCCTATGGTC ACTGTATCGA TGATGAAGAT ATTATGTGCT ACGTTGACGG CCCAGGCACG CCACCAATGC AAAATCGCTG TGGTGCCAGC CAATATGGCC GCTTCGATTG TAACCACGAC GATTATTTCC ACACCAATCC GCCAGCAGGC AGCTACTTGG CAACCAAATG GAACTCGGCC AACAATAAGT TCTTGTTGAA GCAAGCACCT GGCGCAACCT ACTATCGGGT GCGCAACCGA GCAACCAACA AGTGTATCGA TGTTGCCGAG GGTGGCAACC CCGCTAATGG CACGGTCATT CTGCAATGGG ATTGCCACAA CGGCTACAAT CAACAATGGC AATTGATTTC GACCGACAAT GGTTTTGTGC GCTTCGTCTC ACGCGCCACC GGCAAGGTAA TGGATGTAAG TGCCGCCTCG ACCAGCGATG GAGCCAAAAT CCATCAATGG GAATGGGTCG GCGGTGCCAA CCAACAATGG CGCATCAATA ATCTTGGCAA TGGCTATTCA ACCATTACAG CTCGCCACAG TGGGTTAGCG GTTGATATTC CATGGTGCGA TGCAGCCAGC GGCGTGCAAT TGCAGCAAGT CAATCCAACC AACAACGATT GTCAATCATT TGTATTTGAG CCAATGCAAT AA
Protein sequence | MKPSRFMWIF ALLVLFGSSL PSTQAQSRKD GAVEQADDTL PVNDPERGMI YDGLTPAKTG PCAGMYEVKV GDQIFCSHGP DPIAKGKSVK DETLPLEKAA INPMIVCNGD GHAGNRTQVM YVRASDRPDR FNQFVGSIRQ WSSDMDRIYQ TSAQETGGFR AVNFVTNNCE IYVMNVVVPA DGDDSIDKTA NALKALGHNR ADRKYLTFVD NNILCGVAFY MLDDRPDQNN ANNFGPGFAR MDNGCWNGAT AAHEHMHTMG AVQRSAPNST AYGHCIDDED IMCYVDGPGT PPMQNRCGAS QYGRFDCNHD DYFHTNPPAG SYLATKWNSA NNKFLLKQAP GATYYRVRNR ATNKCIDVAE GGNPANGTVI LQWDCHNGYN QQWQLISTDN GFVRFVSRAT GKVMDVSAAS TSDGAKIHQW EWVGGANQQW RINNLGNGYS TITARHSGLA VDIPWCDAAS GVQLQQVNPT NNDCQSFVFE PMQ