Gene Information |
Locus tag | Haur_4574 |
Symbol | |
ID | 5736419 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 5852324 |
End bp | 5854342 |
Gene Length | 2019 bp |
Protein Length | 672 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641281736 |
Product | Ricin B lectin |
Protein accession | YP_001547333 |
Protein GI | 159901086 |
COG category | |
COG ID | |
TIGRFAM ID | |
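
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes that Start bp and End bp are 1-based inclusive coordinates, and that the CDS is unspliced and includes its terminal stop codon. The function names `span_length` and `expected_protein_length` are hypothetical helpers introduced here.

```python
# Values copied from the Gene Information table.
START_BP = 5852324
END_BP = 5854342
GENE_LENGTH_BP = 2019
PROTEIN_LENGTH_AA = 672


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming the CDS is unspliced
    and its last codon is a stop codon that is not translated."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the table: the interval spans 2019 bp, and 2019 / 3 = 673 codons, one of which is the stop codon, leaving 672 amino acids.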
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCGGCT TCAATCATCG TCATTGGCGT TGGGGGCTGG TGCTAAGTTG CCTAAGTAGT CTGTTGGTTG GGGCGATGTT GACCCAGCCA ACCCGTGCTG CCGAGCCAGT TGCGGTTAAT GCTGAGGGCT ATACCACCCG CAGCTATGGT AGCGTCACGT TTGAAGGCAT TAATTACGCA GTGCAAAGTA ATGTTGCTAA CGAATATGTT CCGAGTACCA CCCATTCGTA TAGCGATCTG GGCAGCTATT ATTTGGTCAA TAGTGATTAT AGTTTGCCAA ATGTGCCCAA TATTACCAGC GGCGTATTGT GGTCAACTGG CAACAAAAAT CGCGGTTGGG CGATCAACAG CGAGTACGAT ATTCGGGCTT TGGTGCAGGC CAATGGCGGT TTGTATGCAC CGTATCAGAC CATGCCTGGC TATCAATTGG GGCCATGGAA TGCTAGCACA CCCTGTTGTG GCTGGACGTT GCAGCGCAAT ACTACTGGTT TTTATATTCA GGCCGATGGC AAGGTGCGTG TGCCCAAAAC GCCTGCTGCT GCCCAACAAA CTTGGGATGC CAACCAAAGC CTGACTGCCA TCAACACCAC CAACGATATT GTGGCTGTTA CCGATGTGAT GTTTCCTGGC GACGAGGATT ATTACGCTGG CAACACCTAT TTGCCGCGTT CAGCGGGCGT ACTCACTGCC AAATATAAAC ATTACGACAA TCGCAATACT CACATTTATT GGGGCTTGAA GGGCCAGCAT GTGCGTGATG TGGAAGATTG GGAAGCCGAT GCGCCAGGCG GCAGCAAACG TAAAATCTAT ACTGGCGGTT TCAAAATCGA CGAAAGTGAT AATGGTCAAG TCTGGGCTGG CATTTCGCAT GGCAACGAAT TTGTTGATCT TAATTTGCAG CCGAGCGTAA CCGCCCAACA ACTGTACAAA GTTGAGTTGT GGATTCAACG TCCAACAGGT ATGGAATATT GGGGTGGTTT GAGCTACCAG CAGGGCGCTG ATGGCAAGTG GCGAGCCTTT GGCGATGGTA GCCATGTGAC TAATTGGGGC AACGGCACGT TTGGCTTGGT AGCAACCGCC TATCGCAATC GCAACGAACG CTTGTTGCTG GTTTATCGCG CCTTGCCAGG TGGTGATAAT CCGCCAACTC CCACCCCGCC ACCACCGCCA CCAACCAATG CTGCATCCTT TAATCTGATC AATCGCAGTA GTGGGCTATG TTTGGATGTT GCTGGGGCGA ATGCCGCCGA TGGTACCAAA GTGCAGCAAT GGACCTGTAA TAACGCGACG GCGCAACAGT GGGAACTACG CTTGGCCGAA AGTGGCTATT ATCAATTAGT TTCAAAAGCA ACTGGCAAAT GTTTAGATCT GGCGGCGTGG AGCACTACCG ATGGTGGGAT TGCCCATCAG TGGTCGTGCG GCAACAATCA ATCGAATCAG CAGTGGAATT TCCAAACCGT CAGCGATGGT TGGCTGCGAA TTGCCAACCG CAACAGTAGC AAATATCTCT CGATCGTCTA TGGTTCGGTG GATGCTGGGG CTGCGACTCA CCAATGGCCT TGGCTGGGCA ATCCCGACCA ACAATGGCGG ATTCAGCCTG TGGGTACACT GCAAATCGCC AACAAAAATA GCAATAAATG TATTGATGTT GCCAATAATA ATAGTGCTGA TGGCACGAAT ATTTTGCAAT GGCCTTGCTA CGCTGGCCTG GCCCAGCAAT GGCAATTTCA ACATAGCGAT AATGGCTATT ACAAGTTGCG CCACCCCAGC AGTGGCAAAA TGCTCTCGGT TTCGGGCGAT 
TCAAATGCCG ATGGGGCCAA CATTCACCTC TGGACAGCGG TGAGTAACCC GAGCCAACAA TGGCGGCTTG AACTGCTCGA CGATGGCTTT ATGCGCTTTG TCAATCGGGC AACCGGCAAA GTGGTTGATG TGGCTGGTGG CAGTAGCGCC GATAACGCCA ACATTCAGCA ATGGACGTGG AATAGTAGCA ACGCCCAACG CTTTAAACTG ACGAATTAG
|
Protein sequence | MIGFNHRHWR WGLVLSCLSS LLVGAMLTQP TRAAEPVAVN AEGYTTRSYG SVTFEGINYA VQSNVANEYV PSTTHSYSDL GSYYLVNSDY SLPNVPNITS GVLWSTGNKN RGWAINSEYD IRALVQANGG LYAPYQTMPG YQLGPWNAST PCCGWTLQRN TTGFYIQADG KVRVPKTPAA AQQTWDANQS LTAINTTNDI VAVTDVMFPG DEDYYAGNTY LPRSAGVLTA KYKHYDNRNT HIYWGLKGQH VRDVEDWEAD APGGSKRKIY TGGFKIDESD NGQVWAGISH GNEFVDLNLQ PSVTAQQLYK VELWIQRPTG MEYWGGLSYQ QGADGKWRAF GDGSHVTNWG NGTFGLVATA YRNRNERLLL VYRALPGGDN PPTPTPPPPP PTNAASFNLI NRSSGLCLDV AGANAADGTK VQQWTCNNAT AQQWELRLAE SGYYQLVSKA TGKCLDLAAW STTDGGIAHQ WSCGNNQSNQ QWNFQTVSDG WLRIANRNSS KYLSIVYGSV DAGAATHQWP WLGNPDQQWR IQPVGTLQIA NKNSNKCIDV ANNNSADGTN ILQWPCYAGL AQQWQFQHSD NGYYKLRHPS SGKMLSVSGD SNADGANIHL WTAVSNPSQQ WRLELLDDGF MRFVNRATGK VVDVAGGSSA DNANIQQWTW NSSNAQRFKL TN
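
The sequences above are displayed in space-separated blocks of 10 characters, so they need cleaning before any downstream use. The following is a minimal sketch, not the database's own method, of how the blocks could be joined and how a GC fraction comparable to the GC content field could be computed. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the short fragment used in the example is just the first two blocks of the gene sequence.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (spaces, line breaks) and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA sequence."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return gc / len(dna)


if __name__ == "__main__":
    # First two 10-base blocks of the gene sequence, as displayed.
    fragment = clean_sequence("ATGATCGGCT TCAATCATCG")
    print(len(fragment))          # 20 bases after removing the space
    print(gc_fraction(fragment))  # 9 of 20 bases are G or C
```

Applied to the full gene sequence, the cleaned length should match the Gene Length field if the displayed sequence is complete.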