Gene Information |
Locus tag | Haur_1428 |
Symbol | |
ID | 5733336 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 1649056 |
End bp | 1652475 |
Gene Length | 3420 bp |
Protein Length | 1139 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641278566 |
Product | Ricin B lectin |
Protein accession | YP_001544200 |
Protein GI | 159897953 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG5498] Predicted glycosyl hydrolase |
TIGRFAM ID | |
|
|
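The coordinate and length fields in the table above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names `span_length` and `expected_protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 1649056
END_BP = 1652475
GENE_LENGTH_BP = 3420
PROTEIN_LENGTH_AA = 1139


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming a single trailing stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 3420
print(expected_protein_length(GENE_LENGTH_BP))  # 1139
```

Under these assumptions both computed values agree with the reported gene length (3420 bp) and protein length (1139 aa).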
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.737248 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATACAC AACATTTACC TAATCGCTCA AGACGATTAC CATGGCTTGC AGGAACCCTC CTAACATTAT TAACTTCCAG TTTGTTCTTT TCTCCAACCC AACCTGTTGC CAACGCTGAT CAAGCTGGGC TGGGCAGTTA TACTACCACC TTGCCAGCAG GCGCAAAAGT TCCCGATGAT TTTAATGGAA ATCCTGTTTC TCCCAAACGC ACTGCCAATG TCACTGGCGC TATGCCCACC AACGATTGGT GGAGTTCGCT CGGCTGGCAG CGCTTCCCTG GTAATCCCTA TTCGGAAAAT ATGACGGCCT TGCCGTTGAT TGTCAAAGCC AAAGCGACTG GCTTGGGCGT GACCTTCCCA ACAATTCCGG CAATTTCAAC TGGCTCGCCC AACTACATTG GTGAGTTTCA CTATAACGCT TCCGAAGACC TGAACCTTGG TTTGGTGGGC TTGAATTCGC CCGATGCCAA AGTTGACGGC TACTCGGATT GGACAGTCAC GGCCTATTGG AATGGTGGCG GCACGCTCCG CGCAACCTTT GGCCATGGCA TGCCTTTCGT CTATTTGACC AAGAGTGGCG GCGATGCTTT GATTAGCGCG GCGGCTCCGC CAAGCGTTTG GACTGGCGCT GGCACGAATG CCCTCGGCAT CACGGTCAAC GGCCATCATT ACGGTATTTT TGCCCCAACT GGCACAACCT GGAGCCAATC GGGGAACAAT TTCCAATCAA ATTTGGCTGG CAAAGATTAT TATTCGGTGG CAGTATTGCC TGATAATAGC GTGGCAACCT TTAATTTCTT CAAATCCCGC GCCTATGCCT TTGTCACCAA CACCACCGCC AGCTGGAGCT ACGACCAAGC TTCTGCTACC TTGAATACAA CCTTCAGCGC AACCACGGTT GCCAAAGAAG GCAGCAATAC CAACACTGTT TTGGGCTTGT ATCGCCACCA CGCGATCAAC TCATCGACGG CGTTGACCAA CTACAGCTAC ACCACCGCCC GTGGTCAAAT TCGCTTGCGC GATGGTAATT CGTTTACCAC GGCCATGCGC TTCAATGGTG TGCTGCCAAC CCTGCCAGAT GCTGGCGATT ACAACCGCAC CACCCTCAAC AATCACTTGA ATGATGTTGC CTTCGAAGCT AGCCACTTTG GCGGTGCTGA TACCTACTAC ACTGGTAAGG CGCTCTTACG GTTGGCCAAC TTGATTCCGA TTGCTGAACA ACTGGGCAAC ACCAACGCCC GCAATGCGTT GATTACCGCT GTGCGCAATC GTTTGCAAGA ATGGTTCACC GCTAGTGCCA ACGATACCAA TGGCCAGTTC TACTACAATA GCAATTGGGG TACGGTCATC GGCTATCCAG CCTCGTTCGG TTCGGATACC GAATTGAACG ACCACCACTT CCACTATGGC TATTACATCT ACGCTGCGGC AATCTTGGCC CAATATGATC CAAACTGGGC GCTCGATAGC AACTGGGGTT CGATGGTCAA GCTGTTGATC AACGATGCTG CCAACATCAG CACCGCCACT GATCCCCGCT TCCCACGCTT GCGCACCTTC GACATCTACG AAGGCCACTC ATGGGCTTCG GGTCACGCGG GCTTTGGCGC AGGCAACAAC CACGAATCAT CATCAGAAGC AATGATGTTC AACAGCGCTG TGCTGTTGTG GGGTGCAAAC ACTGGCAATA CCCAATTGCG CGACCTTGGG ATCTTCATGT ATACCCATGA AACCCACGCG ATCGAGCAAT ATTGGTTCAA TGTTGATAAC GCTGTGTTTC CTGCTGGCTT CACCGCCAAC 
AACAACCACC CCGCCGTCGG GATGGTTTGG GGCGATGGTG GTAGCTACGC AACTTGGTTC AGCGCCAACC CCGAAATGAT CCACGGGATC AACTTCTTGC CCTTCCACGG TGGTTCGTTG TACTTAGGCC GTAATCCAGC CTATGTCAAC AAAAACTACA GCCAAATGCG CAACAACATC GGCGGCGCAG AACGTTATTG GTTAGATGTT ATTTGGCAAT TCCAAGCGTT TGGTGATGCA GCAACCGCCG CAACCAAGTT TGATACGGTC GCCTATACCC CAGAAGAAGG CGAAACCAAG GCTCATACCT ACCACTGGAT TCGCAACTTG AAGCAGTTGG GTTCGATCGA TACTTCGATC ACCGCCAACA CGCCAACCTA TGCAGTTTTC AACAAAAATG GTGTACGCAC CTATGTTGCT TGGAACCCAA CCGCGAACCC ATTGACCGTA ACCTTCTCGA ATGGCGTGGT GTTGAATAGC ATTCCTGCAC GCAGCATGGC TCGCAGCACG GGCACAACCC CACCACCAAC CGCGACTCCG GTCAACCCAA CCGCTACGCC AGTACTACCA ACTGCTACAC CAGTCAACCC AACCGCGACT CCGGTCAACC CAACCGCTAC GCCAGTACTA CCAACTGCTA CACCAGTTGC CGGCTGTTCA GCAGTGAGCT TGGATGCCAA TAGCTACTAT CGGGTAACGG CTCGTCACAG TGGCAAGGCC TTGGATGTTG CGGATGTTTC AAGCGCTGAT GGGGCCAATG TGCATCAATG GGGCTATGTT GGTGGCTTGA ACCAACAATG GCGCTTCGAA AGCGTTGGCA GCAACTACTT CAAAGTTACC GCTCGCCATA GTGGCAAGGC GCTTGATGTT GCTGGCGGTA CAACCGCAAC TGGCAACGGT GTCAACATTC ACCAATGGCC ATATGGCAAC ACCACCAACC AACAATGGTG TTTGCGCGAT GTTGGTAGCG GCTACTATGC GATTATTGCC CGCCACAGTG GCAAGGCACT TGATGTTGCT GATGCTTCAA CCGCCGATGG TGGCAATGTC CATCAATGGG ACTATGTCGG CGCAACCAAC CAACAATGGC AACTCACCAA GATCGATGCT GGTGGCAACA CCTTGCACGT CATCGATGGC GCTGCCCAAA ATGTAGCTGG TACGTTGAGC CTGAGCGCAG GCGCAGGGGC CAACACTGAC AGCATTCCAT CGGCTGGCGG AGCTAACCGC GATGGTACGC CAACCAATGC CTTGGTTTAT ACCATCTCAG GCTTGACCCG TACCTACAAC AGCCAAGCCA CCCAATTCAA GTTGTTCGTC GATTCCAACA CTGCCGTGGG CAACGGGGTT CAAGCCCGCA TCTCCTACGA CTGGACTGGC GACGGCAGCT ATGATCGCAC TGAAACCTAC AACTACTTCC CAACTGATCC GGTTGCAGGT TTCGAGCAAT ATAGCCAAAC CGCTGGCCTC AAGAGCAGCA GTGGTGCTTG GGCCAACCTG AGCAATGGTC GGGTACGGAT TGAAATTTGG AATGCAATTG GCAATGGTAC AGCGAGCGTT CGCACCAGCG CCACCAGTGA CCAAGGCCAA CAATCGACCA TCACCTTGCC ATTCAATTAA
|
Protein sequence | MNTQHLPNRS RRLPWLAGTL LTLLTSSLFF SPTQPVANAD QAGLGSYTTT LPAGAKVPDD FNGNPVSPKR TANVTGAMPT NDWWSSLGWQ RFPGNPYSEN MTALPLIVKA KATGLGVTFP TIPAISTGSP NYIGEFHYNA SEDLNLGLVG LNSPDAKVDG YSDWTVTAYW NGGGTLRATF GHGMPFVYLT KSGGDALISA AAPPSVWTGA GTNALGITVN GHHYGIFAPT GTTWSQSGNN FQSNLAGKDY YSVAVLPDNS VATFNFFKSR AYAFVTNTTA SWSYDQASAT LNTTFSATTV AKEGSNTNTV LGLYRHHAIN SSTALTNYSY TTARGQIRLR DGNSFTTAMR FNGVLPTLPD AGDYNRTTLN NHLNDVAFEA SHFGGADTYY TGKALLRLAN LIPIAEQLGN TNARNALITA VRNRLQEWFT ASANDTNGQF YYNSNWGTVI GYPASFGSDT ELNDHHFHYG YYIYAAAILA QYDPNWALDS NWGSMVKLLI NDAANISTAT DPRFPRLRTF DIYEGHSWAS GHAGFGAGNN HESSSEAMMF NSAVLLWGAN TGNTQLRDLG IFMYTHETHA IEQYWFNVDN AVFPAGFTAN NNHPAVGMVW GDGGSYATWF SANPEMIHGI NFLPFHGGSL YLGRNPAYVN KNYSQMRNNI GGAERYWLDV IWQFQAFGDA ATAATKFDTV AYTPEEGETK AHTYHWIRNL KQLGSIDTSI TANTPTYAVF NKNGVRTYVA WNPTANPLTV TFSNGVVLNS IPARSMARST GTTPPPTATP VNPTATPVLP TATPVNPTAT PVNPTATPVL PTATPVAGCS AVSLDANSYY RVTARHSGKA LDVADVSSAD GANVHQWGYV GGLNQQWRFE SVGSNYFKVT ARHSGKALDV AGGTTATGNG VNIHQWPYGN TTNQQWCLRD VGSGYYAIIA RHSGKALDVA DASTADGGNV HQWDYVGATN QQWQLTKIDA GGNTLHVIDG AAQNVAGTLS LSAGAGANTD SIPSAGGANR DGTPTNALVY TISGLTRTYN SQATQFKLFV DSNTAVGNGV QARISYDWTG DGSYDRTETY NYFPTDPVAG FEQYSQTAGL KSSSGAWANL SNGRVRIEIW NAIGNGTASV RTSATSDQGQ QSTITLPFN
|
| |
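The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. The following is a minimal sketch, not the database's own method: the helper names `clean`, `translate`, and `gc_fraction` are hypothetical, and the codon table is built by hand rather than taken from a library. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the two differ in which start codons are allowed). Only the first 30 and last 12 bases of the gene sequence are used here; the full sequence would be pasted in the same way.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (standard assignments,
# which table 11 shares for sense and stop codons).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group bases and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate complete codons in frame 1; unknown codons become 'X'."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, usable, 3)
    )


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 for an empty string)."""
    seq = clean(seq)
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First three 10-base groups and the last 12 bases of the gene sequence.
first_bases = "ATGAATACAC AACATTTACC TAATCGCTCA"
last_bases = "CC ATTCAATTAA"

print(translate(first_bases))  # MNTQHLPNRS
print(translate(last_bases))   # PFN*
```

The first ten residues match the start of the protein sequence (MNTQHLPNRS), and the final codons give PFN followed by a stop, matching its end. Calling `gc_fraction` on the full gene sequence would give a value to compare against the reported GC content.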