Gene Haur_1428 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1428 
Symbol 
ID5733336 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp1649056 
End bp1652475 
Gene Length3420 bp 
Protein Length1139 aa 
Translation table11 
GC content53% 
IMG OID641278566 
ProductRicin B lectin 
Protein accessionYP_001544200 
Protein GI159897953 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG5498] Predicted glycosyl hydrolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.737248 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATACAC AACATTTACC TAATCGCTCA AGACGATTAC CATGGCTTGC AGGAACCCTC 
CTAACATTAT TAACTTCCAG TTTGTTCTTT TCTCCAACCC AACCTGTTGC CAACGCTGAT
CAAGCTGGGC TGGGCAGTTA TACTACCACC TTGCCAGCAG GCGCAAAAGT TCCCGATGAT
TTTAATGGAA ATCCTGTTTC TCCCAAACGC ACTGCCAATG TCACTGGCGC TATGCCCACC
AACGATTGGT GGAGTTCGCT CGGCTGGCAG CGCTTCCCTG GTAATCCCTA TTCGGAAAAT
ATGACGGCCT TGCCGTTGAT TGTCAAAGCC AAAGCGACTG GCTTGGGCGT GACCTTCCCA
ACAATTCCGG CAATTTCAAC TGGCTCGCCC AACTACATTG GTGAGTTTCA CTATAACGCT
TCCGAAGACC TGAACCTTGG TTTGGTGGGC TTGAATTCGC CCGATGCCAA AGTTGACGGC
TACTCGGATT GGACAGTCAC GGCCTATTGG AATGGTGGCG GCACGCTCCG CGCAACCTTT
GGCCATGGCA TGCCTTTCGT CTATTTGACC AAGAGTGGCG GCGATGCTTT GATTAGCGCG
GCGGCTCCGC CAAGCGTTTG GACTGGCGCT GGCACGAATG CCCTCGGCAT CACGGTCAAC
GGCCATCATT ACGGTATTTT TGCCCCAACT GGCACAACCT GGAGCCAATC GGGGAACAAT
TTCCAATCAA ATTTGGCTGG CAAAGATTAT TATTCGGTGG CAGTATTGCC TGATAATAGC
GTGGCAACCT TTAATTTCTT CAAATCCCGC GCCTATGCCT TTGTCACCAA CACCACCGCC
AGCTGGAGCT ACGACCAAGC TTCTGCTACC TTGAATACAA CCTTCAGCGC AACCACGGTT
GCCAAAGAAG GCAGCAATAC CAACACTGTT TTGGGCTTGT ATCGCCACCA CGCGATCAAC
TCATCGACGG CGTTGACCAA CTACAGCTAC ACCACCGCCC GTGGTCAAAT TCGCTTGCGC
GATGGTAATT CGTTTACCAC GGCCATGCGC TTCAATGGTG TGCTGCCAAC CCTGCCAGAT
GCTGGCGATT ACAACCGCAC CACCCTCAAC AATCACTTGA ATGATGTTGC CTTCGAAGCT
AGCCACTTTG GCGGTGCTGA TACCTACTAC ACTGGTAAGG CGCTCTTACG GTTGGCCAAC
TTGATTCCGA TTGCTGAACA ACTGGGCAAC ACCAACGCCC GCAATGCGTT GATTACCGCT
GTGCGCAATC GTTTGCAAGA ATGGTTCACC GCTAGTGCCA ACGATACCAA TGGCCAGTTC
TACTACAATA GCAATTGGGG TACGGTCATC GGCTATCCAG CCTCGTTCGG TTCGGATACC
GAATTGAACG ACCACCACTT CCACTATGGC TATTACATCT ACGCTGCGGC AATCTTGGCC
CAATATGATC CAAACTGGGC GCTCGATAGC AACTGGGGTT CGATGGTCAA GCTGTTGATC
AACGATGCTG CCAACATCAG CACCGCCACT GATCCCCGCT TCCCACGCTT GCGCACCTTC
GACATCTACG AAGGCCACTC ATGGGCTTCG GGTCACGCGG GCTTTGGCGC AGGCAACAAC
CACGAATCAT CATCAGAAGC AATGATGTTC AACAGCGCTG TGCTGTTGTG GGGTGCAAAC
ACTGGCAATA CCCAATTGCG CGACCTTGGG ATCTTCATGT ATACCCATGA AACCCACGCG
ATCGAGCAAT ATTGGTTCAA TGTTGATAAC GCTGTGTTTC CTGCTGGCTT CACCGCCAAC
AACAACCACC CCGCCGTCGG GATGGTTTGG GGCGATGGTG GTAGCTACGC AACTTGGTTC
AGCGCCAACC CCGAAATGAT CCACGGGATC AACTTCTTGC CCTTCCACGG TGGTTCGTTG
TACTTAGGCC GTAATCCAGC CTATGTCAAC AAAAACTACA GCCAAATGCG CAACAACATC
GGCGGCGCAG AACGTTATTG GTTAGATGTT ATTTGGCAAT TCCAAGCGTT TGGTGATGCA
GCAACCGCCG CAACCAAGTT TGATACGGTC GCCTATACCC CAGAAGAAGG CGAAACCAAG
GCTCATACCT ACCACTGGAT TCGCAACTTG AAGCAGTTGG GTTCGATCGA TACTTCGATC
ACCGCCAACA CGCCAACCTA TGCAGTTTTC AACAAAAATG GTGTACGCAC CTATGTTGCT
TGGAACCCAA CCGCGAACCC ATTGACCGTA ACCTTCTCGA ATGGCGTGGT GTTGAATAGC
ATTCCTGCAC GCAGCATGGC TCGCAGCACG GGCACAACCC CACCACCAAC CGCGACTCCG
GTCAACCCAA CCGCTACGCC AGTACTACCA ACTGCTACAC CAGTCAACCC AACCGCGACT
CCGGTCAACC CAACCGCTAC GCCAGTACTA CCAACTGCTA CACCAGTTGC CGGCTGTTCA
GCAGTGAGCT TGGATGCCAA TAGCTACTAT CGGGTAACGG CTCGTCACAG TGGCAAGGCC
TTGGATGTTG CGGATGTTTC AAGCGCTGAT GGGGCCAATG TGCATCAATG GGGCTATGTT
GGTGGCTTGA ACCAACAATG GCGCTTCGAA AGCGTTGGCA GCAACTACTT CAAAGTTACC
GCTCGCCATA GTGGCAAGGC GCTTGATGTT GCTGGCGGTA CAACCGCAAC TGGCAACGGT
GTCAACATTC ACCAATGGCC ATATGGCAAC ACCACCAACC AACAATGGTG TTTGCGCGAT
GTTGGTAGCG GCTACTATGC GATTATTGCC CGCCACAGTG GCAAGGCACT TGATGTTGCT
GATGCTTCAA CCGCCGATGG TGGCAATGTC CATCAATGGG ACTATGTCGG CGCAACCAAC
CAACAATGGC AACTCACCAA GATCGATGCT GGTGGCAACA CCTTGCACGT CATCGATGGC
GCTGCCCAAA ATGTAGCTGG TACGTTGAGC CTGAGCGCAG GCGCAGGGGC CAACACTGAC
AGCATTCCAT CGGCTGGCGG AGCTAACCGC GATGGTACGC CAACCAATGC CTTGGTTTAT
ACCATCTCAG GCTTGACCCG TACCTACAAC AGCCAAGCCA CCCAATTCAA GTTGTTCGTC
GATTCCAACA CTGCCGTGGG CAACGGGGTT CAAGCCCGCA TCTCCTACGA CTGGACTGGC
GACGGCAGCT ATGATCGCAC TGAAACCTAC AACTACTTCC CAACTGATCC GGTTGCAGGT
TTCGAGCAAT ATAGCCAAAC CGCTGGCCTC AAGAGCAGCA GTGGTGCTTG GGCCAACCTG
AGCAATGGTC GGGTACGGAT TGAAATTTGG AATGCAATTG GCAATGGTAC AGCGAGCGTT
CGCACCAGCG CCACCAGTGA CCAAGGCCAA CAATCGACCA TCACCTTGCC ATTCAATTAA
 
Protein sequence
MNTQHLPNRS RRLPWLAGTL LTLLTSSLFF SPTQPVANAD QAGLGSYTTT LPAGAKVPDD 
FNGNPVSPKR TANVTGAMPT NDWWSSLGWQ RFPGNPYSEN MTALPLIVKA KATGLGVTFP
TIPAISTGSP NYIGEFHYNA SEDLNLGLVG LNSPDAKVDG YSDWTVTAYW NGGGTLRATF
GHGMPFVYLT KSGGDALISA AAPPSVWTGA GTNALGITVN GHHYGIFAPT GTTWSQSGNN
FQSNLAGKDY YSVAVLPDNS VATFNFFKSR AYAFVTNTTA SWSYDQASAT LNTTFSATTV
AKEGSNTNTV LGLYRHHAIN SSTALTNYSY TTARGQIRLR DGNSFTTAMR FNGVLPTLPD
AGDYNRTTLN NHLNDVAFEA SHFGGADTYY TGKALLRLAN LIPIAEQLGN TNARNALITA
VRNRLQEWFT ASANDTNGQF YYNSNWGTVI GYPASFGSDT ELNDHHFHYG YYIYAAAILA
QYDPNWALDS NWGSMVKLLI NDAANISTAT DPRFPRLRTF DIYEGHSWAS GHAGFGAGNN
HESSSEAMMF NSAVLLWGAN TGNTQLRDLG IFMYTHETHA IEQYWFNVDN AVFPAGFTAN
NNHPAVGMVW GDGGSYATWF SANPEMIHGI NFLPFHGGSL YLGRNPAYVN KNYSQMRNNI
GGAERYWLDV IWQFQAFGDA ATAATKFDTV AYTPEEGETK AHTYHWIRNL KQLGSIDTSI
TANTPTYAVF NKNGVRTYVA WNPTANPLTV TFSNGVVLNS IPARSMARST GTTPPPTATP
VNPTATPVLP TATPVNPTAT PVNPTATPVL PTATPVAGCS AVSLDANSYY RVTARHSGKA
LDVADVSSAD GANVHQWGYV GGLNQQWRFE SVGSNYFKVT ARHSGKALDV AGGTTATGNG
VNIHQWPYGN TTNQQWCLRD VGSGYYAIIA RHSGKALDVA DASTADGGNV HQWDYVGATN
QQWQLTKIDA GGNTLHVIDG AAQNVAGTLS LSAGAGANTD SIPSAGGANR DGTPTNALVY
TISGLTRTYN SQATQFKLFV DSNTAVGNGV QARISYDWTG DGSYDRTETY NYFPTDPVAG
FEQYSQTAGL KSSSGAWANL SNGRVRIEIW NAIGNGTASV RTSATSDQGQ QSTITLPFN