Gene Haur_4574 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4574 
Symbol 
ID5736419 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5852324 
End bp5854342 
Gene Length2019 bp 
Protein Length672 aa 
Translation table11 
GC content51% 
IMG OID641281736 
ProductRicin B lectin 
Protein accessionYP_001547333 
Protein GI159901086 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCGGCT TCAATCATCG TCATTGGCGT TGGGGGCTGG TGCTAAGTTG CCTAAGTAGT 
CTGTTGGTTG GGGCGATGTT GACCCAGCCA ACCCGTGCTG CCGAGCCAGT TGCGGTTAAT
GCTGAGGGCT ATACCACCCG CAGCTATGGT AGCGTCACGT TTGAAGGCAT TAATTACGCA
GTGCAAAGTA ATGTTGCTAA CGAATATGTT CCGAGTACCA CCCATTCGTA TAGCGATCTG
GGCAGCTATT ATTTGGTCAA TAGTGATTAT AGTTTGCCAA ATGTGCCCAA TATTACCAGC
GGCGTATTGT GGTCAACTGG CAACAAAAAT CGCGGTTGGG CGATCAACAG CGAGTACGAT
ATTCGGGCTT TGGTGCAGGC CAATGGCGGT TTGTATGCAC CGTATCAGAC CATGCCTGGC
TATCAATTGG GGCCATGGAA TGCTAGCACA CCCTGTTGTG GCTGGACGTT GCAGCGCAAT
ACTACTGGTT TTTATATTCA GGCCGATGGC AAGGTGCGTG TGCCCAAAAC GCCTGCTGCT
GCCCAACAAA CTTGGGATGC CAACCAAAGC CTGACTGCCA TCAACACCAC CAACGATATT
GTGGCTGTTA CCGATGTGAT GTTTCCTGGC GACGAGGATT ATTACGCTGG CAACACCTAT
TTGCCGCGTT CAGCGGGCGT ACTCACTGCC AAATATAAAC ATTACGACAA TCGCAATACT
CACATTTATT GGGGCTTGAA GGGCCAGCAT GTGCGTGATG TGGAAGATTG GGAAGCCGAT
GCGCCAGGCG GCAGCAAACG TAAAATCTAT ACTGGCGGTT TCAAAATCGA CGAAAGTGAT
AATGGTCAAG TCTGGGCTGG CATTTCGCAT GGCAACGAAT TTGTTGATCT TAATTTGCAG
CCGAGCGTAA CCGCCCAACA ACTGTACAAA GTTGAGTTGT GGATTCAACG TCCAACAGGT
ATGGAATATT GGGGTGGTTT GAGCTACCAG CAGGGCGCTG ATGGCAAGTG GCGAGCCTTT
GGCGATGGTA GCCATGTGAC TAATTGGGGC AACGGCACGT TTGGCTTGGT AGCAACCGCC
TATCGCAATC GCAACGAACG CTTGTTGCTG GTTTATCGCG CCTTGCCAGG TGGTGATAAT
CCGCCAACTC CCACCCCGCC ACCACCGCCA CCAACCAATG CTGCATCCTT TAATCTGATC
AATCGCAGTA GTGGGCTATG TTTGGATGTT GCTGGGGCGA ATGCCGCCGA TGGTACCAAA
GTGCAGCAAT GGACCTGTAA TAACGCGACG GCGCAACAGT GGGAACTACG CTTGGCCGAA
AGTGGCTATT ATCAATTAGT TTCAAAAGCA ACTGGCAAAT GTTTAGATCT GGCGGCGTGG
AGCACTACCG ATGGTGGGAT TGCCCATCAG TGGTCGTGCG GCAACAATCA ATCGAATCAG
CAGTGGAATT TCCAAACCGT CAGCGATGGT TGGCTGCGAA TTGCCAACCG CAACAGTAGC
AAATATCTCT CGATCGTCTA TGGTTCGGTG GATGCTGGGG CTGCGACTCA CCAATGGCCT
TGGCTGGGCA ATCCCGACCA ACAATGGCGG ATTCAGCCTG TGGGTACACT GCAAATCGCC
AACAAAAATA GCAATAAATG TATTGATGTT GCCAATAATA ATAGTGCTGA TGGCACGAAT
ATTTTGCAAT GGCCTTGCTA CGCTGGCCTG GCCCAGCAAT GGCAATTTCA ACATAGCGAT
AATGGCTATT ACAAGTTGCG CCACCCCAGC AGTGGCAAAA TGCTCTCGGT TTCGGGCGAT
TCAAATGCCG ATGGGGCCAA CATTCACCTC TGGACAGCGG TGAGTAACCC GAGCCAACAA
TGGCGGCTTG AACTGCTCGA CGATGGCTTT ATGCGCTTTG TCAATCGGGC AACCGGCAAA
GTGGTTGATG TGGCTGGTGG CAGTAGCGCC GATAACGCCA ACATTCAGCA ATGGACGTGG
AATAGTAGCA ACGCCCAACG CTTTAAACTG ACGAATTAG
 
Protein sequence
MIGFNHRHWR WGLVLSCLSS LLVGAMLTQP TRAAEPVAVN AEGYTTRSYG SVTFEGINYA 
VQSNVANEYV PSTTHSYSDL GSYYLVNSDY SLPNVPNITS GVLWSTGNKN RGWAINSEYD
IRALVQANGG LYAPYQTMPG YQLGPWNAST PCCGWTLQRN TTGFYIQADG KVRVPKTPAA
AQQTWDANQS LTAINTTNDI VAVTDVMFPG DEDYYAGNTY LPRSAGVLTA KYKHYDNRNT
HIYWGLKGQH VRDVEDWEAD APGGSKRKIY TGGFKIDESD NGQVWAGISH GNEFVDLNLQ
PSVTAQQLYK VELWIQRPTG MEYWGGLSYQ QGADGKWRAF GDGSHVTNWG NGTFGLVATA
YRNRNERLLL VYRALPGGDN PPTPTPPPPP PTNAASFNLI NRSSGLCLDV AGANAADGTK
VQQWTCNNAT AQQWELRLAE SGYYQLVSKA TGKCLDLAAW STTDGGIAHQ WSCGNNQSNQ
QWNFQTVSDG WLRIANRNSS KYLSIVYGSV DAGAATHQWP WLGNPDQQWR IQPVGTLQIA
NKNSNKCIDV ANNNSADGTN ILQWPCYAGL AQQWQFQHSD NGYYKLRHPS SGKMLSVSGD
SNADGANIHL WTAVSNPSQQ WRLELLDDGF MRFVNRATGK VVDVAGGSSA DNANIQQWTW
NSSNAQRFKL TN