Gene Haur_0941 details


Gene Information

Locus tag: Haur_0941
Symbol:
ID: 5732827
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 1077755
End bp: 1078834
Gene Length: 1080 bp
Protein Length: 359 aa
Translation table: 11
GC content: 51%
IMG OID: 641278073
Product: peptidoglycan-binding LysM
Protein accession: YP_001543717
Protein GI: 159897470
COG category: [R] General function prediction only
COG ID: [COG3889] Predicted solute binding protein
TIGRFAM ID:
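
The coordinate and length fields in this record are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the start and end positions are 1-based inclusive coordinates and that the coding sequence ends in a single stop codon that is not counted in the protein length. Both assumptions are consistent with the values listed here but are not stated by the record itself, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon is excluded."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1077755, 1078834)
print(length, protein_length(length))  # 1080 359
```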


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.984882
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCATCAA CCAAGCAGCA GCTGCAGCAG GCGATTAATC AGGCCTTGCC TGAACAACAG 
CTTAGCTATA GTCAACAACA GGTCATCAAT CAAGCGCTTG AACAACAGCT GGAGCAAGTT
CGCCAGCTTC AGCCAACCCG TGATCGGCGC AAAGCCCCTA AAACCAAACG CCGAATTTGG
CAATGGCAAT GGAGTCGGGC GCAAATTTTT GCATTAATTG TGGCAACTGC TCTTTTATTT
TTTGTTACCT ACGCCACCTT AGCCACATCG GCTCCTAGCG TGCCCCATAA TGCCAGTGTG
CAAATTTTCG ATGGCACAGC CGTGATCAAC AACCTTCGTA CTGGCGCTGA ACGCCGCTTA
AACGCTGGCG ATGTAACCAT TTTAGAGCCA GGCGATACAA TTCAAACTGA AACCGGTCGG
GCATTAATTA CTTATTTCGA TGGCCAAACT ACTACCTTGC AAGCCAACGC TCGCCTAACC
CTTGAAACCA TGGATAGCCA AAATGGCGGC CAACAAATTC GGCTTAAAGT TTGGTTTGGC
CGCACGCTCA ATGGAGTCAA ACGGTTGCTT GGGCCAAATG ATCAGTTCGA AGTTGAAACG
CCATCCTCAG CAGCTTCAGT CCGTGGCACA GAGTTCACTG TCGAATCGCG CAATAATACT
ACCACCTTTT ATGCCACCGA CAAGGGCAAT GTGCAAGTGG CGATGGATGG TCAAACGGTG
TTTGTGCGGG CTGGCGAACA ATTATTGGCC GAGCAATCCA AACCATTGGT GGTTCAGCCC
CAAATCTCGC CAACCAATAC CCCAACCAAC ACGCCAACAC CAACGGCGAC GGCCACGCCA
ACCAATACCC CAACCAACAC GCCAACGCCA ACGGCGACGA CCACGGCCAC GCCAACTGCA
ACGCCAAGCG CAACGCCAAC GGCCACACCA CAGCTTTACA TTACCCAAGC TGGCGATACA
ATCAATGGCA TCGCCCAACG CTTTGGAATC ACCCCTGATG CTTTGGTCAA CGCCAACCCG
ATCATTCGTG ATCGTGATGA GATTCCGATT GGTTTAACTT TGATTATTCC GCAGCCATAG
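
The GC content field of a record like this could be checked against the gene sequence with a short script. This is a minimal sketch: `gc_content` is a hypothetical helper, and it assumes the sequence is pasted in the space-separated, ten-base-block layout used above, so all whitespace is stripped before counting.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide string, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First line of the gene sequence only; paste the full sequence to
# compare the result with the reported GC content.
first_line = "ATGGCATCAA CCAAGCAGCA GCTGCAGCAG GCGATTAATC AGGCCTTGCC TGAACAACAG"
print(round(gc_content(first_line), 1))
```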
 
Protein sequence
MASTKQQLQQ AINQALPEQQ LSYSQQQVIN QALEQQLEQV RQLQPTRDRR KAPKTKRRIW 
QWQWSRAQIF ALIVATALLF FVTYATLATS APSVPHNASV QIFDGTAVIN NLRTGAERRL
NAGDVTILEP GDTIQTETGR ALITYFDGQT TTLQANARLT LETMDSQNGG QQIRLKVWFG
RTLNGVKRLL GPNDQFEVET PSSAASVRGT EFTVESRNNT TTFYATDKGN VQVAMDGQTV
FVRAGEQLLA EQSKPLVVQP QISPTNTPTN TPTPTATATP TNTPTNTPTP TATTTATPTA
TPSATPTATP QLYITQAGDT INGIAQRFGI TPDALVNANP IIRDRDEIPI GLTLIIPQP
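
The protein sequence corresponds to a translation of the gene sequence under translation table 11. A minimal sketch of that translation follows. It relies on the fact that table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in the permitted start codons. The `translate` helper is hypothetical, does not handle alternative start codons or ambiguous bases (an unknown codon raises `KeyError`), and stops at the first stop codon.

```python
from itertools import product

# Standard codon assignments in TCAG order (shared by translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues.
print(translate("ATGGCATCAA CCAAGCAGCA GCTGCAGCAG"))  # MASTKQQLQQ
```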