Gene Information |
Locus tag | Haur_0941 |
Symbol | |
ID | 5732827 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 1077755 |
End bp | 1078834 |
Gene Length | 1080 bp |
Protein Length | 359 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641278073 |
Product | peptidoglycan-binding LysM |
Protein accession | YP_001543717 |
Protein GI | 159897470 |
COG category | [R] General function prediction only |
COG ID | [COG3889] Predicted solute binding protein |
TIGRFAM ID | |
|
|
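The coordinate and length fields above are mutually consistent, and the check can be sketched in a few lines. This is a minimal sketch, not part of the record itself. It assumes that Start bp and End bp are 1-based inclusive positions and that the reported gene length includes the terminal stop codon (the gene sequence in this record ends in TAG). The function names `gene_length` and `protein_length` are hypothetical helpers introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming its length includes the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return cds_length_bp // 3 - 1


# Values taken from the Gene Information fields of this record.
length_bp = gene_length(1077755, 1078834)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints "1080 359"
```

Both results agree with the Gene Length (1080 bp) and Protein Length (359 aa) fields. The strand ("-") does not affect the length arithmetic, only the orientation in which the sequence is read.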
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.984882 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCATCAA CCAAGCAGCA GCTGCAGCAG GCGATTAATC AGGCCTTGCC TGAACAACAG CTTAGCTATA GTCAACAACA GGTCATCAAT CAAGCGCTTG AACAACAGCT GGAGCAAGTT CGCCAGCTTC AGCCAACCCG TGATCGGCGC AAAGCCCCTA AAACCAAACG CCGAATTTGG CAATGGCAAT GGAGTCGGGC GCAAATTTTT GCATTAATTG TGGCAACTGC TCTTTTATTT TTTGTTACCT ACGCCACCTT AGCCACATCG GCTCCTAGCG TGCCCCATAA TGCCAGTGTG CAAATTTTCG ATGGCACAGC CGTGATCAAC AACCTTCGTA CTGGCGCTGA ACGCCGCTTA AACGCTGGCG ATGTAACCAT TTTAGAGCCA GGCGATACAA TTCAAACTGA AACCGGTCGG GCATTAATTA CTTATTTCGA TGGCCAAACT ACTACCTTGC AAGCCAACGC TCGCCTAACC CTTGAAACCA TGGATAGCCA AAATGGCGGC CAACAAATTC GGCTTAAAGT TTGGTTTGGC CGCACGCTCA ATGGAGTCAA ACGGTTGCTT GGGCCAAATG ATCAGTTCGA AGTTGAAACG CCATCCTCAG CAGCTTCAGT CCGTGGCACA GAGTTCACTG TCGAATCGCG CAATAATACT ACCACCTTTT ATGCCACCGA CAAGGGCAAT GTGCAAGTGG CGATGGATGG TCAAACGGTG TTTGTGCGGG CTGGCGAACA ATTATTGGCC GAGCAATCCA AACCATTGGT GGTTCAGCCC CAAATCTCGC CAACCAATAC CCCAACCAAC ACGCCAACAC CAACGGCGAC GGCCACGCCA ACCAATACCC CAACCAACAC GCCAACGCCA ACGGCGACGA CCACGGCCAC GCCAACTGCA ACGCCAAGCG CAACGCCAAC GGCCACACCA CAGCTTTACA TTACCCAAGC TGGCGATACA ATCAATGGCA TCGCCCAACG CTTTGGAATC ACCCCTGATG CTTTGGTCAA CGCCAACCCG ATCATTCGTG ATCGTGATGA GATTCCGATT GGTTTAACTT TGATTATTCC GCAGCCATAG
|
Protein sequence | MASTKQQLQQ AINQALPEQQ LSYSQQQVIN QALEQQLEQV RQLQPTRDRR KAPKTKRRIW QWQWSRAQIF ALIVATALLF FVTYATLATS APSVPHNASV QIFDGTAVIN NLRTGAERRL NAGDVTILEP GDTIQTETGR ALITYFDGQT TTLQANARLT LETMDSQNGG QQIRLKVWFG RTLNGVKRLL GPNDQFEVET PSSAASVRGT EFTVESRNNT TTFYATDKGN VQVAMDGQTV FVRAGEQLLA EQSKPLVVQP QISPTNTPTN TPTPTATATP TNTPTNTPTP TATTTATPTA TPSATPTATP QLYITQAGDT INGIAQRFGI TPDALVNANP IIRDRDEIPI GLTLIIPQP
|
| |