Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_5119 |
Symbol | |
ID | 5737077 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009973 |
Strand | + |
Start bp | 159319 |
End bp | 160482 |
Gene Length | 1164 bp |
Protein Length | 387 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 641282284 |
Product | NLP/P60 protein |
Protein accession | YP_001547875 |
Protein GI | 159901629 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0791] Cell wall-associated hydrolases (invasion-associated proteins) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.553866 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCACCC GCTTCCGACT CGTCGGACTT GTCATAATCG TTTTCTTGAG TGGTTGTGGA GGTCGCCCGC TTCCGGCGGC TTCCACCCAT CCCATCACGG GCAAGCCGCA GTGGTGGTGT CCAACTCCGG TCATGGCAGG CGATCCGACC GCGATTGCCG CCATGCCGAC GGTGACACCC TACTATCACC GTGACCAGTT CATGCTTGGC CAAGACGTGC TCAGCAATGG GTTGCGCGTG ACCGTGCATG GCATCACCAG CGGCGAGGAA GCGCCTGAAG CCATTGGCGG GGGCCAGGTG CAGTGGGTCG ATCTCGAACT CACCAGCGCG GTGTCGTTGC CGCTTGATCT GGCGGCGCAG GTCGTCATTC GTGAGGTCGA GCAGGAAGCA GGACAAGCGG CACGCGGCTG GTGGACAACC GACACCGCGA CGCTGGCCAC GACCGCGATT ACCCTGCCCA CGCGGCTGGA AGCAGGCATT TCATGGCGCG GGTCGATTCC GATTCGTACC CCGATTGGTA CGCCCGTGTT TGTCCTGATC TACCGCACGC CTGCCGATGC GCTGCTGCGG GAGCAGCCGA CCGATGGCGT AATCGTGGTG CAGAATCGCC GCGACCCGAC CTGTGCGGGC AATATCGCGC GGGTTCCCTT CCCGACCATG CCCGCAGGCG GTGGATCAGG TGCGCCGATT AACGGGACAC CCATCGCCGT TCCACCAGGC ACGAATCCAC TCGTGGCTTA CGCGGTCAGC AAGCTGGCAT GGCCCTATGT CTGGGGTGGC GAAAGCGAGG CCGAAGGCGG CTTTGACTGT TCAGGGCTGA TGTACGCCGC CTATGGCAGT GTCGGCCTGA CGATTCCGCG CACCTCACAG GCGATGTGGC AGAGCGCCCA GCTGCAACGG ATTGGCATCA GCGAGCTGCG ACCGGGTGAT CTGGTCTTTT TCCACACCGA TAGCAGCCGC TTTAGCAGCC CGCCAACCCA TGTGGGGATG TATATCGGTG ATCTGAATGG CAATGGCACA CCCGATTTAG TCCATGCCCT CAGTCCGGCG TGGGGTATTC GGATTGAGGA TAACTGGCTC ACCAAGCCGT GGCTGGTGGC CCCATGGCCC GATGGCACGC CCCGTTTGTG GGGGGCTGGC TACTTTGTGA ATCCGTATCG GTAG
|
Protein sequence | MSTRFRLVGL VIIVFLSGCG GRPLPAASTH PITGKPQWWC PTPVMAGDPT AIAAMPTVTP YYHRDQFMLG QDVLSNGLRV TVHGITSGEE APEAIGGGQV QWVDLELTSA VSLPLDLAAQ VVIREVEQEA GQAARGWWTT DTATLATTAI TLPTRLEAGI SWRGSIPIRT PIGTPVFVLI YRTPADALLR EQPTDGVIVV QNRRDPTCAG NIARVPFPTM PAGGGSGAPI NGTPIAVPPG TNPLVAYAVS KLAWPYVWGG ESEAEGGFDC SGLMYAAYGS VGLTIPRTSQ AMWQSAQLQR IGISELRPGD LVFFHTDSSR FSSPPTHVGM YIGDLNGNGT PDLVHALSPA WGIRIEDNWL TKPWLVAPWP DGTPRLWGAG YFVNPYR
|
| |