Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3592 |
Symbol | |
ID | 5735453 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 4521699 |
End bp | 4522736 |
Gene Length | 1038 bp |
Protein Length | 345 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641280741 |
Product | NLP/P60 protein |
Protein accession | YP_001546356 |
Protein GI | 159900109 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0791] Cell wall-associated hydrolases (invasion-associated proteins) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCATCCAC TTCAAACCAC GATTGATCAG TATGCAGCCC AATTTGATCG TCGGCGCATG ACGATTTTAG CTGCCCACGT CGAAGAAGAC AATGGAACGG TGCTCCGTGG TCGGGTGTTG GATCGCAGCC AGCATGCGGC GTTGTTGGCT TTATTGCCAA ATGTGCGCGA TGAATTGATT GTGCTGCGCG AACAGCCGGA GCATGGCTTT GCAACCGTTA CTTTTGGGGT ATTAGATGTG CGCTGGCAGG CTACTCGCGA TTCGGAATTG GTGACTGAGG CCAGTTTTGG CGAAGGCTTA GAGGTACTGG CCCACGAGCA TGAATGGTTG CAAGTAATTA CCAGCGATGG TTATTTGGGT TGGGTACGGC GCAATGGTGT TGTGCTGCAC GAGCAGCCAA GCACCTATCG AAGTGCTGCA ACCCACGTTG TGACCAGCCG TTGGTTGCCA TTGTGGGGTT TAGAGGGCGA CCAAATTGGC TTGTTGCCAT GGGGCATTCG GCTTGAAATT GACGAATTTC GTGATGGCAA AGCCTTTATG CGCTCACCTG CAGGCCTGCC AGGTTGGCTC GAAGCCGATT CATTAACGCC CGTTGAAGAT TTGTGTTCGG TTGATAGCGC TGGTATCGAG GATATGTTGC AAGCAATTCG CCAATTGATT GGTGTGCCCT ATTTGTGGGG TGGCACAACT AGTTTTGGTT TCGATTGCTC AGGTTTGGCT CAGGCTGCCT ATCGTTGGCT GGGTGTGCAA TTGCCACGCG ACGCTGATCA ACAATCGCAA ATTGGTCGTT TAATTAGCCG TGAGCAGGTT GCAGCTGGCG ATTTACTGTT TTGGGGCGTG CTACGCAACA TCGAAGATTA TCGCCACGAA CGGATTAACC ATGTGTCGAT TGCCCTTGAC AACGAGTGGA TGATTCATGC AAATCAACGC AATTGGAGCA TCTCGCTTGA TCGCATTGAT GAAGTGAATG CGCAGGTCTA TCAAGCACAA GGGAACCCAG GGTTGGTGGT AATTCGGCGG ATTCGCGAGG AGCAGTAG
|
Protein sequence | MHPLQTTIDQ YAAQFDRRRM TILAAHVEED NGTVLRGRVL DRSQHAALLA LLPNVRDELI VLREQPEHGF ATVTFGVLDV RWQATRDSEL VTEASFGEGL EVLAHEHEWL QVITSDGYLG WVRRNGVVLH EQPSTYRSAA THVVTSRWLP LWGLEGDQIG LLPWGIRLEI DEFRDGKAFM RSPAGLPGWL EADSLTPVED LCSVDSAGIE DMLQAIRQLI GVPYLWGGTT SFGFDCSGLA QAAYRWLGVQ LPRDADQQSQ IGRLISREQV AAGDLLFWGV LRNIEDYRHE RINHVSIALD NEWMIHANQR NWSISLDRID EVNAQVYQAQ GNPGLVVIRR IREEQ
|
| |