Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1852 |
Symbol | |
ID | 5733741 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 2153364 |
End bp | 2154827 |
Gene Length | 1464 bp |
Protein Length | 487 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 641278996 |
Product | peptidase M23B |
Protein accession | YP_001544623 |
Protein GI | 159898376 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATACGA AGCGAATCAT AGCTTGTTGG TTGGTCGCAA TTATTGTCAG TATTGGTTTT GGCTCTCCTC AATCAAGTCA AGCGCAGAAC GCTCAAGCGC AGCCTGCCCA GATTACTATC GATGGCTTAG CCGTGAAACT GCCATTTCAG CCAGGCGCAG AATGGCGAGT CACAGCAGGC TGGGAGGCGG CAAATCATCA ACAAGCTTGG AATTATTATG CAGTTGATGT CGTACCAGTG AATCAAGCGT GCTTAGGCCG ACCGATTTTA GCGATGGCAC ATGGCTTTAT TGAATCGAAT AATGGTCACG AACTTCAGAT TGATCATCGC GTGAATAATT ACCGTTCATT ATATTCACAC TTAGATACCG TAAGTCCTGG TTTAGCGGTT GGAACGGAAG TGTTTCAAGG TCAACAAATT GGCACCTGTG GTGGATATCC AAATTTTGCG CCACACCTAC ACTTTCAAAT CTTTCAAGGG GCACGGATGG CGAGTAGTGG CGTGATTCCG ATACCGATTG ATGGGATCAC TGATGCTAAT CGGCTGCGTT CAGGCCAACG TGGCTTGTAT TCAACTAATC AATCCCAACC ACAATTGCCG ACTGCTAATT ATCGACCATT AAGGCTATTT TGGCATGGCG AACGTGGAGA TAACTTTACA ACTGCCTCAA CACAAGCAGA AATCTCTGCG GCGGCAAATG CATACACTTC AATTCGAACC GAAGGGTTTG TATTTAGTCA TGCAGGATCA GGTAGAGTAC CACTCCAACA GTATTGGCAT AGTGGTAGGG GTGACAATAT CTTAGTCGCC ACTAATGCAG GAATCAATGA TGCACGTAAT GCTGGCTATA GTTTTGTGCG AACCGAAGGC TATATTTACG CTACACAGCA ACAACATACT GTACCATTGA AACTATTTTG GAGTGATACA CGACAGGATA ATTTTACAAC CTCGACTGCA GAAGGCGAAC AATCTGCATT AGCAGCTGGG TATAGTTTTG TGCGAATCGA AGGTTATGTT TTTGTTGCAC GCCCCCTACA GCTCTACTGG CATTCCGATC GTGGTGATAA TTTTCCAACT GCAACATCTG AGGGCATTGC TTCTGCTCAT ATATCAGCCT ATGGGTTTAT TCGAACTGAG GGCTATGTTT TTGCTGATCA TTTGCCTGAT ACAGTTCCAT TAAAATTATT TTGGAGTGAT GCCCGACAAG ATAACTACAT TACCGCAACA GCTGAAGGCG AACAATCTGC ACTAATGGCT GGATATGGCT TTGTGCGAAT TGAAGGTTAT GTTTTACCAA CCAATGGTGC GGATCTACGA CCTTTAGAGC TATTTTATCA TGATGGACGT GGAGATAATT TTACAACTGG TACAGTAGCA GGAGCTACCG ATGCGATTAA CCATAACTAC CAGTTGATTC GGCATGAAGG TTATATTTAT GGCACTTTGC CTGCTGGATA TTAG
|
Protein sequence | MNTKRIIACW LVAIIVSIGF GSPQSSQAQN AQAQPAQITI DGLAVKLPFQ PGAEWRVTAG WEAANHQQAW NYYAVDVVPV NQACLGRPIL AMAHGFIESN NGHELQIDHR VNNYRSLYSH LDTVSPGLAV GTEVFQGQQI GTCGGYPNFA PHLHFQIFQG ARMASSGVIP IPIDGITDAN RLRSGQRGLY STNQSQPQLP TANYRPLRLF WHGERGDNFT TASTQAEISA AANAYTSIRT EGFVFSHAGS GRVPLQQYWH SGRGDNILVA TNAGINDARN AGYSFVRTEG YIYATQQQHT VPLKLFWSDT RQDNFTTSTA EGEQSALAAG YSFVRIEGYV FVARPLQLYW HSDRGDNFPT ATSEGIASAH ISAYGFIRTE GYVFADHLPD TVPLKLFWSD ARQDNYITAT AEGEQSALMA GYGFVRIEGY VLPTNGADLR PLELFYHDGR GDNFTTGTVA GATDAINHNY QLIRHEGYIY GTLPAGY
|
| |