Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0419 |
Symbol | |
ID | 5732318 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 488839 |
End bp | 491997 |
Gene Length | 3159 bp |
Protein Length | 1052 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641277545 |
Product | endonuclease/exonuclease/phosphatase |
Protein accession | YP_001543198 |
Protein GI | 159896951 |
COG category | [R] General function prediction only |
COG ID | [COG2374] Predicted extracellular nuclease |
TIGRFAM ID | [TIGR01451] conserved repeat domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0131268 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCAATTTA AGGGTTTACG ACTCCTAACA GCGTTAACGA TGCTGTTCAC GTTGATTGGT ACTTCGTTGT CCGCAACATA CACCACTCCA ACCAGTGCTG TATCAACCTC CCTTGTCATT AGTCAATTCC AAACCGCTGG CGGTACTGCC GATGATGAAT TTATCGAAAT TCACAATATC AGTGCCAATC CAGTCGATTT GAATGGGCAT CGCGTCGTTT ATCGTGCTGC TGCTGGGGTA ACTGATGTTT CAATTGCCAG TTGGACAAGC TCGGTTGTAA TTCCTGCTGG TGGTTTCTAT CTCTTAGGCC GTGCGAACTC ATATGATGGT ACGGTTACCG CTGATACAAC CTTTGGTAGC GGTATTTCTG GTACGGGTGG GGGCTTCGCT ATCCGTAACG GTGATTTGAA TACTGGCACA ATCATCGATT CAGTTGGCTT TGGTAGTGCG ACGAATGCCT TTGTTGAAGG CACGGTTGTA GCTGCTCCGA GCGCCAACAA TAGTGCTCAA CGCAATAATT CAAGTTGTAC TGATACCGAC AATAATGCTG CCGATTTCAG TTTGTTGACT CCTTCAGCTC CTCGCAATAG CAGCAGTCCA GTTGCAACCT GTGGTGGTTT GCCAACCGAT AATCCACCAA CTGTTGCCAG CACTGTGCCA GCCAATAATG CCAACAATGT GGCCTTGAAT GCGAACGTCA CCCTTACGTT CAGCGAGCCA GTCACCACGA CGGGCACTTG GTACACCATT AGCTGTACCA GTGGCGCTCG CACCGCCAGT GTTACTGGTG GCCCAACCAG CTATGTGCTT GATCCCAGCA GCGATTTTGC TGCCAGCGAT GCTTGTACGG TAACGGTTGT TGCCAGCCAA GTAACCGACC GCGATGGTAG CGTTGATGCT ATGGCCCAAG ACTATAGCTT CAACTTCAAT GTTTCGAGCG GTGTCGCCTG TTCAGGTGGC ACGGTTACCC CAATCGGCCA AGTGCAAGGC ACTGGCGAAA CCAGCCCAGC AAACGGTACG GTTGTTACCG TTCAAGGGAC GGTGGTAGCC GATTTTGAAG GTGCTCAGCC TGCTTTGCGT GGCTTCTACC TGCAAGATGC TGGCGATAGC AACGCTGCAA CTTCCGATGG TATTTTTGTA TTCAACGGCG GCGCTGATCA AGTTGCGCTT GGTCAAGTGG TGCGCGTAAC TGGTACGGCT GGCGAAAACC AAGGCCAAAC CCAATTGAGC GGAAGCTTGA CGGTTGAGCT TTGTGGCACA ACCAACACGG TTACCCCAAC CCAAGTTACC TTGCCATTTG CCTCAACCAC CGATGCCGAG CGCTTTGAAG GTATGTTGGT TCAATTCAAC CAAAAGCTGG TTGTTTCAGA GCACTACTTG TTGGCTCGCT TCGGCCAAGT AACCTTATCG CTCAACGAAC GCTTGGCACA ACCAACCAAC GTGGTTTCAC CAGGCGCTCC AGCCTTGGCA TTGCAAGCAC AAAATAACTT GCACAAAATC ATCCTCGATG ATCCATTCAA CAACCAAAAT GCCGACCCAA TTCTGTTTGG TCGTGGTGGC ACTGGTTTGA GCGCCGCCAA TACCCTTCGC GCTAGCGATA GCATCACTGG GTTGCAAGGT GTGATGACCT ACACGTGGGG TGGTAATAGC GCTAGCCCCA ACAGCTACCG CGTGCGCGTA ACCCAATCGC CAAACTTCGT AGCCGAAAAC CAACGCCCAA CCTCGCTTGA GCTTGATGGC TCATTACGGG TTGCGGGAAT CAACGTGTTG AACTATTTCA ACACCTTTGG CAATGGCAAC TGTACTGGCG GGGTTGGTGG TGCTGCTACC GATTGTCGTG GTGCTGAAAA TACCACCGAA TTCCAACGCC AAAAAGATAA AACCATTGCT GCGATCATCA TGACTCAAGC TGATGTGATT GGCTTGATGG AAATTGAAAA TGATGGCTTC GGCAGCAATA GCGCAATCCA AGATTTGGTT AATGGCTTGA ACGCAGCCAC TGCACCTGGT ACCTATGCCT TGGTGGATGT CGATACACGC ACAGGCCAAA CCAATCCATT AGGTACTGAT GCAATTAAGG TTGCCTTACT CTACAAGCCT GGCAAGGTTA GCTTGGCTGG TACAACCGCC GTACTCAATA CTGGGGCATT CGGCGATTTC ACCACCAGCA GCGGCACAAC TGGCCGTAAT CGCCCAGCTT TGACTCAATC GTTTACCGAA ACCAGCAGCG GCGAAACCTT CACCGTGGTG GTCAATCACC TTAAGTCGAA GGGTAGTGCT TGTACCGATA ACGTCAGCCC AGTGCCAAGC GACCCCGATT TGGGCGATGG TCAAGGCAAC TGTAACCTGA CCCGCAAAAC TGCTGCCCAA GAATTAGTGG CTTGGTTGAA TACCGACCCA ACCAATGTTG ATGATAGCGA TTATCTGATC ATTGGCGACT TGAACTCATA CGCCATGGAA GATCCAATCA CCGCTATTCG CAACGCTGGC TATGTCAATA TTCCCAAAAC CTTGCTTGGC GACGAAGCCT ACTCATACAT TTTCGATGGC CAAACTGGTT CGCTTGACCA TGCCTTGGCT AGCACTTCGT TGTTCAGCCA AGTTGCTGAT GTTCAAGAAT TGCACATCAA TGCCGATGAA CCAATCGCCT TGGATTACAA CACCAACTTC AAGAGCGCTG GGCAAATCGT CAGCTTGTAC AATAACGATG CCTATCGTGC TTCCGACCAC GATCCAATTG TGGTTGGTTT AGCCTTAGGC GCGGCTCCAA CCGCCAACTT CAGCACTTCA ACCAAAACCG TTAGCAGCAC CAGCGTCGAA CAAGGCGAGG TCTTCACTTA CACCTTGACG ATTCGCAACA CGGGTGCAGC TGAAGGCAGC TTTAGCTTGA GCGACCTGAT CAACAGCAAT CTCGAAATTG TTGGCGTGAC TGGCGGCTTG ACGGTCAATG ACCAAACCGC CAGCGTCAAT AGCAGCCTCG CACCAAGCAG TAACAAAACC TATACCATCG CTGTGCGCAG CGTGGGCAAT TTCGTGGGTA GCGTTGGCAA CACCGCTGTC CTCAACAGCA ATATTAACTT GACCGCCGAT AGCGTGACAG TCAATGCAAC GACAACCCCA AGCTTCACTG TGTACATGCC ATTAGTCACC AAAAACTAA
|
Protein sequence | MQFKGLRLLT ALTMLFTLIG TSLSATYTTP TSAVSTSLVI SQFQTAGGTA DDEFIEIHNI SANPVDLNGH RVVYRAAAGV TDVSIASWTS SVVIPAGGFY LLGRANSYDG TVTADTTFGS GISGTGGGFA IRNGDLNTGT IIDSVGFGSA TNAFVEGTVV AAPSANNSAQ RNNSSCTDTD NNAADFSLLT PSAPRNSSSP VATCGGLPTD NPPTVASTVP ANNANNVALN ANVTLTFSEP VTTTGTWYTI SCTSGARTAS VTGGPTSYVL DPSSDFAASD ACTVTVVASQ VTDRDGSVDA MAQDYSFNFN VSSGVACSGG TVTPIGQVQG TGETSPANGT VVTVQGTVVA DFEGAQPALR GFYLQDAGDS NAATSDGIFV FNGGADQVAL GQVVRVTGTA GENQGQTQLS GSLTVELCGT TNTVTPTQVT LPFASTTDAE RFEGMLVQFN QKLVVSEHYL LARFGQVTLS LNERLAQPTN VVSPGAPALA LQAQNNLHKI ILDDPFNNQN ADPILFGRGG TGLSAANTLR ASDSITGLQG VMTYTWGGNS ASPNSYRVRV TQSPNFVAEN QRPTSLELDG SLRVAGINVL NYFNTFGNGN CTGGVGGAAT DCRGAENTTE FQRQKDKTIA AIIMTQADVI GLMEIENDGF GSNSAIQDLV NGLNAATAPG TYALVDVDTR TGQTNPLGTD AIKVALLYKP GKVSLAGTTA VLNTGAFGDF TTSSGTTGRN RPALTQSFTE TSSGETFTVV VNHLKSKGSA CTDNVSPVPS DPDLGDGQGN CNLTRKTAAQ ELVAWLNTDP TNVDDSDYLI IGDLNSYAME DPITAIRNAG YVNIPKTLLG DEAYSYIFDG QTGSLDHALA STSLFSQVAD VQELHINADE PIALDYNTNF KSAGQIVSLY NNDAYRASDH DPIVVGLALG AAPTANFSTS TKTVSSTSVE QGEVFTYTLT IRNTGAAEGS FSLSDLINSN LEIVGVTGGL TVNDQTASVN SSLAPSSNKT YTIAVRSVGN FVGSVGNTAV LNSNINLTAD SVTVNATTTP SFTVYMPLVT KN
|
| |