Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4583 |
Symbol | |
ID | 5736428 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 5862018 |
End bp | 5865059 |
Gene Length | 3042 bp |
Protein Length | 1013 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641281745 |
Product | excinuclease ABC, A subunit |
Protein accession | YP_001547342 |
Protein GI | 159901095 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0178] Excinuclease ATPase subunit |
TIGRFAM ID | [TIGR00630] excinuclease ABC, A subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGCAGG ATAAAATTGT CATCAAGGGC GCACGTGAGC ACAATCTTAA AAATATCGAC ATCGAACTAC CACGCGATCA ATTAGTTGTG TTGACGGGAG TTTCTGGCTC AGGCAAGTCA TCGTTAGCCT TTGATACCTT ATATGCCGAA GGCCAGCGCC GCTATGTTGA ATCGCTTTCC TCGTATGCCC GCCAGTTTTT AGGCCAACTC GAAAAGCCCA AGGTCGATTT TATTGGTGGG CTTTCGCCAG CAATCGCGAT CGAACAAAAA TCGGCCTCGA AAAATCCGCG CTCAACTGTG GGCACGGTCA CCGAAATTTA TGATTATTTG CGGGTCTTGT TTGCGCGGGT TGGGGTGCCA CACTGCCATA AATGCGGTAA AGCGATCGGC TCACAAACTG CTGAGCAGAT GGTCAATCGG GTGCTGGAGT TACCCAAAGG CACGCGCTTT ATGCTTTTAG CGCCGATGGT AGCCGCTCGC AAAGGCGAAT ATAAAGATAT TTTCGATGAT GCCCGCTCGC AGGGGTTTTC GCGGGCACGG GTCGATGGCG AAATTCGTGA TCTGCAAGAC GAAATCAAAC TCAATAAAAA AGTTAAACAT ACAATTGATA TTGTGGTTGA TCGCTTGGTC GTGCCAACCG AGGATGACGA AACCTTTCGT TCACGACTTA ACGATAGTGT TGAAACCGCG CTACGCACCG CCAATGGCAC AATTATTATC GCTATTCCTG AGTTTGCCCA ACAAACCGAA AAATCAAAAT CCAAAAAAGC CAAGGCCAAA GCGGTTGCTA CCGAAGTTAA TGAAGAAGAA TTGTTGCCCG AAGAAGAATC GCGTATTAGC GGCATGAACG CTGCTGGCGA TATTGTGATG AGCGAGGATT ATGCCTGTGT TGATTGTGGC ATCTCGTTCC TTGAACTCAA CCCACAAATG TTTTCGTTCA ATGCTCCCCA AGGGGCTTGC CCAGAATGTG CTGGCTTGGG TACGCGTTTA GAAGTTGATG CAGAACTGCT CGTGCCAAAC CCGAATTTAT CGTTGCACGA TGGTGCGGTA ACCTATTGGG GCGAGTTGCG CAAAAAAGTT GGTAGCTGGG GCTATCGCGC TCTGCAAGCC ATCTCCGCCC ACTATCAATT TGATCTTGAT ACACCGTGGA AAGACCTCAG CCCGCGTGTG CGTGAGATTT TGATGCAAGG CTCAGGCAGT GAAAAAGTCA AGCATGTTTG GAGCGAAGGT CAGTCCAAGG GCGAGTATTA TCGTCGTTGG GAGGGCTTGG GCGCAGAAAT TATGCGCCGC TTCCAGCAAA CTGGTGTTGA AAATATGCGC GAGCACTACC AACAATGGAT GAGCGATCAG CCTTGTCATG CCTGCCATGG CGCAAAATTG CGGCCTGAGA GTTTGGCAGT GACGGTTGGT GGTGAAAATG TGCAACAAAT TTGTGCCAAA ACTGTGGCTC AAGGCTATGC CTGGGCCTGT GGTTTAACTG GCAGCGATGC CCACTGGGTG AGCCGTGATG GTTTGGATAC CACGGTATTG CCGTTGTTGG CTGCCGCGCC AACCCCGACC AAACTCGATG GTCGGCAATT GGAAATTGCT GGCGAAGTGC TCAAAGAAAT TCGTGAGCGC TTGGGCTTTT TGTTGAATGT TGGTTTGCAC TATCTCACGC TCGATCGCTC AGCTCCCTCG CTGTCTGGTG GCGAGGCTCA GCGCATCCGC TTGGCTTCAC AAATTGGGGC GGGCTTGATG GGCGTGATGT ATATTCTCGA CGAGCCATCA ATTGGCTTGC ATCAACGTGA TAATCGCAAA TTGATCGATA CTTTGACCAA ACTGCGCGAT CTTGGCAACA CCGTAATTGT GGTTGAACAC GACGAAGATA CCATGAAAGC CGCTGATTGG CTGGTAGACT TCGGGCCTGG CGCGGGAGTC AACGGCGGCA AGGTGGTCGC TGAAGGTCGG CCAGAATATA TCAGCAGCAA TGGTTCGTTG ACCGGTAGTT ATTTGTCGGG GCGCTTGAAA ATTGAAGTGC CAGCAACCCG CCGCCCAGCC CATGGTCATA TTACTTTGCG CGGTGCAACC CACAACAACC TCAACAACCT TGATATTACG ATTCCTTTGG GCACGCTGGT CGCGGTAACT GGGGTTTCTG GCTCAGGCAA ATCCTCATTG ATCACCGAAA CGCTCTATCC AGCCTTGGCA AATTTGCTCA ATCGCGCCCA GCTGCGGGTT GGCAAATACG ATACGCTCGA AGGTTTAGAG CAACTTGATA AAGTGATCGA TATTGATCAA CAGCCGATTG GCCGCACGCC GCGCTCGAAT CCTGCAACCT ACGTCAAGTT ATTTGATCAG ATTCGTGAGG TGTTTGCCAA TACGCCTGAT GCTAAACTGC GCGGCTATGA GCCAGGCCGC TTTTCGTTCA ACGTTAAGGG TGGGCGCTGT GAAGCCTGCC AAGGCAATGG CGAGCAAAAG ATCGAAATGC ACTTCTTGGC TGATGTGTGG GTGCGCTGCG ATGAGTGCAA AGGCAAGCGT TACAATCGCG AAACCTTGCA AGTGCGCTAC AAAGGCAAAA CGATCTCCGA TGTGCTGGAT ATGGATGTGC ACACTGCGCT CGAATTCTTC GAGAACCACC CCAAACTCAA GCGCGTGCTG CAAACCTTGC ACGATGTTGG TTTGGATTAC ATCAAACTTG GCCAATCGGC GACGACGCTC TCTGGTGGCG AGGCTCAGCG GGTCAAATTG GCCAAAGAAT TAGCGCGGGT GGCCACTGGT CGCACGATCT ACATTCTCGA TGAGCCAACT ACAGGCTTGC ACTTCGCCGA CGTACAAAAT CTGTTGCGGG TCATTCAGCG CTTGGTCAAG GCTGGCAACA CGGTTTTGGT GATTGAACAC AGCCTCGATG TGATCAAAAC CGCCGACTGG ATTATTGATC TTGGGCCTGA GGGTGGTACT GGCGGCGGCT ATATTATTGC CCAAGGCACG CCTGAAGAAG TGGCCTTGCA TCCAACTTCG CACACAGCAG TCTTTTTGCG CGATTTGCTG AACCTTGACT GA
|
Protein sequence | MAQDKIVIKG AREHNLKNID IELPRDQLVV LTGVSGSGKS SLAFDTLYAE GQRRYVESLS SYARQFLGQL EKPKVDFIGG LSPAIAIEQK SASKNPRSTV GTVTEIYDYL RVLFARVGVP HCHKCGKAIG SQTAEQMVNR VLELPKGTRF MLLAPMVAAR KGEYKDIFDD ARSQGFSRAR VDGEIRDLQD EIKLNKKVKH TIDIVVDRLV VPTEDDETFR SRLNDSVETA LRTANGTIII AIPEFAQQTE KSKSKKAKAK AVATEVNEEE LLPEEESRIS GMNAAGDIVM SEDYACVDCG ISFLELNPQM FSFNAPQGAC PECAGLGTRL EVDAELLVPN PNLSLHDGAV TYWGELRKKV GSWGYRALQA ISAHYQFDLD TPWKDLSPRV REILMQGSGS EKVKHVWSEG QSKGEYYRRW EGLGAEIMRR FQQTGVENMR EHYQQWMSDQ PCHACHGAKL RPESLAVTVG GENVQQICAK TVAQGYAWAC GLTGSDAHWV SRDGLDTTVL PLLAAAPTPT KLDGRQLEIA GEVLKEIRER LGFLLNVGLH YLTLDRSAPS LSGGEAQRIR LASQIGAGLM GVMYILDEPS IGLHQRDNRK LIDTLTKLRD LGNTVIVVEH DEDTMKAADW LVDFGPGAGV NGGKVVAEGR PEYISSNGSL TGSYLSGRLK IEVPATRRPA HGHITLRGAT HNNLNNLDIT IPLGTLVAVT GVSGSGKSSL ITETLYPALA NLLNRAQLRV GKYDTLEGLE QLDKVIDIDQ QPIGRTPRSN PATYVKLFDQ IREVFANTPD AKLRGYEPGR FSFNVKGGRC EACQGNGEQK IEMHFLADVW VRCDECKGKR YNRETLQVRY KGKTISDVLD MDVHTALEFF ENHPKLKRVL QTLHDVGLDY IKLGQSATTL SGGEAQRVKL AKELARVATG RTIYILDEPT TGLHFADVQN LLRVIQRLVK AGNTVLVIEH SLDVIKTADW IIDLGPEGGT GGGYIIAQGT PEEVALHPTS HTAVFLRDLL NLD
|
| |