Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3179 |
Symbol | |
ID | 5735054 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 4017007 |
End bp | 4020078 |
Gene Length | 3072 bp |
Protein Length | 1023 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641280325 |
Product | SMC domain-containing protein |
Protein accession | YP_001545944 |
Protein GI | 159899697 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0419] ATPase involved in DNA repair |
TIGRFAM ID | [TIGR00618] exonuclease SbcC |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0019664 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTGCCAG AGAAGCTGCG TGTTCGTAAT TTTATGTGTT ATCGCGATGA TGTTCCAACC CTCGATTTCG AAGGAATTCG GGTTGCTTGT TTGTCGGGCG AAAATGGCGC TGGCAAATCG GCCTTGCTCG ATGCAATCAC TTGGGCGCTT TGGGGCAAAG CTCGCGTTTC CTCCGATGAT GAATTGATTG CGTTGGGTGC ACAAGAGATG GAAGTTGATT TGCAATTTAG CGTTGCCAAA ACTTCCTATC GTGTGTTGCG CCGCCGTTCA TCAGCCAAAC GCGGCCAAAC CATCCTCGAA ATTCAAGTTA ACGATGGTGA TAATTGGCGA GCAATTTCTG GCAATAGCAT CCGAGAAACC CAAGAGATCA TTCATTCTGT GCTGCGCATG GAATACGATA CCTTCATTAA TAGTGCTTTT TTGGTGCAAG GCAAAGCCGA TGAATTTACC CGCAAAGCGC CTGCTGAACG CAAACGAGTG CTGGCCGAGA TTTTAGGTCT CGATGCCTAC GAACAACTTG AAGCCCGTGC CAAAGAGCAG GTGCGCTATT TCAGCGACCG CGCTCAAGGC CTCGAAGGTA CAATCAGCAG CTACCGCGAA TGGGTCAATA AGCGTGATTT TTACCTGAGC CAAGAGGCCG AAGCGCAACA ACGGGTGCAG CAGCTTAGCC ATGAAATCGA ACGAGCAACA GTAGTGTTTG AGCAAGCCGA CCAACAACGC CGCGCACTCG AACATCGTAA GGCCGAACGC GACCGCGAGC TAAGCCGCAG CCGCGACCTC GAACGTCAAA TTGCCGAAAG CGATGGCGAT TATCGCCAAC TGGCTCAAGA TATTGGTGTG GCACAAGCGA TGGTCGCTCG TCAAAGCGAA ATTGAAACCA ATTTTCAAGT CTTGCAAGCT GCGCGTGAAG AGTTAGTGGT GCTTGATCGC TTGCGTGAGC AAATTGTCCA GCTCAACGAT CAATACAAAG AGCAACGCGC TTTGGTCGAG CGCGAAGAAC TTTCGTTGCA ACATCAACTT CAGCAATATC AGCGCGATCA GCAAACCAAC GCCGAATTGA TCGCCAATAA AGCCAGTAAG CAAGCAGATT TGGCCGCCTT GGCCGAGCAA TTAGCCGGTT TTGCCGACGA TCAACGCCAT TTAGAGCAAG CCCGCAGCCA GCGCAACGAG CTAGAGCAAC AAAGCACCAA CTTAACTCAA TTGCAGGTTG AGGTGCAACG ATTGCAAGGC TTAATCAACG TGCGCGGCGA TTCGTTAATT GCCGCACGCG AGGAGCAAAA TCGCCGAATT ATCGAGGCCG ACAATGTTTT GGCCAACGAA TCGCGCTGGC GCAGTGAATT TGAAACCTCG GTTGCCCAAC AAAAAGCCCT GATTCGCGAT GAACGCAAGC TCCAAGAAAT GCGCAGCCGC GACCAACAAG ATGCCAAGCA ATTGGGCGAA CTATCGGCAC AAGAGAGCAA TTTAAAGCAA CTTGGCAAGC AAATTAACGA TAAAATCGAC CAACTACGGG CAACCCACGA TACGCATTGC CCACTCTGCC AGAGCGATAT CGGCCAGCAT GGCATTCATC AAGTGATCGA GCAATATATC GTTGAGCGCG ACGATCTGCG TGATCAATAT CGTGAGGCCA GCCAAGCTCG CAAACAACTA AGCAATGAAT ATGATGCTCG CCAGCGCGAA ATTCAAGGTT TAGAGCGCAA AGTTGCCCAG CTGAGCACTC TGGCAGCAGC AGTTGGTCGC TTGGAAGGCC AAATCAGCGA AGCCCAAGAA CATCGTAGCA AACGCAGCGA AGCCGAAAGC ACCTTGCGCG ATTTGAATCA GCGGCTTGAA CATGGCGATT TTGCCCATGA AGAACGCGCC GCCTTAGCCC AAGCCCAAAC TGAAATCGCC GAATTAGGCC TTGATCAAGC GGCGCTTGAT GCCCAACGCC AAGCCAATGG CCGATTAATC GCCCAGCTCG AGCAACGGCT AGCCCAACGT GGCACAATCG AGGCCAAAGC CGCCGTACTC CAAGAACAAC TTGAGCGAAT TCAGGCGGCT GAAACCCACA ATGCCACATT GCACGAAACA ATTTTGAGCT TGCAAGTCCA GATCGATAGT CAGCAATTTG CCCAAACTGC ACGGCAACAG GCCGAGGCAA TTTACCAGCA AATGGCTGAA TTGGGCTATG CCTCACAACG CCATCAAGAA GTGCGCGACG CGGTTGCCAG CCTTGGCCAT TGGGAAGGCG AATATCATCA ATTGCGTTCG GCGCAAACCA ACTTGGCAAC CAACCAACGC CAAGCCCAGC GTTTGGCCGA ATTGATCGAG CGCCAACGCC AAGATTTAGC CCAAATTCAA GTAATGCTCA ATCAACTCAA CCAAGAATTA GCCCAATTGC CAGCCGCTAT TCAAGCTGCC GAAACCGCTC AACGCACAAT CAACGAGTTT CGCGGGCGCT TGGCCGTTGC TCAAAAAGAT TTGGGTGCAG CTCAGCAAAA CGTCCAGCAT GTGGCTCAAG TCGCTGAGCA GTTGGCCGCA GCCGAAAAAG AGCTGCTCTC AGTGCAAGAT CAACGCGATG TTCACAGCGA ATTGGTACGA GCTTTTGGCA AAAAAGGCAT TCAAGCAATG CTGATCGAAA CGGCGATTCC TGAGCTTGAG CGCGAAGCTA ACGAATTGCT CAGCCGCATG ACCGACAACC AAATGCACTT GCGCTTTGAA ACCCAACGCG AAACAAAAAA AGGCGATACC AGCGAAACTC TCGATATTCA AATTGCCGAT GAACAAGGCA CGCGCCGCTA CGATTTGTAT AGCGGTGGCG AGGCCTTCCG CATCAACTTT GCAATTCGGA TTGCCATGAG CAAAATGCTG GCTCGTCGCG CCGGAGCCAA CTTGCAAACA TTAATTATTG ACGAAGGCTT TGGTTCGCAA GATGGTCGTG GGCGCGAACG CTTGGTCGAG GCTATCACCC AAGTTCAGCC AGATTTCAGT CGCATCTTGG TAATCACCCA CATCCAAGAA TTGAAGGATC AATTTCCGGT GCAGATCGAA ATTACCAAAC ACGACAACGG CTCACGTTGG GCGGTGAATT AG
|
Protein sequence | MLPEKLRVRN FMCYRDDVPT LDFEGIRVAC LSGENGAGKS ALLDAITWAL WGKARVSSDD ELIALGAQEM EVDLQFSVAK TSYRVLRRRS SAKRGQTILE IQVNDGDNWR AISGNSIRET QEIIHSVLRM EYDTFINSAF LVQGKADEFT RKAPAERKRV LAEILGLDAY EQLEARAKEQ VRYFSDRAQG LEGTISSYRE WVNKRDFYLS QEAEAQQRVQ QLSHEIERAT VVFEQADQQR RALEHRKAER DRELSRSRDL ERQIAESDGD YRQLAQDIGV AQAMVARQSE IETNFQVLQA AREELVVLDR LREQIVQLND QYKEQRALVE REELSLQHQL QQYQRDQQTN AELIANKASK QADLAALAEQ LAGFADDQRH LEQARSQRNE LEQQSTNLTQ LQVEVQRLQG LINVRGDSLI AAREEQNRRI IEADNVLANE SRWRSEFETS VAQQKALIRD ERKLQEMRSR DQQDAKQLGE LSAQESNLKQ LGKQINDKID QLRATHDTHC PLCQSDIGQH GIHQVIEQYI VERDDLRDQY REASQARKQL SNEYDARQRE IQGLERKVAQ LSTLAAAVGR LEGQISEAQE HRSKRSEAES TLRDLNQRLE HGDFAHEERA ALAQAQTEIA ELGLDQAALD AQRQANGRLI AQLEQRLAQR GTIEAKAAVL QEQLERIQAA ETHNATLHET ILSLQVQIDS QQFAQTARQQ AEAIYQQMAE LGYASQRHQE VRDAVASLGH WEGEYHQLRS AQTNLATNQR QAQRLAELIE RQRQDLAQIQ VMLNQLNQEL AQLPAAIQAA ETAQRTINEF RGRLAVAQKD LGAAQQNVQH VAQVAEQLAA AEKELLSVQD QRDVHSELVR AFGKKGIQAM LIETAIPELE REANELLSRM TDNQMHLRFE TQRETKKGDT SETLDIQIAD EQGTRRYDLY SGGEAFRINF AIRIAMSKML ARRAGANLQT LIIDEGFGSQ DGRGRERLVE AITQVQPDFS RILVITHIQE LKDQFPVQIE ITKHDNGSRW AVN
|
| |