Gene Information |
Locus tag | Haur_0063 |
Symbol | |
ID | 5731935 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 80277 |
End bp | 82202 |
Gene Length | 1926 bp |
Protein Length | 641 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641277184 |
Product | excinuclease ABC, C subunit |
Protein accession | YP_001542843 |
Protein GI | 159896596 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0322] Nuclease subunit of the excinuclease complex |
TIGRFAM ID | [TIGR00194] excinuclease ABC, C subunit |
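
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the gene length includes the stop codon; the record does not state either convention, but both are consistent with the values shown. The function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Haur_0063.
length = gene_length_bp(80277, 82202)
protein = expected_protein_length(length)
print(length, protein)
```

Under these assumptions the computed values agree with the Gene Length (1926 bp) and Protein Length (641 aa) fields.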
| |
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCAGATT TATCTCATAC GAATATTGCC AATCATGCGT TGTTTGAAGA ACGCTTGCGC AACTTGCCAC TCTCGCCAGG CGTGTATATC TACCGCGATC AGGCCAATAC GATCATCTAC GTGGGCAAAT CCAAAAGCCT GCGGGATCGG GTGCGTTCGT ATTTTGGTGC GCCGCGTGGC CTAACCAGCA AAACCCGACG GTTGGTGCAA AATATCGCCG ATTTTGAGTT TATTACTACC GATACCGAGC TAGAAGCGCT ACTTTTAGAA ATGAATCTGA TCAAAAAGCA TCGCCCACGC TACAATGTGT TGCTCAAGGA TGACAAAAGT TATCCCTATA TCAAAATTAC CAAAGAAGCA TGGCCAAGGG TTTTACGAGT GCGCAAAGTG ATTGAAGATG GTGGCATTTA TTTCGGGCCA TTTGCCAAAG CTAGCAGCGT TTATGCCACA ATTGAGTTAT TAAATAAATT ATTTGCTTTT CGTTTATGCA ACGATGATAT GTTTAAAAAG CACGAGCGCC GCAATCGGGC TTGTATGTAT TATGATATTA AACGCTGTCT TGGGCCATGT GCCAACAACT GTACCACCGA AGAATATAGC ACTGCAATTA ATCAAGTACG CTTATTTTTA GGTGGCAAGC CTGAAGCAAT TTTACGTGAC CTAAAAGTGA AAATGAACCA AGCTGCCGAA GATTTACAAT TTGAACGCGC GGCCTATGTT CGCGATCAAA TCAAGGCAGT CGAACGGGTG ATGGAGCGCC AAAAAGTGCT GAATACCGCC GCCAGTGACC AAGATGTAAT TGCTTTTGCC CGTGATGAAG GTAAAGCGGT GGTGCAGGTG TTCTACATTC GTGGTGGCAA GTTGATTGGC TCGGAGCCAT TTACGCTGCA AGGCACGGAG GAAGAACATC CCGAAGCCTT GATGAGTTCG TTCTTGACCC AATTTTATGA TGCCGCCGCC GATATTCCGC CCAATATTCT GCTGCCCGAT TATCCCGAAG AAACCCAAAT TATTGAGCAA TGGCTGGAAA GCAAGGGCGG ACATAAAGTG AGTTTGCAAG TGCCACGACG CGGCGATAAA AAAGATTTGG TCGATTTGGC GGCCCGCAAT GCCAGCCAAA CCCTCGATCA ATTGCGCTTG CAATGGCTCA ACGCCGAACA ACGGGCAACA GCCGGGCTGA GTCAATTGCG CGAATTATTG AATTTAGCCG AATTGCCCCA ACGCATCGAA TGCTACGACA TCTCGAATAC CCAAGGCACC AATTCGGTGG GCAGTATGGT GGTGTTTGAG CAGGGCGAGC CAGCCAAAAA GCACTATCGC CGTTTCAAAA TCAAAACCGT TGAAGGTGCT AATGATGTGG CCTCGCTCAG CGAAGTGCTA CAACGCCGGT TTGCCCGCGC CGATGATACT GGCCAAACGG ACGAGCCAGA ATCAACCGAG CAAACCAGCG CCGAGCCAAC TAATAACGAC GAAACTTGGG CGGTATTGCC CGATCTGATT TTGATCGACG GTGGGATCGG CCAAGTCAAT GCTGCAGCCA AAACTTTGGC AGCGGCTGGT TTTGAGCATA TTCCGGTGGT TGGCTTGGTC AAGGGCGATA CCAAAGGCCA CTTGCCCTAT GGTTTGGTCA AACCTGGCCA GCGTGTGCCA ATCGCCTTTG CCCAAAACGA TCCAGGCTTG CATCTCGTTC AGCGCATCGA CGAAGAAGCC CATCGCTTTG CGATTAGCTA TCACCGTAAA TTGCGCACTA AAGGCATGCT GCGCTCGACC ATGGAAGATA TTCCAGGTAT TGGCCCCAAA 
CGCAAAAAAG CGTTGATCAA CGCCTTTGGC TCGCTGGAGG GCATTCGCAA CGCCAGCATC GAGGAGCTAG CCGCCGTGCC TGGCATGACC CGCAAAGCCG CCGAGGAGAT CAAAGGGCTG TTGTAG
|
Protein sequence | MPDLSHTNIA NHALFEERLR NLPLSPGVYI YRDQANTIIY VGKSKSLRDR VRSYFGAPRG LTSKTRRLVQ NIADFEFITT DTELEALLLE MNLIKKHRPR YNVLLKDDKS YPYIKITKEA WPRVLRVRKV IEDGGIYFGP FAKASSVYAT IELLNKLFAF RLCNDDMFKK HERRNRACMY YDIKRCLGPC ANNCTTEEYS TAINQVRLFL GGKPEAILRD LKVKMNQAAE DLQFERAAYV RDQIKAVERV MERQKVLNTA ASDQDVIAFA RDEGKAVVQV FYIRGGKLIG SEPFTLQGTE EEHPEALMSS FLTQFYDAAA DIPPNILLPD YPEETQIIEQ WLESKGGHKV SLQVPRRGDK KDLVDLAARN ASQTLDQLRL QWLNAEQRAT AGLSQLRELL NLAELPQRIE CYDISNTQGT NSVGSMVVFE QGEPAKKHYR RFKIKTVEGA NDVASLSEVL QRRFARADDT GQTDEPESTE QTSAEPTNND ETWAVLPDLI LIDGGIGQVN AAAKTLAAAG FEHIPVVGLV KGDTKGHLPY GLVKPGQRVP IAFAQNDPGL HLVQRIDEEA HRFAISYHRK LRTKGMLRST MEDIPGIGPK RKKALINAFG SLEGIRNASI EELAAVPGMT RKAAEEIKGL L
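
The sequences above are displayed in space-separated blocks of ten characters, so they need to be cleaned before any computation. A minimal sketch of that cleanup, together with a GC-fraction calculation of the kind that could underlie the GC content field, is given below. The helper names are hypothetical, and the sample input is only the first two blocks of the gene sequence, not the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two ten-base blocks of the gene sequence, as shown in the record.
sample = clean_sequence("ATGCCAGATT TATCTCATAC")
print(len(sample), gc_fraction(sample))
```

Applying `clean_sequence` to the full gene sequence and passing the result to `gc_fraction` would give a value to compare against the reported GC content; the record does not say how that figure was rounded.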