Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4118 |
Symbol | |
ID | 5735979 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 5265134 |
End bp | 5267992 |
Gene Length | 2859 bp |
Protein Length | 952 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641281272 |
Product | excinuclease ABC, A subunit |
Protein accession | YP_001546878 |
Protein GI | 159900631 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0178] Excinuclease ATPase subunit |
TIGRFAM ID | [TIGR00630] excinuclease ABC, A subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCGAAA TTGATTCGAT TCAGATTGTT GGCGCACGCC AGCATAATCT CAAAAATATT GATTTAGCCA TTCCCAAAGG CAAATTGGTG GTATTCACCG GCCCATCGGG GGCTGGCAAA TCAACACTCG CCTTTGATAC GATTTATGCT GAAGGCCAGC GTCGCTATGT TGAATCGCTT TCGAGCTATG CCCGCCAATT TTTGGGTCAA TTGCCCCGAC CTGAAGTGGA TAGCATTCGT GGTCTAGCGC CAGCAATTGC CGTCGCCCAA CAAACGATCA ACCGTTCGCC GCGCTCAACC GTTGGCACAA TCACCGAAAT TTATGATCAT TTGCGCTTGT TGTATGCGCG GATTGGCAAA CCGCATTGCC CAGTCTGTGG CCGTGCGATT GAGCAACAAA CTGCTAGCCA AATCGTCGAT CAGATTTTGG GTTATCCTGA CGGCACACGC CTGATGATTT TAGCGCCCTT GGTGAATGAA GAACTTGGTT CGCATGCCAC GGTGCTTGAG CAAACGCGAC GGGCTGGCTT TGTGCGGGTA CGGGTTGATG GCGCAATTGT TGATCTTGAT GAACCAATTG AGTTGGATCA TCGTCAAGCC CATAGCATAG ATGTGGTGGT CGATCGGCTG ATTATTCGCC ACAGCGAGGC CGTGAGCTTG AATGACCATC CGGATCGGGT ACGGGTGAGC GATTCGGTTG AAACAGCCTT GAAAACTGGC GCGGGAATGG TCTGGGTTCA GCCGCTTGAT GGTCAACAGC TGCGTTACAG CGAACATGCT GCATGCCCTG AACATGGGCC ATTGGCCAGC GGCGCGATCG AACCACGCAG CTTTTCATTT AATAGCCCAC ATGGTGCTTG CCCGATTTGC GATGGATTAG GCACAGTTGA TGATTTCGAC CGGAACCTGT TACAATCTCA GGCAGCGCAA ACCCTTGGCG AACTGCTTTC GAATCCACTT CGCGCCAGCA CCACGACCTA CCAACAATAT TGGGAAGAAA CCATCCAAGG TTTAGCTGAA GCTTTGGGCA GCGATCTCGA ACAGCCTGTT GACACAATGG CTAGCTGGGC GTTAGATTTA TGGCTAGAAG GCACAATTCC AAGCGATGAT CAACTACATT TAAGCAAAAA GCTACGCCAA CAGCTCGCCA ACTGGTCAGG TTTGCTCGGT TGGTTGCGTC AACATTGGCA ACAAGCAAGC GAACAACAGC GCGAAAGCCT CAGCATCTAT CGCCAAGGCA CCATTTGTTC GGCCTGTGAA GGTTCGCGCT TGCGACCAGA GGCGCGGGCA GTTACACTAC AGGGGCTAGC AATCGATCAA GTTACAGCAA TGTCGATTGA GGCTAGTTTT GCTTGGGTTA GTGAACTACC CAACAAATTG CGGCGCGAGC GTGAGCAACA GATTGCAGCG CCGATTGTGC GTGAAATGTG CTTGCGTTTA CAGTTCTTGC GCGAAGTTGG CCTAGACTAT TTGAGTTTGG CGCGAACTGC TGAGAGCCTA TCGGGCGGCG AGGCTCAACG GATTCAATTG GCCACCCATA TTGGAGCTGG ATTATCAGGG GTCTTGTATG TCTTGGATGA GCCATCGATT GGCTTGCATC CACGCGATAC TGAGCGTTTA TTACAGACAT TATTGCAACT ACGCGACCTT CGCAATTCAG TTTTAGTAGT CGAGCATGAT CCAGCGATTA TTGCTGCTGC TGATTGGGTG GTTGAAGTTG GGCCGACAGC GGGCGTGCAA GGTGGCTATA TTATGGCCAG CGGCACGCTT GAGCAACTGA TAGCTCAGCC CAATTCCCAA ACAGGCCAGT ATGTAGCTGG GCAACGGCAG CTAAGCCTGC CTCAAACACG GCGAAAGCCA ACCCATGCTA CACTAATGCT ACGTGGAGCC AAACAGCATA ATCTCAAAGA TCTGGATGTA GCGATTCCAT TGGGATGTTT GGTTGCGATT ACGGGAGTAT CAGGTTCGGG GAAATCGACA CTTATTCATG AGATTCTCTA CCCACGGCTG GCCAACGAAC TGCATGGAAG CCGCCTGCCA GTGGGCCGAC ATCGAAGCCT TGAAGGCTAT GATCAACTTG AAAAAGTGAT TGCAGTTGAT CAAACGTCAC TGGGGCGCTC AGGTCGTTCC AACGCTGCCA CCTACACCGG TATTTTCGAC GCATTGCGTC AATTGTTTGC TGGGACACCT GAAGCCAAAG CGCGGGGCTA TGGAGCCAGT CGATTTTCCT TTAATCTTAA GGGAGGGCGG TGCGAACAAT GTCGCGGCGA AGGTGTGGTA TCGATTGCCA TGCAATTTCT GCCAGATCTA GCGGTGACCT GCGACGCATG CGGCGGGTTG CGTTATAATC GAGAAACCCT TGATATTCGC TATCGTGGGT ATACCATTGC TGATGTGCTC GCCATGACCG TAGGCCAAGC CTTAAGCGTT TTTGAACGGC TACCGGCTTT GGCGCGAAAA CTAGAGAGTT TGGTTGAGGT CGGGTTAAGC TATTTGACGT TAGGGCAGCC AGCGGCGACT CTTTCGGGAG GCGAAGCGCA ACGGGTAAAA CTGGCGGCAG AGCTAGCGCG TCGAGGAACA GGCCGAACCC TGTACATCCT TGATGAACCA ACCACCGGAT TATATTGGAC GGATGTCGAA CGGTTAATTG CGATATTGCA ACGATTGGTT GATACAGGAA ACAGCGTTGT GGTGATCGAA CATCATCTCG ACCTGATCAA GACCGCCGAT TGGGTGATCG ATTTAGGGCC GGAAGGTGGG GACACTGGTG GCCGGCTGGT AGTGGCTGGT ACGCCGGAAG TAGTCGCTAT GAATCAAGCT TCATGGACTG GCCGCTTCCT TCAAACCGTG TTAGCTTGA
|
Protein sequence | MAEIDSIQIV GARQHNLKNI DLAIPKGKLV VFTGPSGAGK STLAFDTIYA EGQRRYVESL SSYARQFLGQ LPRPEVDSIR GLAPAIAVAQ QTINRSPRST VGTITEIYDH LRLLYARIGK PHCPVCGRAI EQQTASQIVD QILGYPDGTR LMILAPLVNE ELGSHATVLE QTRRAGFVRV RVDGAIVDLD EPIELDHRQA HSIDVVVDRL IIRHSEAVSL NDHPDRVRVS DSVETALKTG AGMVWVQPLD GQQLRYSEHA ACPEHGPLAS GAIEPRSFSF NSPHGACPIC DGLGTVDDFD RNLLQSQAAQ TLGELLSNPL RASTTTYQQY WEETIQGLAE ALGSDLEQPV DTMASWALDL WLEGTIPSDD QLHLSKKLRQ QLANWSGLLG WLRQHWQQAS EQQRESLSIY RQGTICSACE GSRLRPEARA VTLQGLAIDQ VTAMSIEASF AWVSELPNKL RREREQQIAA PIVREMCLRL QFLREVGLDY LSLARTAESL SGGEAQRIQL ATHIGAGLSG VLYVLDEPSI GLHPRDTERL LQTLLQLRDL RNSVLVVEHD PAIIAAADWV VEVGPTAGVQ GGYIMASGTL EQLIAQPNSQ TGQYVAGQRQ LSLPQTRRKP THATLMLRGA KQHNLKDLDV AIPLGCLVAI TGVSGSGKST LIHEILYPRL ANELHGSRLP VGRHRSLEGY DQLEKVIAVD QTSLGRSGRS NAATYTGIFD ALRQLFAGTP EAKARGYGAS RFSFNLKGGR CEQCRGEGVV SIAMQFLPDL AVTCDACGGL RYNRETLDIR YRGYTIADVL AMTVGQALSV FERLPALARK LESLVEVGLS YLTLGQPAAT LSGGEAQRVK LAAELARRGT GRTLYILDEP TTGLYWTDVE RLIAILQRLV DTGNSVVVIE HHLDLIKTAD WVIDLGPEGG DTGGRLVVAG TPEVVAMNQA SWTGRFLQTV LA
|
| |