Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpha266_0517 |
Symbol | |
ID | 4569112 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | - |
Start bp | 566631 |
End bp | 569519 |
Gene Length | 2889 bp |
Protein Length | 962 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 639765116 |
Product | excinuclease ABC subunit A |
Protein accession | YP_910998 |
Protein GI | 119356354 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0178] Excinuclease ATPase subunit |
TIGRFAM ID | [TIGR00630] excinuclease ABC, A subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.516633 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAATTCA ATCATATTAC CATCAAGGGA GCAAGAGTTC ACAATCTCAA GAATATCTCT CTCGATATTC CACGAAACCG GTTCGTTGTC ATTACCGGTA TTTCGGGATC AGGCAAATCA AGCCTCGCCT TCGATACTAT TTACGCCGAA GGCCAGCGTC GTTTTATGGA AACCCTCTCT CCCTATGCGA GACAGTATAT CGGCAATATC GAACGACCTG ACGTCGATTT CATCGAAGGG CTTTCGCCGG TTATTTCCAT TGACCAGAAA AGCACAAGCC GATCCCCGCG ATCTACGGTA GGCACCGTAA CGGAAATACA CGATTTCATC CGTCTTCTCT ATGCCAAAGC CGGAAGAAGA TATGACCCCG TAACAGGACA GATGCTTCAA AAACAAAGTG AAGAGAGCAT TTGCGAAGCC ATTCTCTCTC TTCCTGAAGG CACAAAGGTG CAGATCATCT CCCCCCTTGT TACAGGCAGA AAGGGACACT ATCGCGAACT GTTTGAAAAA CTGCTCCAAA AAGGGTTTCT CAGGGTTCGT ATTGACGGCG AATATCGGGA AATGCACAAA AACATGCAGC TCGAACGTTA TAAAAGTCAT GCTGTCGCTC TTGTTATCGA CAGACTCCTT ATCAACCCGG AATCTGCCGA TCGCCTGAAA AAAGCCGTCA ACCTTGCTAT AGGCATGTCA GAGCATAAAT CCTCCGTGAT TTGCGATCCA GTGGAAAGCG ACTGCAAAGA GATGGTCTTC AGCACCAGAT ACGCATATTC AGATGGATCT GTCCCTCTCG ATACCCTTGC TCCAAACAAC TTCAGCTTCA ACTCCCCCTA CGGCGCCTGT CCTTCTTGCA GCGGCCTTGG CACGATCATG CAATTGTCTG CCGACCTCAT GATCCCCAAT CCTTCGCTGT CGCTCAAAGA GGGAGCCATA GAACCATTCG GCAAGGCAGG AAAACGCAAC CTCTGGCAGG TCATTAAAGC AATCGGCAAG GTTTATGGGT TCAGTGTTGA TACGCCCATA TCCAAAATCC CGAAAAAAGC TCTCGACATT CTGCTCTATG GGTCCGGCAG CGAAACCTTT GATATTTCCT ATTCATATGC GGGCAGGGAA AACAGCTACC CGCAACTGTT TGAAGGCGCA CTCCCCTATG TTGAAGAGAT CCGCCTGAAA ACCAACTCCA TGAAACTCCG TGAATGGGCT GAAAGCTTCA TGATCCATCA ACCCTGTCCC GAGTGTAATG GCGCAAGACT TCGAAAAGAA AGTCTCCTGG TTAAACTGAA CGATCTCAAT ATTGCTGAAG TTGAGGCGCT GCCGATACCA GAGGCACTTG ACTTTTTTAT AACCCTTCTT CCGACGCTTA CAGCAAAAGA ACTGCTTGTT GCCACACCAG TTGTGCATGA AATCACCAAA CGTCTGGAAT TTCTTCTCAA TATCGGACTA TCCTATCTCT CTCTGGGCAG AAGCTCGCAA ACGCTTTCAG GAGGAGAGGC CCAGCGCATT CGCCTCGCAT CACAACTCGG ATCACAGCTC AGCGGTGTAC TCTATGTGCT CGATGAACCA AGCATAGGGC TGCATCAACG CGACAATCAC AAGCTGATCG AATCGCTCAT TCGGCTTCGA AATCTCGGCA ATACCGTACT CGTTGTCGAG CACGACAAGG ACACCATGCT CATGGCCGAT CAGGTAATCG ATATCGGGCC TGGAGCAGGA GAATACGGCG GAGAAATCGT TGCTCAGGGA CGAGCTGAAG AACTCGGAAA ACACTCTCTG ACCGCCGCTT ATCTTCAGGG AAAAAAAGAG GTTTACTTTC CTCCTGAAAC AAAAAAAAGT CCTGATCAGG CTAAATTTCT CGTGATTCGG GGATGTCGGG GAAACAATCT GAAAAACATC GATATCCGCT TTCCTCTCTC GTCACTGATC AGTATTACCG GAGTCAGCGG ATCCGGCAAA TCAACCCTTA TCAATGAAAC GCTTTACCCT GCGCTCGCCC GCCATTTTTA CCGGTCAAAG CTTCTGACCT ATCCCTATGA CAGCATAGAG GGCATTGAAC TGATCGACAA GGTCGTCAAT GTCGATCAAT CCCCCATAGG AAGGACGCCG CGCTCAAACC CGGCAACATA CACCGGCGTT TTCACCTTTA TACGCGACTT CTATACCCGA CTTCCGGAAG CGCAAATCAG GGGATACAAG GCTGGACGAT TCAGTTTCAA CGTCAAAGGC GGACGATGCG AAGTATGCCA GGGGGCAGGA ACAAGAAAAA TAGAGATGAA TTTTCTTCCC GACGTCTATG TTCAATGCGA ACACTGCAAA GGCGAACGCT ATAACCGGGA AACTCTCCAG GTAAAATACC GGGGAAAATC CATTGCCGAT GTTCTTGACA TGCCTGTCGA AGAGGCTTCT GTTTTTTTTA CCGATTTCCC TCGCATCAAA CGCATTCTTG CCACCATGGA AAGTGTCGGA CTCGGTTATC TTAAACTCGG TCAGCCCTCG CCCATGCTCT CGGGAGGCGA AGCACAACGC ATCAAACTTT CGGCAGAACT CGCAAAAATC CAGACCGGCC AGACACTCTA CATTCTCGAC GAACCAACAA CAGGACTCCA CTTCCAGGAC ATTCAGCATC TTCTCGAAGT ACTCCGCAAG CTTGTCGATA AGGGAAATAC CGTTATCATC ATAGAACACA ACCTCGACAT CATCAAGAAC AGCGACTGGG TCATCGACCT CGGCCCTGAA GGAGGTTCCG GCGGCGGACA GTTTATCGGC GAAGGGACCC CCCGGGAGAT CGCTCAGCTT GAACACTCCC ATACCGGAAG ATACCTTGCT GTCGAACTGG AGGCAAAACA CTCACCCGAG CTACCCGAAC AAGGGATTCA AAACCCGATT TCCGAATGA
|
Protein sequence | MEFNHITIKG ARVHNLKNIS LDIPRNRFVV ITGISGSGKS SLAFDTIYAE GQRRFMETLS PYARQYIGNI ERPDVDFIEG LSPVISIDQK STSRSPRSTV GTVTEIHDFI RLLYAKAGRR YDPVTGQMLQ KQSEESICEA ILSLPEGTKV QIISPLVTGR KGHYRELFEK LLQKGFLRVR IDGEYREMHK NMQLERYKSH AVALVIDRLL INPESADRLK KAVNLAIGMS EHKSSVICDP VESDCKEMVF STRYAYSDGS VPLDTLAPNN FSFNSPYGAC PSCSGLGTIM QLSADLMIPN PSLSLKEGAI EPFGKAGKRN LWQVIKAIGK VYGFSVDTPI SKIPKKALDI LLYGSGSETF DISYSYAGRE NSYPQLFEGA LPYVEEIRLK TNSMKLREWA ESFMIHQPCP ECNGARLRKE SLLVKLNDLN IAEVEALPIP EALDFFITLL PTLTAKELLV ATPVVHEITK RLEFLLNIGL SYLSLGRSSQ TLSGGEAQRI RLASQLGSQL SGVLYVLDEP SIGLHQRDNH KLIESLIRLR NLGNTVLVVE HDKDTMLMAD QVIDIGPGAG EYGGEIVAQG RAEELGKHSL TAAYLQGKKE VYFPPETKKS PDQAKFLVIR GCRGNNLKNI DIRFPLSSLI SITGVSGSGK STLINETLYP ALARHFYRSK LLTYPYDSIE GIELIDKVVN VDQSPIGRTP RSNPATYTGV FTFIRDFYTR LPEAQIRGYK AGRFSFNVKG GRCEVCQGAG TRKIEMNFLP DVYVQCEHCK GERYNRETLQ VKYRGKSIAD VLDMPVEEAS VFFTDFPRIK RILATMESVG LGYLKLGQPS PMLSGGEAQR IKLSAELAKI QTGQTLYILD EPTTGLHFQD IQHLLEVLRK LVDKGNTVII IEHNLDIIKN SDWVIDLGPE GGSGGGQFIG EGTPREIAQL EHSHTGRYLA VELEAKHSPE LPEQGIQNPI SE
|
| |