Gene Cpha266_0517 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpha266_0517 
Symbol 
ID4569112 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides DSM 266 
KingdomBacteria 
Replicon accessionNC_008639 
Strand
Start bp566631 
End bp569519 
Gene Length2889 bp 
Protein Length962 aa 
Translation table11 
GC content49% 
IMG OID639765116 
Productexcinuclease ABC subunit A 
Protein accessionYP_910998 
Protein GI119356354 
COG category[L] Replication, recombination and repair 
COG ID[COG0178] Excinuclease ATPase subunit 
TIGRFAM ID[TIGR00630] excinuclease ABC, A subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.516633 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAATTCA ATCATATTAC CATCAAGGGA GCAAGAGTTC ACAATCTCAA GAATATCTCT 
CTCGATATTC CACGAAACCG GTTCGTTGTC ATTACCGGTA TTTCGGGATC AGGCAAATCA
AGCCTCGCCT TCGATACTAT TTACGCCGAA GGCCAGCGTC GTTTTATGGA AACCCTCTCT
CCCTATGCGA GACAGTATAT CGGCAATATC GAACGACCTG ACGTCGATTT CATCGAAGGG
CTTTCGCCGG TTATTTCCAT TGACCAGAAA AGCACAAGCC GATCCCCGCG ATCTACGGTA
GGCACCGTAA CGGAAATACA CGATTTCATC CGTCTTCTCT ATGCCAAAGC CGGAAGAAGA
TATGACCCCG TAACAGGACA GATGCTTCAA AAACAAAGTG AAGAGAGCAT TTGCGAAGCC
ATTCTCTCTC TTCCTGAAGG CACAAAGGTG CAGATCATCT CCCCCCTTGT TACAGGCAGA
AAGGGACACT ATCGCGAACT GTTTGAAAAA CTGCTCCAAA AAGGGTTTCT CAGGGTTCGT
ATTGACGGCG AATATCGGGA AATGCACAAA AACATGCAGC TCGAACGTTA TAAAAGTCAT
GCTGTCGCTC TTGTTATCGA CAGACTCCTT ATCAACCCGG AATCTGCCGA TCGCCTGAAA
AAAGCCGTCA ACCTTGCTAT AGGCATGTCA GAGCATAAAT CCTCCGTGAT TTGCGATCCA
GTGGAAAGCG ACTGCAAAGA GATGGTCTTC AGCACCAGAT ACGCATATTC AGATGGATCT
GTCCCTCTCG ATACCCTTGC TCCAAACAAC TTCAGCTTCA ACTCCCCCTA CGGCGCCTGT
CCTTCTTGCA GCGGCCTTGG CACGATCATG CAATTGTCTG CCGACCTCAT GATCCCCAAT
CCTTCGCTGT CGCTCAAAGA GGGAGCCATA GAACCATTCG GCAAGGCAGG AAAACGCAAC
CTCTGGCAGG TCATTAAAGC AATCGGCAAG GTTTATGGGT TCAGTGTTGA TACGCCCATA
TCCAAAATCC CGAAAAAAGC TCTCGACATT CTGCTCTATG GGTCCGGCAG CGAAACCTTT
GATATTTCCT ATTCATATGC GGGCAGGGAA AACAGCTACC CGCAACTGTT TGAAGGCGCA
CTCCCCTATG TTGAAGAGAT CCGCCTGAAA ACCAACTCCA TGAAACTCCG TGAATGGGCT
GAAAGCTTCA TGATCCATCA ACCCTGTCCC GAGTGTAATG GCGCAAGACT TCGAAAAGAA
AGTCTCCTGG TTAAACTGAA CGATCTCAAT ATTGCTGAAG TTGAGGCGCT GCCGATACCA
GAGGCACTTG ACTTTTTTAT AACCCTTCTT CCGACGCTTA CAGCAAAAGA ACTGCTTGTT
GCCACACCAG TTGTGCATGA AATCACCAAA CGTCTGGAAT TTCTTCTCAA TATCGGACTA
TCCTATCTCT CTCTGGGCAG AAGCTCGCAA ACGCTTTCAG GAGGAGAGGC CCAGCGCATT
CGCCTCGCAT CACAACTCGG ATCACAGCTC AGCGGTGTAC TCTATGTGCT CGATGAACCA
AGCATAGGGC TGCATCAACG CGACAATCAC AAGCTGATCG AATCGCTCAT TCGGCTTCGA
AATCTCGGCA ATACCGTACT CGTTGTCGAG CACGACAAGG ACACCATGCT CATGGCCGAT
CAGGTAATCG ATATCGGGCC TGGAGCAGGA GAATACGGCG GAGAAATCGT TGCTCAGGGA
CGAGCTGAAG AACTCGGAAA ACACTCTCTG ACCGCCGCTT ATCTTCAGGG AAAAAAAGAG
GTTTACTTTC CTCCTGAAAC AAAAAAAAGT CCTGATCAGG CTAAATTTCT CGTGATTCGG
GGATGTCGGG GAAACAATCT GAAAAACATC GATATCCGCT TTCCTCTCTC GTCACTGATC
AGTATTACCG GAGTCAGCGG ATCCGGCAAA TCAACCCTTA TCAATGAAAC GCTTTACCCT
GCGCTCGCCC GCCATTTTTA CCGGTCAAAG CTTCTGACCT ATCCCTATGA CAGCATAGAG
GGCATTGAAC TGATCGACAA GGTCGTCAAT GTCGATCAAT CCCCCATAGG AAGGACGCCG
CGCTCAAACC CGGCAACATA CACCGGCGTT TTCACCTTTA TACGCGACTT CTATACCCGA
CTTCCGGAAG CGCAAATCAG GGGATACAAG GCTGGACGAT TCAGTTTCAA CGTCAAAGGC
GGACGATGCG AAGTATGCCA GGGGGCAGGA ACAAGAAAAA TAGAGATGAA TTTTCTTCCC
GACGTCTATG TTCAATGCGA ACACTGCAAA GGCGAACGCT ATAACCGGGA AACTCTCCAG
GTAAAATACC GGGGAAAATC CATTGCCGAT GTTCTTGACA TGCCTGTCGA AGAGGCTTCT
GTTTTTTTTA CCGATTTCCC TCGCATCAAA CGCATTCTTG CCACCATGGA AAGTGTCGGA
CTCGGTTATC TTAAACTCGG TCAGCCCTCG CCCATGCTCT CGGGAGGCGA AGCACAACGC
ATCAAACTTT CGGCAGAACT CGCAAAAATC CAGACCGGCC AGACACTCTA CATTCTCGAC
GAACCAACAA CAGGACTCCA CTTCCAGGAC ATTCAGCATC TTCTCGAAGT ACTCCGCAAG
CTTGTCGATA AGGGAAATAC CGTTATCATC ATAGAACACA ACCTCGACAT CATCAAGAAC
AGCGACTGGG TCATCGACCT CGGCCCTGAA GGAGGTTCCG GCGGCGGACA GTTTATCGGC
GAAGGGACCC CCCGGGAGAT CGCTCAGCTT GAACACTCCC ATACCGGAAG ATACCTTGCT
GTCGAACTGG AGGCAAAACA CTCACCCGAG CTACCCGAAC AAGGGATTCA AAACCCGATT
TCCGAATGA
 
Protein sequence
MEFNHITIKG ARVHNLKNIS LDIPRNRFVV ITGISGSGKS SLAFDTIYAE GQRRFMETLS 
PYARQYIGNI ERPDVDFIEG LSPVISIDQK STSRSPRSTV GTVTEIHDFI RLLYAKAGRR
YDPVTGQMLQ KQSEESICEA ILSLPEGTKV QIISPLVTGR KGHYRELFEK LLQKGFLRVR
IDGEYREMHK NMQLERYKSH AVALVIDRLL INPESADRLK KAVNLAIGMS EHKSSVICDP
VESDCKEMVF STRYAYSDGS VPLDTLAPNN FSFNSPYGAC PSCSGLGTIM QLSADLMIPN
PSLSLKEGAI EPFGKAGKRN LWQVIKAIGK VYGFSVDTPI SKIPKKALDI LLYGSGSETF
DISYSYAGRE NSYPQLFEGA LPYVEEIRLK TNSMKLREWA ESFMIHQPCP ECNGARLRKE
SLLVKLNDLN IAEVEALPIP EALDFFITLL PTLTAKELLV ATPVVHEITK RLEFLLNIGL
SYLSLGRSSQ TLSGGEAQRI RLASQLGSQL SGVLYVLDEP SIGLHQRDNH KLIESLIRLR
NLGNTVLVVE HDKDTMLMAD QVIDIGPGAG EYGGEIVAQG RAEELGKHSL TAAYLQGKKE
VYFPPETKKS PDQAKFLVIR GCRGNNLKNI DIRFPLSSLI SITGVSGSGK STLINETLYP
ALARHFYRSK LLTYPYDSIE GIELIDKVVN VDQSPIGRTP RSNPATYTGV FTFIRDFYTR
LPEAQIRGYK AGRFSFNVKG GRCEVCQGAG TRKIEMNFLP DVYVQCEHCK GERYNRETLQ
VKYRGKSIAD VLDMPVEEAS VFFTDFPRIK RILATMESVG LGYLKLGQPS PMLSGGEAQR
IKLSAELAKI QTGQTLYILD EPTTGLHFQD IQHLLEVLRK LVDKGNTVII IEHNLDIIKN
SDWVIDLGPE GGSGGGQFIG EGTPREIAQL EHSHTGRYLA VELEAKHSPE LPEQGIQNPI
SE