Gene Cpha266_1939 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpha266_1939 
Symbol 
ID4570053 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides DSM 266 
KingdomBacteria 
Replicon accessionNC_008639 
Strand
Start bp2245794 
End bp2248646 
Gene Length2853 bp 
Protein Length950 aa 
Translation table11 
GC content49% 
IMG OID639766521 
Productexcinuclease ABC subunit A 
Protein accessionYP_912379 
Protein GI119357735 
COG category[L] Replication, recombination and repair 
COG ID[COG0178] Excinuclease ATPase subunit 
TIGRFAM ID[TIGR00630] excinuclease ABC, A subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0317613 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGACAC AACGGCTTGC CGATACCGAC TTTGCTGAAT CGGTTCTTCC TGATATCGTG 
CTCAAGGGCG TTTGCACGCA TAATCTTAAA AACATCACCG TTCATATTCC CCGAAACCGG
TTTGTTGTTC TCACAGGAGT CAGCGGATCG GGAAAGTCCA GTCTTGCGTT TGACACACTC
TATGCAGAAG GTCACCGACG TTATGTTGAA TCGCTCTCGG CATATGTTCG TCAGTTTCTT
GAGCGCATGC CTAAACCTCC GATCGAAATT GTCGAAGGTA TCGCTCCGGC TGTCGCCATC
GAGCAGAAGC CCATTCCGAA AAATCCTCGT TCGACCGTTG GCAGTGTTTC GGAGATATAC
GATTATCTGC GTCTTCTTTA CGCAAGGGTT GGTAAAATCT ATTCGCGTGA TACCGATGAG
CTGGTTCTGA AGCATACCCC GGATGATGTG AGCTTGCAGG TGCGCTATTT TGACGAGGGG
GCAAAATTCT ATGCGGGCTT TCCATTTCCA TGTCATACAG ATGAGGCTCA TCATGATTGT
TCGGCCAAGG ATGAAATAGA GAATCTGCTC AAGAAAGGTT TTTTCAGGAT TATTGACGGC
GATACGGTGC TGGATCTTAA TGACGCAGCG GTCTGTAACC GTCTCAAGTC GATGAATCAT
CTCGAACTGT CGTCATTGCT TGTTCTTGTT GACAGGTTTG TTACGCGACA TGAGGATAAA
CTTTATCACC GGGTCGCCCA GGCTGCAGAG ACAGGATTCA TGGAATCCGG CGGGTATGTT
GTGCTGAGAG TGGTTGGCGG AAAAACCTAC CGGTTCAGCG ATAAACTTGA GCTTAACGGT
ATTGAATACC TGGAGCCCTC TCCGCAGCTT TTTGCGTTCA ACTCTCCGAT CGGGGCCTGC
AAAAAGTGCC AGGGATTCGG ACGTATAGCA GGCATTGATG AAGATGCTGT TGTTCCCGAT
AAATCGCTGA GTCTTTTCGA AGGAGCAATT GTCTGCTGGA ATTCCGAAAA GTACCGCTGG
AACCTGAAAC AGTTGCTTGC TGCGGCACCG GAAGCGGGCA TTCCTCTTGA TGTTCCATAC
GAAAAACTCT CTGCAGCCAA TAAGGAGCTT ATCTGGAAGG GTATACCCGG TAAGCGGTCG
GAGTACAAGG GGATCTGGGC GTTTTTTGCG GAAATCGAAA AGGATGCCGG GTATAAAATG
CATTATCGGG TATTCCTGAG CCGTTATCGG GGGTATGCTA CCTGTCCCGA ATGCGAGGGA
TCGCGTCTCA ATCTCGATGC AAGGCTGGTA AGGGTATCCG GCAGGAATAT CTCTGAAGTC
ACCCGCATGA ATATTGCGGA AGCTCGCAAC TTTTTTCTGA ACCTTGATAT CTCTCCGTTT
GACAGAAAGG TTGCAGAGGC GATTCTGGAG GAGATCATCA AGAGGCTTGG CTATCTTCTC
GATGTTGGTC TCGATTATCT TACCCTTGAC CGTCTGACCC ATACGCTTTC CGGAGGAGAG
TTTCAGCGGA TCAATCTCTC CACCTCCATA GGTTCGCCAT TGGTAGGGGC AATCTATGTT
CTTGATGAAC CGAGCATCGG TCTTCATCAG AGTGATTCGT CCAAATTGAT CGCGCTGCTT
AGAAAATTGC GTGATCTTGG AAACACTGTT GTTGTGGTTG AACACGACCG TGAGATTATT
GAGGCGGCTG ACGAGGTGAT CGATCTCGGG CCGAAAGCTG GTCGTCTGGG CGGTGAGGTT
GTTTTTCAGG GGACGATCAG CGAGATGAAG GCCTCCGGAA ATTCACTTAC AGCGGAGTAT
CTGAACGGTG AAAAGGAAAT TGCGGTACCC AAAGATCGAC GGAAAGCTGA CTTTTCATCC
TGCATCTCCA TAAAGGGGGC CATGCAGAAT AATCTGAAAA ATATCGATGT CCGGTTTCCT
CTCGGTATTA TGACCTGCGT TACCGGCGTG AGTGGTTCAG GCAAGTCAAC TCTCGTTAAC
GATATTTTGA AAAACGGACT TCTCAAACAG AAAGAGGGTT TGAAAGAGAA GGTCGGAACA
CATCGTTCAA TTGGCGGCGT GGAACTGATA GACCGTATTG AGCATGTTGA TCAGTCGCCG
ATAGGAAAAT CCAGTCGCAG CAATCCTGTT ACCTATCTGA AAATATTCGA TGACATAAGG
ATGCTGTTTG CCCAGACTGT TGAGGCAAAG GCGAGGGGGT TGCATGCTGG CTATTTTTCC
TTCAATATTC CTGGTGGCCG ATGCGAGGCA TGCGCCGGAG AGGGAGTTGT CAGGATCGAG
ATGCAGTTTC TTGCCGATAT CGAAGCCGTT TGTGAAGAGT GCGGCGGATC GCGCTACAAA
CAGGAGACTC TTGAGATCAC TTTCAATGGT CGATCGATTA TGGATGTTCT CGATCTCACG
GTCAGTGAAG CGATTGAGTT TTTCAATGGT GAAAAAAATG TTTTGCGCAA GTTGCAGGTG
CTTGAAGAGG TTGGTCTCGG CTATATCCGT CTTGGACAGT CATCCAGCTC GCTTTCGGGT
GGTGAAGCAC AACGGTTGAA GCTTGCCAGC TTTATTGCGC ATGCCGATAC CCGGCACACC
CTTTTTCTGT TTGATGAACC TACTACCGGG CTGCATTTCG AAGATATCAG CAAGCTGATT
CGCTGTTTTG AGAAACTGCT TGAGCAGGGA AATACACTGG TTATTATCGA GCATAATCCC
GATATCATCA AGCAGGCAGA CTGGGTTATC GATCTCGGGC CGGGAGCGGG AGACAAGGGA
GGATCCATCA TGGCCGAAGG CACTCCCGAA AAAATTGTCG AGTGCAAGGA GTCTTTGACG
GGCTTGCATC TCAAGCCCTA CCTGCATTCA TGA
 
Protein sequence
MTTQRLADTD FAESVLPDIV LKGVCTHNLK NITVHIPRNR FVVLTGVSGS GKSSLAFDTL 
YAEGHRRYVE SLSAYVRQFL ERMPKPPIEI VEGIAPAVAI EQKPIPKNPR STVGSVSEIY
DYLRLLYARV GKIYSRDTDE LVLKHTPDDV SLQVRYFDEG AKFYAGFPFP CHTDEAHHDC
SAKDEIENLL KKGFFRIIDG DTVLDLNDAA VCNRLKSMNH LELSSLLVLV DRFVTRHEDK
LYHRVAQAAE TGFMESGGYV VLRVVGGKTY RFSDKLELNG IEYLEPSPQL FAFNSPIGAC
KKCQGFGRIA GIDEDAVVPD KSLSLFEGAI VCWNSEKYRW NLKQLLAAAP EAGIPLDVPY
EKLSAANKEL IWKGIPGKRS EYKGIWAFFA EIEKDAGYKM HYRVFLSRYR GYATCPECEG
SRLNLDARLV RVSGRNISEV TRMNIAEARN FFLNLDISPF DRKVAEAILE EIIKRLGYLL
DVGLDYLTLD RLTHTLSGGE FQRINLSTSI GSPLVGAIYV LDEPSIGLHQ SDSSKLIALL
RKLRDLGNTV VVVEHDREII EAADEVIDLG PKAGRLGGEV VFQGTISEMK ASGNSLTAEY
LNGEKEIAVP KDRRKADFSS CISIKGAMQN NLKNIDVRFP LGIMTCVTGV SGSGKSTLVN
DILKNGLLKQ KEGLKEKVGT HRSIGGVELI DRIEHVDQSP IGKSSRSNPV TYLKIFDDIR
MLFAQTVEAK ARGLHAGYFS FNIPGGRCEA CAGEGVVRIE MQFLADIEAV CEECGGSRYK
QETLEITFNG RSIMDVLDLT VSEAIEFFNG EKNVLRKLQV LEEVGLGYIR LGQSSSSLSG
GEAQRLKLAS FIAHADTRHT LFLFDEPTTG LHFEDISKLI RCFEKLLEQG NTLVIIEHNP
DIIKQADWVI DLGPGAGDKG GSIMAEGTPE KIVECKESLT GLHLKPYLHS