Gene Clim_0455 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagClim_0455 
Symbol 
ID6354450 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium limicola DSM 245 
KingdomBacteria 
Replicon accessionNC_010803 
Strand
Start bp515188 
End bp518043 
Gene Length2856 bp 
Protein Length951 aa 
Translation table11 
GC content56% 
IMG OID642668086 
Productexcinuclease ABC, A subunit 
Protein accessionYP_001942527 
Protein GI189345998 
COG category[L] Replication, recombination and repair 
COG ID[COG0178] Excinuclease ATPase subunit 
TIGRFAM ID[TIGR00630] excinuclease ABC, A subunit 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAATTCA GCCATATCAG TATCAGGGGA GCCCGAGTCC ACAACCTGAA GAATATATCT 
CTCGACATCC CGCGCAACCG GTTTGTCGTC ATTACCGGCA TATCCGGATC GGGAAAATCG
AGTCTCGCAT TCGACACCAT CTTCGCCGAA GGGCAGCGGC GCTTTATGGA AACCCTCTCT
CCTTACGCGC GACAGTATAT CGGCAACATC GAGCGACCCG ACGTGGACTT CATCGAAGGG
CTCTCTCCGG TGATCGCCAT CGACCAGAAA AGCACCAACC GTTCCCCCCG TTCGACGGTG
GGTACCATAA CGGAAATTCA CGACTTCATC CGGCTGCTCT ATGCAAAAGC CGGACGCCGG
TACGATCCCG TAACGGGGCA TATGCTGCAG AAACAGTCGC CCGAAAGCAT TGTCGAAGCA
ATTCTCTCGC TTCCCGAAGG CACCAAAGTA AGGATTCTTT CTCCGCTTGT CACCGGACGC
AAGGGGCACT ACCGTGAGCT CTTCGAACGA TTGCTGCTCA AGGGATTCGT GCGCGTCCGC
ATCGATGGCG AGGAGCAGGA GATGGAAAAA AACATGCAGA TCGAGCGGTA CAAAACCCAT
ACCATCGAAC TGGTTGTAGA CCGCCTTGCG GTCGGTACCG AAAGCGCCGA CCGGCTTAAA
CAGGCCGTCG AAATGGCCAT CAGCATGTCG GAGCACCGCT CCTCGGTGAT CTGCGCGCCT
CTCGATACCG ATATCAAGGA ACTGTTCTTC AGCACACAGT ACGCCTATTC CGACGGCTCC
GTGCCGATCG ACACCCTCGC GCCTAACCAG TTCAGTTTCA ATTCACCCTA CGGAGCCTGC
CCCGAATGCA ATGGCCTCGG GGAACTCATG CAGCTCTCCG GTGATCTCAT GATTCCCGAC
CCTTCGCTTT CTCTCAATCA GGGCGCCATA GAACCATTCG GAAAACCCGG CAAGCGCAAC
CTCTGGCAAA TCATCAAAGC CATTGCAAAG CAGTTCAAAT TCACGCTCGA CACTCCGATC
TCGGAAATTC CTCCAAAAGC GCTCGACACC CTGCTGCACG GCTCAGGCAG TCGCACGTTC
GACGTGATCT ATGCATACGC CGGTAAAGAA CACGGTTATC CTCAAATCTT TCAGGGAGCC
ATACCTTACG TTGACGAAAT GCTGCGCAAC AGCAGTTCGT CCAAAATGCG TGAATGGACC
GAAAGTTTTA TGGTACACCA GCCATGTCCG CTCTGCAAGG GTGCCCGACT GAGGCAGGAA
AGCCTCGCAG TCAAAATCAA CGGACTGAAT ATCGCCGAGG CCGAATCGCT CCCGCTACCT
GAAGCGCTCG ACTTCTTCAG GGAACTGCCG CCGGTGCTTA CCGGCCGCGA ACAGCTTGTC
GCAACTCCCG TGCTGCATGA AATCACCAAA CGGCTCGAAT TCCTGCTCGA TGTCGGGCTC
AGCTACCTCA CCCTTGCCCG CAACGCACGA ACACTTTCCG GAGGCGAGGC ACAGCGAATC
CGCCTTGCCT CGCAACTTGG TTCACAACTC AGCGGCGTGC TCTATGTGCT CGACGAACCA
AGCATTGGCC TGCACCAGCG CGACAACCAC AAACTCATCG CATCGCTGCA GCACCTGCGC
GACATCGGCA ACACCGTACT CGTCGTCGAG CATGATAAAG ACACCATGCT CATGGCCGAC
GAAATTATCG ATCTCGGTCC CGGCGCAGGC GAATACGGCG GAGAGATCGT AGCCAAAGGC
CCTGCTTCAC AGCTCGGCCC CGACTCCCTG ACGGCAGCAT ACCTCACCGG CCGCAAAACC
GTTTTTTTCG AACCCGAATC AAAAGAGAAA ACAGGGAAAC AGCAGTACAT CACCATCAAG
GGATGCCGCG GCAACAACCT CCGGAACATC GACGTACGTT TTCCGCTCGG CTCGCTCATC
AGCATCACCG GAGTCAGCGG TTCGGGCAAA TCGACCCTTA TCAACGAAAC CCTCCACCCG
GTTCTTGCAC GTCACTTCTA CCGATCCAAG CTGATCAGCC AGCCATATTC CGTCATTCAG
GGCATCGACC TGATCGACAA GGTGGTCAAC GTCGATCAGA CGCCCATCGG CCGCACCCCG
CGCTCCAATC CTGCAACCTA CACGGGAGCA TTCACCTTCA TCCGCGACTT CTTCACCCGC
CTGCCCGAAG CGCAGATCCG TGGCTACAAA GCCGGACGCT TCAGCTTCAA CGTCAAGGGT
GGACGGTGCG AAGTGTGTCA GGGTGCCGGA ACCATGAAAA TAGAGATGAA CTTTCTGCCG
GATGTTTACG TGCCGTGTGA AAACTGCAAG GGACAGCGAT ACAATCGCGA AACCCTGCAG
GTGAAATACA AAGGGAAATC CATCGCCGAT GTGCTTGAAA TGCCTATCGA AGAGGCCTCG
GGCTTTTTTG AGGAGTTTCC CCGAATCCGC AGAATCCTCT CCACCATGGA AAGCGTCGGT
CTCGGCTATC TCAAACTCGG CCAGCCATCC CCCCTGCTCT CGGGAGGCGA AGCGCAGAGG
ATCAAGCTTT CGGCAGAACT TGCCAAAATC CAGACCGGCA AAACCCTCTA CATTCTCGAC
GAACCCACAA CCGGCCTGCA CTTCCAGGAC ATCCAGCACC TGCTCGAAGT CCTGCGGAAA
CTCGTTGACA AAGGCAATAC GGTCATCATC ATCGAGCACA ACCTCGACAT CATCAAAAAC
AGCGACTGGG TCATCGATCT CGGCCCCGAG GGAGGCAGCG GGGGCGGTCA GCTCGTCGCC
GAAGGAACCC CGACGGAAAT TGCCGCACTG GCACACTCCT ATACCGGTAA CTTTCTGAAA
ATCGAGCTGG AAAGCCATGA AAACAGCAGC AACTGA
 
Protein sequence
MQFSHISIRG ARVHNLKNIS LDIPRNRFVV ITGISGSGKS SLAFDTIFAE GQRRFMETLS 
PYARQYIGNI ERPDVDFIEG LSPVIAIDQK STNRSPRSTV GTITEIHDFI RLLYAKAGRR
YDPVTGHMLQ KQSPESIVEA ILSLPEGTKV RILSPLVTGR KGHYRELFER LLLKGFVRVR
IDGEEQEMEK NMQIERYKTH TIELVVDRLA VGTESADRLK QAVEMAISMS EHRSSVICAP
LDTDIKELFF STQYAYSDGS VPIDTLAPNQ FSFNSPYGAC PECNGLGELM QLSGDLMIPD
PSLSLNQGAI EPFGKPGKRN LWQIIKAIAK QFKFTLDTPI SEIPPKALDT LLHGSGSRTF
DVIYAYAGKE HGYPQIFQGA IPYVDEMLRN SSSSKMREWT ESFMVHQPCP LCKGARLRQE
SLAVKINGLN IAEAESLPLP EALDFFRELP PVLTGREQLV ATPVLHEITK RLEFLLDVGL
SYLTLARNAR TLSGGEAQRI RLASQLGSQL SGVLYVLDEP SIGLHQRDNH KLIASLQHLR
DIGNTVLVVE HDKDTMLMAD EIIDLGPGAG EYGGEIVAKG PASQLGPDSL TAAYLTGRKT
VFFEPESKEK TGKQQYITIK GCRGNNLRNI DVRFPLGSLI SITGVSGSGK STLINETLHP
VLARHFYRSK LISQPYSVIQ GIDLIDKVVN VDQTPIGRTP RSNPATYTGA FTFIRDFFTR
LPEAQIRGYK AGRFSFNVKG GRCEVCQGAG TMKIEMNFLP DVYVPCENCK GQRYNRETLQ
VKYKGKSIAD VLEMPIEEAS GFFEEFPRIR RILSTMESVG LGYLKLGQPS PLLSGGEAQR
IKLSAELAKI QTGKTLYILD EPTTGLHFQD IQHLLEVLRK LVDKGNTVII IEHNLDIIKN
SDWVIDLGPE GGSGGGQLVA EGTPTEIAAL AHSYTGNFLK IELESHENSS N