Gene Cpha266_0475 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpha266_0475 
Symbol 
ID4568519 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides DSM 266 
KingdomBacteria 
Replicon accessionNC_008639 
Strand
Start bp523801 
End bp525831 
Gene Length2031 bp 
Protein Length676 aa 
Translation table11 
GC content52% 
IMG OID639765074 
ProductDNA ligase, NAD-dependent 
Protein accessionYP_910956 
Protein GI119356312 
COG category[L] Replication, recombination and repair 
COG ID[COG0272] NAD-dependent DNA ligase (contains BRCT domain type II) 
TIGRFAM ID[TIGR00575] DNA ligase, NAD-dependent 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGATG CCGTCAATGT CCGCAAGGCT ATAGAAGAAC TACGCAGGGA GATCAACAGG 
CACAATACGC TCTACTATGT TGATGCCCGT CCTGAAATTT CCGATTACGA GTTTGATCGG
TTGATGGAAC GCCTCATCGA ACTCGAAAAG GCTTTTCCGG ATTTTGTCAC ACCGGACAGC
CCTTCTCATC GGGTTGGTGG TGCAATTACC AGAGAGTTTC CTTCGGTACG GCACAGGGAG
CCGATGCTGA GCCTTTCCAA TACCTACTCG CTTGCAGAGG TCGAGGATTT TTACAGTCGG
GTGAAAAAAC TTCTGCTGGT GGAAGGGGTG CGCGAGCAGG ATATGGCAGC CGAACTGAAG
TTTGACGGGG TGGCCATAAG TCTGCTGTAT CGCGATGGTC TTCTCGTTCA GGGTGCGACA
AGAGGTGACG GTACCGAGGG GGACGATATT ACTACAAATC TGAAAACCTT GTCGAGTGTG
CCGCTCAGGA TTCCTCTGAC AAGCCTTCCG GTTATTGAGG GCATAGAGCG CGATATCGAG
ATCCGCGGAG AGGTGTTCAT GCAGAAGGAT GATTTTGCCC GTCTTAACGA GGCTCGTCCT
GAAGAAGACC GGTTTGCCAA TCCTCGTAAT GCAACTGCCG GAACGCTTAA ATTGCAGGAT
TCCGGAGAGG TTGCCCGTCG ACGTTTGAAT TTTGTGGCTT ATTATCTGCG GCTTTCGAGC
CAGGACTCTT TTCTGCTTAC CCAGTATGAC CGGCTTGAAC TGCTCGGGCA GCTCGGTTTC
TTTACCGGGA AGCACTACAG GTTATGTCGC GATATGAAGG AAATCGGTGA TTTTATCGGC
TATTGGGCCG TTGAGAGGTG GAACCTTCCT TATGAAACGG ACGGAGTTGT CCTGAAACTC
AACGATGTCG CACTATGGCC GAGAATTGGC GCAACCGCAA AAAGTCCTCG CTGGGCTATT
GCCTACAAGT ATCCCGCACA GCAGGCGAAA ACAGTTCTGC AAGGGGTGGC ATTTCAGGTC
GGCAGACTCG GAACGGTGAC TCCTGTTGCT GAACTTGAGC CTGTTCGGCT TGCCGGTTCA
ACGGTGTCGC GTTCAACCCT GCATAATTTT GACGAGATAG AGCGGCTTGG TCTCATGCTT
CATGATCGGG TGATTATTGA AAAATCAGGG GAGGTTATCC CCAAAGTGAT TCGGGTTGTT
CTCGATGAAC GACCCGAAAA TGCAGAGCCT GTCGGCGTGC CTTCGGTTTG TCCCGAGTGC
GGCGCTTCGC TCGAAAAACC CGAAAACGAG GTCAGTTACT ACTGTCCCGA TCAAGATTCG
TGCCCGGCAC AGATCAGGGG ACGCTTGCTC CATTTTGCTT CGCGAAATGC CCTTGACATT
CAGTCGCTTG GTGAATCGCT GGTCGCTCAG CTTGTTCAAA AGGGTCTTGT GAGGGATGCC
GGAGATCTTT ATCTTCTTCA GCAACCGCAG CTTGAGGCTC TTGAGAGGAT GGGACGAAAA
TCGGCGCACA ATCTCCTTCG CGCACTTCAA GAGAGTCGGG AGAAGAGTTA TGAACGCTTG
CTCTATGCGC TTGGGATTCG TCATGTCGGT CAGGCTACGG CCCGTGAACT TGCCAGGGCC
TATGAGACGG TTGAGGCCTT GCAGAATGCT TCGGAGGAGG AGCTCGCAAC GGTGCCTGAT
GTCGGGCCGG TTGTTGCCCA TAGCGTCAAG GACTATTTTT CAAAATCGTC TACACAGATG
CTGCTCCGGA AACTCAGGGA TGCCGGCTTT CCTTTACGTT TCACCGGGCC TCAAAAGCTG
ATCAACAGGA ATTTCGAGGG TGTGAATGTT CTGTTTACGG GAGCGCTCGA ACGGTATGGA
CGGCAACAGG CATCGGAACT TGTTCAGCAA CGGGGAGGGA GGGTTGTTGG TTCGGTCAGC
AGAAGCACCG GAGTTGTTGT TGCCGGTAGT GAACCCGGCA GCAAGCTTGA CAAAGCGAGA
AAGCTCGGCG TAAAGGTGAT CAGCGAGGAT GAATTTAATG CAATGTTGTA A
 
Protein sequence
MSDAVNVRKA IEELRREINR HNTLYYVDAR PEISDYEFDR LMERLIELEK AFPDFVTPDS 
PSHRVGGAIT REFPSVRHRE PMLSLSNTYS LAEVEDFYSR VKKLLLVEGV REQDMAAELK
FDGVAISLLY RDGLLVQGAT RGDGTEGDDI TTNLKTLSSV PLRIPLTSLP VIEGIERDIE
IRGEVFMQKD DFARLNEARP EEDRFANPRN ATAGTLKLQD SGEVARRRLN FVAYYLRLSS
QDSFLLTQYD RLELLGQLGF FTGKHYRLCR DMKEIGDFIG YWAVERWNLP YETDGVVLKL
NDVALWPRIG ATAKSPRWAI AYKYPAQQAK TVLQGVAFQV GRLGTVTPVA ELEPVRLAGS
TVSRSTLHNF DEIERLGLML HDRVIIEKSG EVIPKVIRVV LDERPENAEP VGVPSVCPEC
GASLEKPENE VSYYCPDQDS CPAQIRGRLL HFASRNALDI QSLGESLVAQ LVQKGLVRDA
GDLYLLQQPQ LEALERMGRK SAHNLLRALQ ESREKSYERL LYALGIRHVG QATARELARA
YETVEALQNA SEEELATVPD VGPVVAHSVK DYFSKSSTQM LLRKLRDAGF PLRFTGPQKL
INRNFEGVNV LFTGALERYG RQQASELVQQ RGGRVVGSVS RSTGVVVAGS EPGSKLDKAR
KLGVKVISED EFNAML