Gene Cpha266_1371 details


Gene Information

Locus tag: Cpha266_1371
Symbol:
ID: 4569241
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium phaeobacteroides DSM 266
Kingdom: Bacteria
Replicon accession: NC_008639
Strand:
Start bp: 1566453
End bp: 1567703
Gene Length: 1251 bp
Protein Length: 416 aa
Translation table: 11
GC content: 54%
IMG OID: 639765958
Product: exodeoxyribonuclease I subunit D
Protein accession: YP_911824
Protein GI: 119357180
COG category: [L] Replication, recombination and repair
COG ID: [COG0420] DNA repair exonuclease
TIGRFAM ID: [TIGR00619] exonuclease SbcD
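
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal Python sketch, assuming inclusive 1-based coordinates and a single terminal stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers for illustration, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive, 1-based coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acids encoded by a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1566453, 1567703)
print(length)                  # 1251
print(protein_length(length))  # 416
```

Both values agree with the Gene Length and Protein Length entries in the record.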


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAATGTC TTCATACTTC CGACTGGCAT TTAGGCCGAA CTCTTTATGG CAGGAGCCGA 
TACAGCGAGT TTTCAGCTTT TCTCGACTGG CTTTCAGCTC TCATTGAAAA GGAAAAGGTC
GATCTGCTTC TCGTGGCCGG AGATGTGTTC GATACCACGG TGCCGGGTAG TCGGGCTCAG
GAGCTCTACT ATGGATTTCT CAACAAGGTT GCCCGATCGA CGGTTTGCCG GCATGTGGTG
ATCACCTCCG GTAACCACGA TTCTCCGTCG CTTCTCGATG CTCCGAAAGC TCTTCTGCGT
ACGCTCGATG TGCATGTGGC AGGCTCGGTG ACCGACGATC CGATGAACGA CGAAGTACTG
GTGCTTCGAG ACAAAACCGG AAAGGCTGAA GCTGTCGTGT GCGCCGTGCC GCATCTGCGT
GATCGCGATA TCCGTAGTGT GGGGCCGGGC GAAAGCATGC AAGACAAAAG TCTCAAGCTA
CTCGAAGGCA TTGAGCGCCA TTACCGCGAA GTGTGCAGGG CAGGGGAGGA ACGGCGGGTT
CAGGAGGGAG GGCGTATTCC CATGATCGTC ATGGGCCATC TGTTTACGGC CGGAGGTCAA
ACCATTGATA ACGACGGTGT TCGCGAAATC TATGCGGGCT CCCTTGCCCA TGTGAGCCGT
TCGATTTTTC CTCCGGGAAT AGACTATCTT GCGCTCGGCC ATCTGCATGT TCCGCAGATT
GTAGGAGGCG AAGAGCATCT GCGTTACAGC GGATCGCCAA TACCTATGGG GTTTGGCGAG
GCCAGCCAGC AGAAAATGGT TGTGCTTGTT GAATTTACGG AGGGCGAACG GCTGATCACC
GAAATTCCGA TACCCTGTTT TCAGCCGCTC GAACGCATCA GCGGTACGCT GCAGGCCATA
ACTGACCGCA TTTTCGAATT GAAGGCTGCA GGCAGCAAGG CCTGGCTGGA GATCGAGTAT
ACCGGCGACG ATCTTTCCGG AGACCTGAGA GGTCTGCTTG ACAATGCCGT GAAAGGGTCA
CTGCTTGAAA TCCGCAGAAT CAAGAACAAC CGGATTGTCG CTCTTGCCCT CAAGCAGCTA
TCGGAAGACG AATCGCTCGA TGAACTCGGA GAACTCGACG TCTTTAATCG TTGCCTCGAT
GCGGCAAAGG TTCCGCAGAC TCAGCGCCCG GAGCTTCTTC AGGCCTATCA GGAGATTCTG
GTTTCGCTGC ATGAAGATGA GAAGAAGATC AAGAGTGAGC CAAGGGAGTA A
 
Protein sequence
MKCLHTSDWH LGRTLYGRSR YSEFSAFLDW LSALIEKEKV DLLLVAGDVF DTTVPGSRAQ 
ELYYGFLNKV ARSTVCRHVV ITSGNHDSPS LLDAPKALLR TLDVHVAGSV TDDPMNDEVL
VLRDKTGKAE AVVCAVPHLR DRDIRSVGPG ESMQDKSLKL LEGIERHYRE VCRAGEERRV
QEGGRIPMIV MGHLFTAGGQ TIDNDGVREI YAGSLAHVSR SIFPPGIDYL ALGHLHVPQI
VGGEEHLRYS GSPIPMGFGE ASQQKMVVLV EFTEGERLIT EIPIPCFQPL ERISGTLQAI
TDRIFELKAA GSKAWLEIEY TGDDLSGDLR GLLDNAVKGS LLEIRRIKNN RIVALALKQL
SEDESLDELG ELDVFNRCLD AAKVPQTQRP ELLQAYQEIL VSLHEDEKKI KSEPRE
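
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a minimal sketch in plain Python, not the method used by the source database. It assumes the codon-to-amino-acid assignments of the standard genetic code, which translation table 11 shares (the two tables differ in their permitted start codons; this gene starts with ATG, so no special handling is needed). The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order (first base varies slowest).
# Assumption: table 11 uses the same assignments as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks that group residues in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in a DNA sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above (60 bases, 20 codons).
first_line = "ATGAAATGTC TTCATACTTC CGACTGGCAT TTAGGCCGAA CTCTTTATGG CAGGAGCCGA"
print(translate(first_line))  # MKCLHTSDWHLGRTLYGRSR
```

The printed residues match the first twenty residues of the protein sequence. Passing the full gene sequence to `gc_content` gives a value that can be compared with the GC content listed in the record.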