Gene Acid345_4497 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_4497 
Symbol 
ID4070175 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp5337702 
End bp5339042 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content58% 
IMG OID637986536 
Productdeoxyribodipyrimidine photo-lyase type II 
Protein accessionYP_593571 
Protein GI94971523 
COG category[L] Replication, recombination and repair 
COG ID[COG0415] Deoxyribodipyrimidine photolyase 
TIGRFAM ID[TIGR00591] photolyase PhrII 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.112755 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.578419 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTACTGGA TGCAGCGCGC GCAGCGGGCA TTCGATAATC CGGCGCTCGA TGTAGCCGTG 
CAGGCGGCAA ACGCGCTGAA GCTGCCGTGC GTGATCTTCT TCGCGCCGGT GCCGTTTTAT
CCGCATGCGA ATTTGCGGCA TTACGCGTTC CTGCAACAAG GTATTCCTGA CATTGCGGAG
ATGGCGGAGG AACGCGACAT TGGGTTCGTG CTGCGGCGGT TCCCGGAACA TTCGCTGATC
AAGTTTTGTG ACGAAGTCAA AGCGGCGCTC GTGATCGGCG ATGAGAACCC CATGCGTGAG
CTGCGAGAAT GGCGCGAGAT TGCCGCGAAG AAGCTCAGGG TGCCGCTGTG GACGGTGGAT
GCCGATGTGA TTGTGCCTTC GAAGCTGCTG GGGAAAGAAC AGTATGCGGC GCGGATTATT
CGCCCGCGGC TGAAGGCGCA TTTCAAAGAG TTGCTTGTGC CGTGTGAAAA CCCAAGAGCG
CACGTGAAAT GGCATGCGGC GCGCGGATTG CAGCAGTTGG ACTGGCGCGG AACCGAGGAC
ATCACCGAAG GATGGGAGAT CGACCGCTCG GTGAAGCCGG TGGATTCGTT TCATGGAGGA
ACGCGGGAAG CGCTGCGGCT GCTGGATGAA TTCGTGAAGC ATGGGTTGAC GCGTTATCCC
GAACGGCACA ATCAGGCGGA TGAAAACGGA ACGGCGCGGC TGTCGCCGTA TCTGCATTTC
GGGCATATTG GGCCGCATAC CGTCGCGCTG GCGGTGGAGA AGTCAAAGGT TCCGCGGGAG
GCGAAGGACG ACTTTCTCGA CCAGGTAATT ACGTGGCGGG AGTTGGCAAT CAATATGGTG
CACTTCAATC CGTTGTACGA CACGCTTGAG TGCGGAGAGA ACTGGGCGCA CAAGACGCTC
GGTGAACATG CGAAAGATCC GCGGCCGATC CAATACACGC CCAAGCAACT CGAAGCGGCC
GAGACGTACG ACGATCTCTG GAATGCCGCG CAGTTGCAGA TGGTGCATGC GGGCTGGATG
CACAACTACA TGCGGATGTA CTGGGCGAAG AAGATCCTGG AGTGGAGCAA ATCGCCGCAG
GTGGCTTACA ACACCGCTGT GTATTTGAAC GACAAGTACT TTCTGGATGG GCGCGATCCG
AATGGGTATG CGGGGATTGC GTGGGCTATC GTCGGAAAGT TTGATCGTCC GTGGTTTCCG
CGGCCGATCT TTGGGACGAT CCGCTACATG GCGCGCAGTG GAGCGGAAAA GAAGTTCGAC
GCCGAGAAGT ACATTGCGCA GATGTATGCG TTGGCGGGGA GAGAGCGCGG GAAGGATAAG
GAAGCGCCGA TGCTTTTTTG A
 
Protein sequence
MYWMQRAQRA FDNPALDVAV QAANALKLPC VIFFAPVPFY PHANLRHYAF LQQGIPDIAE 
MAEERDIGFV LRRFPEHSLI KFCDEVKAAL VIGDENPMRE LREWREIAAK KLRVPLWTVD
ADVIVPSKLL GKEQYAARII RPRLKAHFKE LLVPCENPRA HVKWHAARGL QQLDWRGTED
ITEGWEIDRS VKPVDSFHGG TREALRLLDE FVKHGLTRYP ERHNQADENG TARLSPYLHF
GHIGPHTVAL AVEKSKVPRE AKDDFLDQVI TWRELAINMV HFNPLYDTLE CGENWAHKTL
GEHAKDPRPI QYTPKQLEAA ETYDDLWNAA QLQMVHAGWM HNYMRMYWAK KILEWSKSPQ
VAYNTAVYLN DKYFLDGRDP NGYAGIAWAI VGKFDRPWFP RPIFGTIRYM ARSGAEKKFD
AEKYIAQMYA LAGRERGKDK EAPMLF