Gene Acid345_1547 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1547 
Symbol 
ID4072938 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp1892614 
End bp1893891 
Gene Length1278 bp 
Protein Length425 aa 
Translation table11 
GC content59% 
IMG OID637983556 
ProductDNA-directed DNA polymerase 
Protein accessionYP_590623 
Protein GI94968575 
COG category[L] Replication, recombination and repair 
COG ID[COG0389] Nucleotidyltransferase/DNA polymerase involved in DNA repair 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.153092 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.161564 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAGAAA TCACGGGTCG AAATTCGGCG GCTGCGGACC CTCGACGCGT CATTTTTCAC 
GTCGACATGG ATGCCTTTTT TGTCTCCGTC GAAGAGTTGT TCGACCCTTC CCTCAAGGGG
AAAGCCGTCG TGGTAGGCGG CCATCGCGAC GAGCGTGGTG TCGTCTCAGC GGCCTCGTAT
GCGGCACGCA AGTTCGGCGT ACACTCGGCA ATGCCGCTCC GGACCGCGGC CAAACTCTGT
CCTGACGCCA TCTTCGTGAA CGGACATCCC GAACGCTACC GCGAATATTC CGGCAAAGCT
TTTGAGGTAC TGAAGAAGTT TTCGCCGAAA GTCGAAATGG CTTCCATTGA TGAGGCCTAC
CTCGACATGA CCGGTACCGA GCGCCTGCAC GGGCCGCCGC TCAAGGCCGC GCATGCTCTC
CACCAGTTCA TGAAGTCTGA AACTAAGTTG AACTGCTCCA TCGGCATCGG CACCTCGCGG
CTTATCGCCA AGGTCTGCAG CGACCAGGCC AAGCCCAACG GCGTCCTTTA TATCGTGCCA
GGACAGGAGG CCAGCTTCCT CGCCCCACTA AGCGTGCGTA AAATTCCGGG CGTCGGCAAG
GTCTTCGAAC AGAAGCTCAA TGAAATCGGC ATTAAGAAGG TCGGCGATCT CGCCAAACTC
GATGACGCAT TCCTACGCGA GCGCTTCGGC GAGTGGGGAC TCGCGCTGGC AGGGAAGTCC
CGCGGCCTCG ACGCCGGCGG ATACTTCGAT CGCGATGTCG CGGTGGACGA AGGACCGAAG
TCGATCAGCC ACGAGCATAC CTTCAACACC GACACGCGCG ATCAAGAGAA ACTCGAAGCC
ATGATCGCGC GACTGAGCGA GATGGTCTGC CGCAGGGTGC GCGAGCACGA GCTCCATGCC
CGCACGGTGC AGATCAAGTT GCGCTATTCC GATTTCACCA CCATCACGCG CGCCCACTCA
CTCGAACAGC CCACGCAACT CGATACGGTC GTTGCCGAAA CTGCGCGCAC TCTTTTTCGC
GACAACTGGC AGCGCGGCCG AACCATTCGC CTCATCGGCG TACATGTCGC CGGCTTTGAC
GATGTTCCTC AGCAACTCGA TCTGCTTACT CAGACGCACG ACGACAAAGT CAGCAAAGCT
CTTTCGGTAG TTGATCGAAT GCGCGACAAG TTCGGCGAAA ACGCGGTCTC GCTTGCCACC
GGCCTGAAAT CGCGTTTCCG CGAGAAGACC CACGAGAACC CAGCCAGCCT TCCGGGGAAA
TCGAAGAAGA AAGAATAG
 
Protein sequence
MEEITGRNSA AADPRRVIFH VDMDAFFVSV EELFDPSLKG KAVVVGGHRD ERGVVSAASY 
AARKFGVHSA MPLRTAAKLC PDAIFVNGHP ERYREYSGKA FEVLKKFSPK VEMASIDEAY
LDMTGTERLH GPPLKAAHAL HQFMKSETKL NCSIGIGTSR LIAKVCSDQA KPNGVLYIVP
GQEASFLAPL SVRKIPGVGK VFEQKLNEIG IKKVGDLAKL DDAFLRERFG EWGLALAGKS
RGLDAGGYFD RDVAVDEGPK SISHEHTFNT DTRDQEKLEA MIARLSEMVC RRVREHELHA
RTVQIKLRYS DFTTITRAHS LEQPTQLDTV VAETARTLFR DNWQRGRTIR LIGVHVAGFD
DVPQQLDLLT QTHDDKVSKA LSVVDRMRDK FGENAVSLAT GLKSRFREKT HENPASLPGK
SKKKE