Gene Acid345_1189 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1189 
Symbol 
ID4072601 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp1469924 
End bp1471024 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content60% 
IMG OID637983199 
ProductDNA polymerase III, delta prime subunit 
Protein accessionYP_590266 
Protein GI94968218 
COG category[L] Replication, recombination and repair 
COG ID[COG2812] DNA polymerase III, gamma/tau subunits 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCTTCT CCGACTTCCA CGGCAATCCT GCCACGGTTC AATCAGCGCG CGAAATGCTT 
GGGCGCGGGC GGTTCCCACA TGCGGTGATC CTGAGCGGGC CGCGCGGTAG CGGCAAGTTC
ACACTGGCGC AGATGATTGC CAAGGCGATG AACTGCCTTG AGCCGCCGAT GACCGACGAC
GGCCTGCCCG ATTTCTGTGG ACGCTGCTCA AACTGCGAGC GGATTGCGCA AGCGGACGAT
CTCGATGCGC GATTTACGGA GGCTGTCGAG GCGCGTGAGG GCCTGCGCGA GACCGACAAA
AAGGAAACTC GCATCCTGGT GCAGACGCAT CCGGACGTGC TCATCATTCC GCCTGATCCG
CCGCAGATGC TGGTGAAGGT GGACCAGGTG CGCCACGTGA TTGGGCACAT CTATTACAAG
CCGACGCAGG GCGAGCACAA GGTTTACATC TTCCCTACCG CCAACTTCAT GAAGGAGGCG
GCGAACTCGC TGCTGAAAAT TCTGGAGGAG CCGCCGGAAT TTGCGACGAT CTTCCTGCTT
TCGGAAAATC CCAGTTCCCT GCTGGCGACG ATTCGTTCGC GTTGTGTGCA ATTGCGGCTG
GAAGCGATCG CGCCGGAAGA TGTGGAGAGT TATCTGGAGA AAGAGCGTCC GGAGTGGGCG
GCGCGGCAGC GGGCGCTGGT GGCACGTCTC TGCGGGGGTG GTATCGGCCA GGCCAGGACC
TTCGATCTCG CGGCGTACAC GGCGGCGCGT CAGGATGCCC TGACGTTGTT GCGATCTTCC
GTCGCGGCGC AGGACCATAC CGAGCTCTTC AAAGTGACGG AGGGCTACCG CGCAGGAGCG
GAAGGAAAAG AGAAAACCGA CCAACTGATC CGGGCGAGCT ACTCTCTGCT GCAGGACCTT
CTGTATTTAC TTTCCGGTAC TCCCAAATTG GTCCGAAACA CGGACTTGGG CAGCGAATTA
ACCAAGCTGG CACAATCGGT TGATTTGGGA TGGGTGCAAA AGGCGGCGCT CAAGCTTGGT
GAGGTGGAGA CGGGCATGCG GCGTAACCTG TTGCGCAGCT TATCCCTAGA CGCCTTTGCC
ACGTCGCTGG AACGCGCCTG A
 
Protein sequence
MPFSDFHGNP ATVQSAREML GRGRFPHAVI LSGPRGSGKF TLAQMIAKAM NCLEPPMTDD 
GLPDFCGRCS NCERIAQADD LDARFTEAVE AREGLRETDK KETRILVQTH PDVLIIPPDP
PQMLVKVDQV RHVIGHIYYK PTQGEHKVYI FPTANFMKEA ANSLLKILEE PPEFATIFLL
SENPSSLLAT IRSRCVQLRL EAIAPEDVES YLEKERPEWA ARQRALVARL CGGGIGQART
FDLAAYTAAR QDALTLLRSS VAAQDHTELF KVTEGYRAGA EGKEKTDQLI RASYSLLQDL
LYLLSGTPKL VRNTDLGSEL TKLAQSVDLG WVQKAALKLG EVETGMRRNL LRSLSLDAFA
TSLERA