Gene Acid345_3752 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3752 
Symbol 
ID4069327 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4428141 
End bp4430540 
Gene Length2400 bp 
Protein Length799 aa 
Translation table11 
GC content56% 
IMG OID637985774 
Productprimase P4-like protein 
Protein accessionYP_592826 
Protein GI94970778 
COG category[R] General function prediction only 
COG ID[COG3378] Predicted ATPase 
TIGRFAM ID[TIGR01613] phage/plasmid primase, P4 family, C-terminal domain 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCCGT TTTACGGAGT CCATGAAGTA CGCGTCTTGC GGGAGTCCGG TGTCGCCGTT 
GGATACTTTA ATTCTTGGGA AGCCGCTCTG AATGCTGTTA AAAGTGAGCC GTCGTACAAG
GGCGCCTACT TCACCCTTAA CCCGGTAAAG CTTCCAACAG GGGTTCCGCT GAATCCGAAG
ACCCTAATCG CTGCGCATAA AACGGCGGGC GATTCTGACA TTGAAAAACG CATTTGGCTG
CTCATCGACG TGGACCCGCC TCGGGCTGCT AACACGAACA GCACAGTATC TGAGAAGGAC
GCAGCGCGTG AGCAAGCTGA GCAGGTGCGC AACTACCTCA GCAGCCGATG TTGGCCCCAG
CCGGAATTGT GTGACTCGGG GAACGGTTGG CACTTGCTTT ATCACATTGA ATTGCCAAAT
GACTACGCGT CTACAGAATT GGTGCGCGGG ACATTAGCAC GCTTGCATGA CCTCTTTTCA
ATGGTGGACG CGGGAAACTT CAACGCTTCG CGATTATGTA AGCTGTATGG CACCTGGGCC
CGAAAGGGGC AGCATATAGA AGAGCGCCCG TGGAGGCTGT CCGCAATCGT GTCGAAAGGG
TCGGAGACAG CCGTGACGGC GCAGCAATTA GGTTCGATTA GCTCTATACG CTTGTCTTCT
CACAAGACAC CTGAGCAAGC AGATGATTTG AACCTGTCTC GCTTAATGGA GTTCCTTGAC
TACTACGGGG TGTCCGTGCG CTCAAAGCCG CGCTCAGTCA GAGATGGATT TCGGGTTGAG
ATTGAATGCC CGTGGGCAGA GGAGCACAGC GGAGAGACTC GCAGGGATAC AACGGTGTCT
TTCATCGAGG GCGTAGGCAA CGGGTTTCAC TGTCTGCACT CGCATTGCAC GGAGCGGCAT
TGGCGCGAGT TTAGAACCGA ATTGGAGAAG CGTCATCCGG GACGGTTGTT TTCGTTCGGA
CCAGAGGCTG TGTTTGGTGG CGGTTCGCTG CCGCTCATCG CGCATGCGAC ATTGGCTGAA
GCATTCCTTC GGGATAACCA CGATTTTGTG TGCATCTACG ACCAACCCAA ACGGCCAATC
GCGCAGTGGG TAAAAACACG GTGGGACATC TCTGAAGATG ACACGCTCTT ATGGCGTGCC
GTTGCGGACT ATCTCAAGGA CCTCCATCCC CGTTACCCGA AGCCAGAAAA GGGTCCAGAT
TCCCGTATGC GGTTTTATGA CGCCGCCTTT ATCGGCGGTG TGGTGAGATG CGTCAAGCCA
TACTTGCCTC CCGTTAAAGG GGAACTGTTT GACCGAGACC CGCACTTGCT GGGTTTGCCG
AACTGTCGAT TGATTGACCT TCGCACCAAC GCAACCCGGG ATATGCGTCG CGAAGATTAC
ATCACCCAAC GTATTGACGT AGCGCCGGAC CCAAACTGTC CAACGCCCCG CTTCGACCGC
TTCATCAGCG AAATAACCTG CGGCGATGGT CCACTTGCGA ATTACCTGCT TCGGCTTTGT
GCTCTGTGTC TGACGGCCAT CCCGTTCCAA GCTCTTTTTT TCCTGTGGGG GCGTGGCCGA
AATGGCAAAG GCGTCCTTAT ACGAACGCTG ACCGCAATCC TTGGCGACGG GAAATTTGCC
TGGCCACTCC GCCCGGGTGA AATCACAGTT TCGAAGTTTG GCGACGAAGC AGCAAAGCGG
ACCTTCGCCA ATTTGAAGGG TAGGCGGCTG GCAACAGTTA ACGAAAGCGT CGCGGGCAAT
CTCAACACCT CCATGTTGAA ACTGATTTCC GGAGGCGACA CATTGACCGG TGCGAACATG
CGGCAGGACC AGCAGGCTTT CAAGCCTACT CATAAAGTGC TGTTGCCGAC GAATGACCGA
CCGCAGCTAC CGGCGGACCC GGCCTTTCGC GGACGCGTGC ATATGATTCC ATTCCTCGCG
AACTTCACCG GGCGTGAGGA CACAAACCTC GACCACGTCT TGCAACATGT CGAGTTGCCC
GGAATCCTCT ACCGCTTCGT GACGCTGTGC CCCGACGTCA TTGAGAACGG GTTGCGGCCA
CCGGCAAGTG TGTTGGCTGA GACGGAACAG CTCTTCTCTG AGTTGGATAT CACGAAACAG
TTTCGCGATG ACTGCCTGGA AATCGTGGAT GGTGCGGAGA CTCCTGCCGC CGATGTGGAG
CGAGTTGTTA ACTCCTGGGT GCGTGAGCAA AGTACGACGG GTATCGTCGT CTCATCGTCG
GGGCGCGAGG GTCCTGATGA TGTGATATTG CGCGAACTGA AGCACCAGCC AGACATCAAG
TATTCCCGGC TAAGACGCAA AACTGGAGAG ACATCACCTC ACGGAAAAGG TAAGGCCTGG
TACTTCGTTG GCGTGAGGCT CAAGGAAGAA CCACCTCCGG CCACTCCGGC CACATCGTAA
 
Protein sequence
MTPFYGVHEV RVLRESGVAV GYFNSWEAAL NAVKSEPSYK GAYFTLNPVK LPTGVPLNPK 
TLIAAHKTAG DSDIEKRIWL LIDVDPPRAA NTNSTVSEKD AAREQAEQVR NYLSSRCWPQ
PELCDSGNGW HLLYHIELPN DYASTELVRG TLARLHDLFS MVDAGNFNAS RLCKLYGTWA
RKGQHIEERP WRLSAIVSKG SETAVTAQQL GSISSIRLSS HKTPEQADDL NLSRLMEFLD
YYGVSVRSKP RSVRDGFRVE IECPWAEEHS GETRRDTTVS FIEGVGNGFH CLHSHCTERH
WREFRTELEK RHPGRLFSFG PEAVFGGGSL PLIAHATLAE AFLRDNHDFV CIYDQPKRPI
AQWVKTRWDI SEDDTLLWRA VADYLKDLHP RYPKPEKGPD SRMRFYDAAF IGGVVRCVKP
YLPPVKGELF DRDPHLLGLP NCRLIDLRTN ATRDMRREDY ITQRIDVAPD PNCPTPRFDR
FISEITCGDG PLANYLLRLC ALCLTAIPFQ ALFFLWGRGR NGKGVLIRTL TAILGDGKFA
WPLRPGEITV SKFGDEAAKR TFANLKGRRL ATVNESVAGN LNTSMLKLIS GGDTLTGANM
RQDQQAFKPT HKVLLPTNDR PQLPADPAFR GRVHMIPFLA NFTGREDTNL DHVLQHVELP
GILYRFVTLC PDVIENGLRP PASVLAETEQ LFSELDITKQ FRDDCLEIVD GAETPAADVE
RVVNSWVREQ STTGIVVSSS GREGPDDVIL RELKHQPDIK YSRLRRKTGE TSPHGKGKAW
YFVGVRLKEE PPPATPATS