Gene Acid345_4047 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_4047 
Symbol 
ID4072469 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4781910 
End bp4785122 
Gene Length3213 bp 
Protein Length1070 aa 
Translation table11 
GC content60% 
IMG OID637986078 
ProductDNA polymerase III, alpha subunit 
Protein accessionYP_593121 
Protein GI94971073 
COG category[L] Replication, recombination and repair 
COG ID[COG0587] DNA polymerase III, alpha subunit 
TIGRFAM ID[TIGR00594] DNA-directed DNA polymerase III (polc) 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGCAC AAAACCAATA CATCGAGTTG CATGCGAACA GCGCGTTCAG TTTCCTGCGT 
GGGGCTTCAC TGCCTGAGAC GCTGATTTTT CGCGCGCAGC AACTCGAATA TCCGGCGATG
GCGCTGATGG ATGGAAACGG CGTTTATGGC TCGGCGCGGT TTCATCTCGC TGCGAAGCCG
AATGGCGTAC GTGCGCACGT GGGCAGCGAG ATTGCCGTGC GCGATCTCGG TGAGCGCATG
GCGCCGGCGG CGTGGTTGCC GCATCAATGG CCGGCGGAAC CGGTGCGTCT ACCGCTGTTA
TGCGAGTCGC GCGAGGGATA CCAGAACCTG TGCCGCATGG TGACGCGGCT GAAGATGCGC
GAGCCGTCGA AGGCGGAAGG CGCGGCGGTG ATGGCCGATG TGGAAGAGTT CGCGCGCGGT
CTGGTGTGTT TGACCGGCGG CGACGAGGGT CCACTGGCGT CGGCGCTGGC GCGTGGTGGC
GAGCCCGAAG GACACAAAAC TGTAGAGCAC CTGGTGCGCA TGTTTGGGCG CGAGAATGTT
TTCGTCGAAT TGCAGCGGCA TGGATTGCGT GAGCAGGAGT GGCGCAACCA GGCGGCGGTG
CGGATTGCGC GCTCGCTGCA GCTGCCGATT CTCGCCACCA ACGGCGTGCG TTATGCCGAT
GAGCCGGAGC GCGAGATTGC CGATCTGTTT ACGGCGATCC GCCATCACGT GGCACTGAAG
GATGCGGGGC GTCTGCTGGC GCAAAATTCG TGCCGGTATC TGCGTCCGAC GGCGGAGATG
GCGCGGTTGT TTCGCGACCT GCCGGAGGCG GTTGCGAATA CGTGGGAGTT GTCGCAGCGG
CTGACGTTCG AGCTCGACGA TCTTGGTTAC CAGTTCCCGA TTTATCCCAC GCCGGATGGC
GAATCGATGG ACTCGTTCCT GGAGAAACGC GTACAGGAAG GCGTGATCAA GCGCTATGGA
GCGAAGAACG AGCCCGATCT CTACGCTCGT GCACAGAAGC AAGTTGCGCG CGAATTGGCG
CTGATCAAGA AGCTCGGTCT CGCCGGATAT TTTCTGATCG TATGGGACAT CATTCGCTTC
TGCCAGCAGA ACGACATTCT CGTGCAAGGG CGCGGCAGCG CCGCGAATTC CGCGGTTTGT
TACGCGTTGG AGATCACGGC GGTTGACCCG GTTGGCATGG AGTTGCTGTT TGAGCGCTTT
TTGAGCGAAG CGCGCGGCGA GTGGCCGGAT ATTGACCTCG ATTTGCCATC GGGCGACAAG
CGCGAGCAGG CGATCCAGTA CGTGTACACG CGTTACGGGC ATCTAGGCGC GGCGATGACT
GCAAATGTCA TTACCTATCG TGGAAAATCC GCGGCGCGAG AGGTTGGCAA GGCGCTTGGT
TTCGACGTCG AGACGTTGAA CAAGCTGACA AAGCTGGTGA GCACGTGGGA GTGGCGCGGG
CCGAATGACA CGCTGGAAAA CCAATTCGAA TCGGCGGGAT TTGAAGCGAA ACATCCGCAT
ATCGCGAAAT ATCTCGACCT TTGCAGCCGT ATCCAGGATT TGCCGCGGCA TCTCGGGCAA
CACTCCGGCG GGATGGTGAT CTGCCAAGGG CAGCTCGACA GCGTGGTGCC GATCGAGCCG
GCGTCGATGC CGGGCCGGTC GGTCGTTCAA TGGGATAAAG ACGATTGCGC CGACATGCAC
ATCATCAAAG TGGACCTACT TGGACTGGGG ATGATGGCGG TAATCAAGGA TTGCGTGGAC
TTGGTACCGC GGCACTACGG TGTTCCCGTG GATTTGGCGC AACTACCCCA GAACGATGGG
GATGTTTATC GCACGCTGCA GAAGGCGGAC ACAGTCGGCA TGTTCCAGGT GGAAAGCCGT
GCGCAGATGG CGTCGCTACC GCGGAATTAT CCAACGCGGT TTTACGACAT CGTGGTGCAG
GTTGCGATCA TCCGACCGGG GCCGATCGTG GGGAACATGG CAAACCCGTA TATGCGTCGC
CGACAGAAGA AGGAAGAGGT GACGTACTAC CATCCGTTGC TGGAGCCGGT GTTGAAGCGG
ACGCTGGGCG TGCCGTTATT CCAGGAGCAG TTGCTCCGGA TGGCGATGAT CGTCGCAAAC
TTCTCCGGCA CCGAAGCGGA GGAATTGCGA CGCGCGCTGG GAATGCGGCG CTCCACGCAG
CGTATGCGCG AGCTCGGCGT GAAATTGCGC GCGGGAATGA CGGAGAACGG GTTTGATCCG
AAGACGCAGG AAGAGATCAT CCAGAGCATT TCGTCGTTCG CGCTGTACGG GTTTCCGGAG
TCGCACGCCG CAAGTTTCGC GCTGATTGCG TATGCGTCGG CGTATTTCAA GGTGCATTAC
CTCGCGGCGT TCACGGCGGC GATCCTGAAT AACCAGCCGA TGGGCTTTTA TTCGCCGGCG
GTATTGGTGA AGGACGCGCA ACGGCATGGA TTGCGCGTTT TACCGGTGGA TGTACAACGT
TCGGTGTGGG ACTGCACGGT AGAACACGCC GGTGACGAGC GTAAATTGCG GTTGGGATTG
CGGTATGTGC GCGGATTACG GCAGGAAACC GCAGAAATAC TGGTGAAATC GCGGGAAAAC
CACGGTCCGT TCGCGTCGTC GGAGGACTTG GGCCGGCGGG TGCCGCTGCT GAACCGCAAA
GAACTGGTGG CGCTGGCGCA AATCGGAGCG CTGAACTGGG TGGGCGGTAC GGCGCACCGT
CGCGACGCCT TGTGGCAGGT GGAGCGGGTG AGCCGACGCC CCGGACCGCT CCTGCAATCC
GCGGAGGAAG AAGAGGACGT TTCACCGCTG GCGCAGATGA ATTGGGAGGA GCGGTTAGTC
GCTGATTTCA ACGGCACCGG CCTGACCGTC GGCAAGCATC CGATGACCTA TCATCGCGAG
CGGCTAGCAA AGATGAGGGT GCTTTCGGCG GAACAGTTAC AGGTTGCGGA AGACGGACGG
CAGGTTCGAA TCGCCGGCTG CGTGATCGCA AGGCAGCGCC CGGGGACGGC GAAAGGTTTT
GTTTTCCTGA GCATCGAGGA CGAGACGGGG ATTGCGAACG CGATCATTAG TCCGCAACTA
TATGAGCAGA ACCGGGTGGT GGTTTTTACC GAGCGGTTCC TGGTCGTGGA AGGCAAGCTT
CAAAACCAAG ACGGCGTGAT TCACGTCAGG GCGCAGCGAG TGCAGTCGTT GCATTTGAAC
CAGGTTGCAG CACCTTCGCA CGATTTTCAT TGA
 
Protein sequence
MTAQNQYIEL HANSAFSFLR GASLPETLIF RAQQLEYPAM ALMDGNGVYG SARFHLAAKP 
NGVRAHVGSE IAVRDLGERM APAAWLPHQW PAEPVRLPLL CESREGYQNL CRMVTRLKMR
EPSKAEGAAV MADVEEFARG LVCLTGGDEG PLASALARGG EPEGHKTVEH LVRMFGRENV
FVELQRHGLR EQEWRNQAAV RIARSLQLPI LATNGVRYAD EPEREIADLF TAIRHHVALK
DAGRLLAQNS CRYLRPTAEM ARLFRDLPEA VANTWELSQR LTFELDDLGY QFPIYPTPDG
ESMDSFLEKR VQEGVIKRYG AKNEPDLYAR AQKQVARELA LIKKLGLAGY FLIVWDIIRF
CQQNDILVQG RGSAANSAVC YALEITAVDP VGMELLFERF LSEARGEWPD IDLDLPSGDK
REQAIQYVYT RYGHLGAAMT ANVITYRGKS AAREVGKALG FDVETLNKLT KLVSTWEWRG
PNDTLENQFE SAGFEAKHPH IAKYLDLCSR IQDLPRHLGQ HSGGMVICQG QLDSVVPIEP
ASMPGRSVVQ WDKDDCADMH IIKVDLLGLG MMAVIKDCVD LVPRHYGVPV DLAQLPQNDG
DVYRTLQKAD TVGMFQVESR AQMASLPRNY PTRFYDIVVQ VAIIRPGPIV GNMANPYMRR
RQKKEEVTYY HPLLEPVLKR TLGVPLFQEQ LLRMAMIVAN FSGTEAEELR RALGMRRSTQ
RMRELGVKLR AGMTENGFDP KTQEEIIQSI SSFALYGFPE SHAASFALIA YASAYFKVHY
LAAFTAAILN NQPMGFYSPA VLVKDAQRHG LRVLPVDVQR SVWDCTVEHA GDERKLRLGL
RYVRGLRQET AEILVKSREN HGPFASSEDL GRRVPLLNRK ELVALAQIGA LNWVGGTAHR
RDALWQVERV SRRPGPLLQS AEEEEDVSPL AQMNWEERLV ADFNGTGLTV GKHPMTYHRE
RLAKMRVLSA EQLQVAEDGR QVRIAGCVIA RQRPGTAKGF VFLSIEDETG IANAIISPQL
YEQNRVVVFT ERFLVVEGKL QNQDGVIHVR AQRVQSLHLN QVAAPSHDFH