Gene Information |
Locus tag | Xaut_2377 |
Symbol | |
ID | 5422868 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Xanthobacter autotrophicus Py2 |
Kingdom | Bacteria |
Replicon accession | NC_009720 |
Strand | - |
Start bp | 2653968 |
End bp | 2656970 |
Gene Length | 3003 bp |
Protein Length | 1000 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640881631 |
Product | DNA polymerase I |
Protein accession | YP_001417277 |
Protein GI | 154246319 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI); [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
|
|
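The coordinate, gene length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the helper names `gene_length` and `protein_length` are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon (the gene sequence in this record ends in TGA).

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no amino acid.
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(2653968, 2656970)
print(length_bp)                  # 3003, matching the Gene Length field
print(protein_length(length_bp))  # 1000, matching the Protein Length field
```

Under these assumptions the three fields agree with one another: 3003 bp is 1001 codons, and dropping the stop codon gives 1000 residues.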
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0421438 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGCCCT CCCCCCGCCC GCTTCAGCCC GGCGACCATG TGTTCCTGGT GGACGGCTCG TCCTTCGTGT TCCGCGCCTA TTTCCAGTCC ATCAACCAGG ACCGAAAGTA CAATTTCCGC TCCGACCGGC TGCCCACCGG GGCGGTGCGG CTGTTCTGCA CCAAGCTGCT GCAATTCGTG CGCGACGGGG CGGTGGGCAT CAAGCCGACC CACCTCGCCA TCATCTTCGA CAAGTCGGAG GATAGTTTCC GCAAGGAGCT TTACCCCGAC TACAAGGCCA ACCGCTCCGA GCCGCCCGAG GAACTCATTC CCCAGTTCCC GCTCATGCGC GAGGCGGTGC GCGCCTTCGG CCTCATTCCG GTGGAGCAGG CGCGCTACGA GGCGGACGAC CTCATCGCCA CCTATGCCGA CCAGGCGGTG AAGGCGGGGG CGGACGTGCT CATCGTCTCC GCCGACAAGG ATCTGATGCA GATGGTCGGC CCCAAGGTCG CCATGTACGA CCCTGCTTCC GGCGAGAGCG GCGGGCGCGG GGCGCGGCCG GAGCGGCGCA TCGGGGTGGG CGAGGTGCTC GAATATTTCG GCGTGCCGCC GGAGAAGGTC ACCGACGTGC AGGCGCTGGC GGGGGATTCC ACCGACAACG TGCCCGGCGT GCCCGGCATC GGCATCAAGA CCGCCGCCCA GCTCATTGGC GAATACGGCG ACCTTGAAAC CCTGCTGGCC CGCGCCGGCG AGATCAAGCA GCCCAAGCGG CGCGAATCGC TGCTCACCAA TGCCGAGGCG GCGCGCATCT CCAAGACGCT GGTCACGCTG GTGCGCGACG TGTCCGTGGA GGTGCCGCTG GAGGACCTGG TGCTGGAGGC GCCCGACGCG AAGCGCCTCA TCGCCTTCCT GAAGGCCATG GAATTCACCA CCATCACCCG CCGCGTGGCC GAGGCCTATG GCGTGGAGGC CGCCGAGGTG GAGGCCGACC CCAGGCTCGC CCCCGCCGGC CTGTTCTCCC CCGCCGCCCC CACCTCGGCC GAGGAGACGG ACGGCGCAGA GCCGGCCACG GGTGGGGAAG CGCCCGCGGT CTCGGCCGTG GCGCAGGCCA TGGACGGCAC GCTCACCCCG TCCGACCTCG CCCATGCCCG CGCCAGCACG GCACGCACCA TCCCCGTGGA CCGCACGGCC TACCGCACCG TCCTCGACCT CGCCGAGCTG AAGGCATGGT GCGCCAAGGC GCAGGACCAG GGGTTGCTTG CCTTCGACAC CGAGACCAAT TCCCTCGATC CCATGCAGGC GGACCTGGTG GGCGTCTCGC TCGCGCTGTC GCCCAACGAG GCCTGCTATA TCCCCCTTGC CCATACCGGA GCCGGCGACG GGCTATTCAG CGAGGGGCAG CTTCCCGGCC AGATCCCGGT CCGCGATGCC ATCGCCGCGC TGAAGGGTGT GCTGGAGGAC AAGGGCACCC TGAAGGTCGG GCACAATGTG AAGTACGACC AGTTGGTGCT GGCCCGCCAC GGCATAGATG TCGCCCCGTT CGACTGCACC ATGTGCATGT CCTACGCGCT GGACGCAGGC AAGAACGGCC ATGGCATGGA CGAATTGTCG GTGCTCCATC TCGGCCACCA GCCCATCGCA TTCTCGGAAG TGACCGGCAA GGGCAAGGGA AAGGTGACCT TCGACAAGGT CGCGCTGGAG CCCGCCACCC ACTATGCCGC CGAGGATGCG GACGTGACCC TGCGCCTGTG GCAGGTGCTG AAGCCCCGCC TCGCCGCCGA GGGCCGCACC ACCGTCTACG AGACTTTGGA GCGCCCGCTC 
ATCGCCGTGC TGGCGCGCAT GGAGAGCCGG GGCATCTCCA TCGACAAGGC CATGCTGGCG CGCCTCTCCT CAGAGTTCGC ACAAGGCGCG GCGCGCATCG AGGACGAGAT CGCGGAGCTG GCCGGCGAGC GGCTGAACGT GGGCAGCCCC AAGCAGATGG GCGACATCCT GTTCGGCAAG ATGGGCCTGC CCGGCGGCAC CAAGACCGCC ACCGGCATGT GGTCCACCAA GGCCACCGCG CTGGAGGAGC TGGCCGAGGC CGGCCACAAG CTGCCGCAGA AGATCCTGGA ATGGCGCCAG CTCTCCAAGC TGCGCTCCAC CTATACGGAC GCCCTGCCCA ACTTCGTGAA CCCCCAGACC AGGCGGGTCC ACACCTCCTA TGCGCTGGCC GCCACCACCA CCGGGCGGCT GTCCTCTTCC GACCCGAACC TGCAGAACAT CCCCATCCGC ACCGAGGAAG GCCGGCGCAT CCGCCGCGCG TTCGTGGCGG AAGAAGGCAA CCTTCTGGTT TCAGCGGACT ATTCGCAGAT CGAGCTGCGG CTCCTCGCCG AGATCGCCGA GATCCCGGCC CTGCGCACCG CCTTCACCGA GGGGCTGGAC ATCCACGCCA TGACCGCCTC CGAGATGTTC AACGTGCCGG TGAAGGACAT GCCCGCCGAG GTGCGCCGGC GCGCCAAGGC CATCAATTTC GGCATCATCT ACGGCATCTC CGCCTTCGGC CTCGCCAACC AGCTGGGCAT TCCGCGCGAG GAGGCGGGGC AATACATCAA GCGCTATTTC GAGCGCTTCC CCGGCATCCG CGACTATATG GAGGAGACCA AGACCTTCTG TCGCGAGCAC GGCTATGTGG AGACCCTGTT CGGCCGGCGC TGCCACTATC CCGAGATCGC GGCCAAGAAC CCCTCCATCC GCGCCTTCAA CGAGCGCGCC GCCATCAATG CCCGCCTGCA AGGCACGGCG GCGGACATCA TCCGCCGCGC CATGATCCGC ATGGAGCCGG CGCTGGAGAA GGCGAAGCTA TCCGCGCGCA TGCTGCTGCA GGTGCACGAC GAACTGGTGT TCGAGGTGCC CGAGGGCGAG GCGGACGCCA CCATCCCCGT GGTGCGCCAG TGCATGGAAA CCGCCTCCGC CCCCGCCGTC GCTCTCGCCG TGCCGCTGAA GGTGGATGCG CGGGCGGCGA AGAACTGGGA AGAGGCGCAC TGA
|
Protein sequence | MSPSPRPLQP GDHVFLVDGS SFVFRAYFQS INQDRKYNFR SDRLPTGAVR LFCTKLLQFV RDGAVGIKPT HLAIIFDKSE DSFRKELYPD YKANRSEPPE ELIPQFPLMR EAVRAFGLIP VEQARYEADD LIATYADQAV KAGADVLIVS ADKDLMQMVG PKVAMYDPAS GESGGRGARP ERRIGVGEVL EYFGVPPEKV TDVQALAGDS TDNVPGVPGI GIKTAAQLIG EYGDLETLLA RAGEIKQPKR RESLLTNAEA ARISKTLVTL VRDVSVEVPL EDLVLEAPDA KRLIAFLKAM EFTTITRRVA EAYGVEAAEV EADPRLAPAG LFSPAAPTSA EETDGAEPAT GGEAPAVSAV AQAMDGTLTP SDLAHARAST ARTIPVDRTA YRTVLDLAEL KAWCAKAQDQ GLLAFDTETN SLDPMQADLV GVSLALSPNE ACYIPLAHTG AGDGLFSEGQ LPGQIPVRDA IAALKGVLED KGTLKVGHNV KYDQLVLARH GIDVAPFDCT MCMSYALDAG KNGHGMDELS VLHLGHQPIA FSEVTGKGKG KVTFDKVALE PATHYAAEDA DVTLRLWQVL KPRLAAEGRT TVYETLERPL IAVLARMESR GISIDKAMLA RLSSEFAQGA ARIEDEIAEL AGERLNVGSP KQMGDILFGK MGLPGGTKTA TGMWSTKATA LEELAEAGHK LPQKILEWRQ LSKLRSTYTD ALPNFVNPQT RRVHTSYALA ATTTGRLSSS DPNLQNIPIR TEEGRRIRRA FVAEEGNLLV SADYSQIELR LLAEIAEIPA LRTAFTEGLD IHAMTASEMF NVPVKDMPAE VRRRAKAINF GIIYGISAFG LANQLGIPRE EAGQYIKRYF ERFPGIRDYM EETKTFCREH GYVETLFGRR CHYPEIAAKN PSIRAFNERA AINARLQGTA ADIIRRAMIR MEPALEKAKL SARMLLQVHD ELVFEVPEGE ADATIPVVRQ CMETASAPAV ALAVPLKVDA RAAKNWEEAH
|