Gene Information |
Locus tag | Acid345_4602 |
Symbol | |
ID | 4070759 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 5449731 |
End bp | 5450861 |
Gene Length | 1131 bp |
Protein Length | 376 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637986642 |
Product | deoxyguanosinetriphosphate triphosphohydrolase |
Protein accession | YP_593676 |
Protein GI | 94971628 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0232] dGTP triphosphohydrolase |
TIGRFAM ID | [TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative |
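The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the final codon of the CDS is a stop codon that is not counted in the protein length; the function names are illustrative and not part of the database.

```python
# Values copied from the record above.
START_BP = 5449731
END_BP = 5450861


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1131, matching "Gene Length"
print(protein_length(length_bp))  # 376, matching "Protein Length"
```

Under these assumptions the result agrees with the listed 1131 bp and 376 aa.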
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.161559 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.698671 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTGCCG GCTATGCGGT GGACGTTGAA CAATCCCGAG GGCGGCGCAT CCCTGAGCCG CGGCACGCTT ACCGCAACGA CTTCCAGCGC GACCGCGATC GCGTGCTTCA TGCGCGGGCC TTTCGTCGCT TAGAGAACAA GACGCAGGTC TTTACCGGCC GCTATTCCGA CCACTTTCGC AATCGGCTGA CCCATACGAT TGAAGTCCAA CAGATTTCGC GTACGATCGC GAACGCGCTG GATTTGAACG TTGACCTCGT TGAGGCGTTG GCGCTGGCGC ATGACATTGG GCATCCACCG TTTGGACATG CCGGTGAGAA GGCGCTCGAT ACCGCGATGC GCAAGCACGG CGAGCGCTTC GACCACAATC TGCACGCGCT GCGCATCGTG GACGATTTCG AGCTGCGCTA CATCGCGTTC CGCGGCTTGA ATCTCACCTT CGAAGTGCGC GAGGGGATCA TCAAGCACTC GCGCGATTAC AAGGAGAGCG AGCATCCGGA ACTGAAGGAG TATCTGCTCG ATCGCCGTCC GCCGCTGGAA GCGCAGTTGA TCGACCTGAC CGACGAGATC GCCTACAACA CCGCCGACAT GGACGACGGT TTCGAAGCGC GCATCTTGAA CATCGACGCG CTCCGCACGG TGCCGATCTT CGAGCGCTTC TATCGCGAGG TGGAGGCGAA GCATCCCACG GCGCGGCGCA AACTGAAGTT CAACGAGACG GTGAAGCGGA TCTTCGACCG GCTGGTCACC GACCTGATTG AGAACACGCG CAAACGCATC GCAGACTCCG GCGTGAAGAC AGTTGAGGAT GTTCGCAACT ATCCCGAGCG GCTGGCGGCG TTCAGTCCGG ATGTGGATGC GGAGCGCGCG GAGTCGAAGG CGTTCCTCTA CAAGAACCTC TATTTCAGCG AAGCATTGCA AAACGAGAAG ATTGACGCGG AACTGATTGT CGGAGGATTG TTCGGGCATT TTATGACCCA TCCCGAGAGT TTGCCGCCGG GCTACCAGGA GAAGGCGCAA CAGGAAACAC GGGCACGCGT GGTGTGCGAC TACATCGCTG GGATGACCGA TAACTTCATC CAGAGCAACT ACGAGCGGCT GATGACCGAC GAAGCGCCGA GCGAAGAATA G
|
Protein sequence | MPAGYAVDVE QSRGRRIPEP RHAYRNDFQR DRDRVLHARA FRRLENKTQV FTGRYSDHFR NRLTHTIEVQ QISRTIANAL DLNVDLVEAL ALAHDIGHPP FGHAGEKALD TAMRKHGERF DHNLHALRIV DDFELRYIAF RGLNLTFEVR EGIIKHSRDY KESEHPELKE YLLDRRPPLE AQLIDLTDEI AYNTADMDDG FEARILNIDA LRTVPIFERF YREVEAKHPT ARRKLKFNET VKRIFDRLVT DLIENTRKRI ADSGVKTVED VRNYPERLAA FSPDVDAERA ESKAFLYKNL YFSEALQNEK IDAELIVGGL FGHFMTHPES LPPGYQEKAQ QETRARVVCD YIAGMTDNFI QSNYERLMTD EAPSEE
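As a rough cross-check of the two sequence fields, the gene sequence can be translated and its GC fraction computed. The sketch below is a minimal pure-Python illustration, not the database's own method. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons), and it uses only the first 30 nucleotides of the gene sequence for brevity. The helper names `translate` and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G). Table 11 shares these
# codon-to-amino-acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spacing used in the record and normalise to upper case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the gene sequence shown above.
prefix = "ATGCCTGCCG GCTATGCGGT GGACGTTGAA"
print(translate(prefix))    # MPAGYAVDVE
print(gc_fraction(prefix))  # GC fraction of the prefix only
```

The translated prefix matches the first ten residues of the listed protein sequence, which suggests the gene sequence is shown in coding orientation even though the gene lies on the minus strand. Passing the full gene sequence to `gc_fraction` would give a value to compare with the listed GC content.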