Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_4226 |
Symbol | |
ID | 4073152 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 5008735 |
End bp | 5010534 |
Gene Length | 1800 bp |
Protein Length | 599 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637986257 |
Product | MutT/nudix family protein |
Protein accession | YP_593300 |
Protein GI | 94971252 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.500357 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.363608 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAATTG TCTTGATCTC CGCTCTTTTG CTTAGTTCTT CCGCCTGGTC CTTCGCACAA AACGTTCCCG CCCCCGGCGC TCCGGATGAT CTCAAATTCC AGTCCCCCAA GTTCGCCAAG GGTTCTAAGG CCGCAAAGAT GCAGAAGTCT GACTCGACTC CCAAGAGCCA CATCCCCGAC GCCGCGGAAC TCCGCCAGAT GGGTGCGCGC TTCGCCCCCA CAGAACTAAA GATCGATCCG TCCTCGCTGT CACCCGGCGA CCAGAAAGCC CTCGCCAAGC TCGTTGAAGC CGCGAAGGTC ATTAACGATG TCTTCCTCAC GCAATACTGG AAGGGAAACC ACGCGCTCTG GACCAAACTT CAGGCCGACA CCACCGAACT CGGCCGCGAA CGTGCCCGCT ATTTCTGGAT CAACAAGAGT CCGTGGTCGG CGCTCGACGG TTTAACGGCA TTCCTTCCCG ACGTTCCCGC GAAGAAACTC CCCGGCGCGA ACTTCTACCC CGAAGACATG ACCCGCGAGG AATTTGAAGC GTGGGTGAAG ACGCTGCCCG AAAAAGATCA GGAATCCGCA AAAGGCTTCT TCACTGTCAT CCAACGTGGC GCGGACAAAA AGTTGACCAT TGTTCCCTTC AGCGAAGCCT ACAAAGCCGA CCTCACCCGC TGCGCTTCCC TCTTGAAAGA AGCTGCCGAT CTCACCGACA ACGCCTCCCT CAAGAAATTC CTCAACTCGC GCGCCGACGC GTTCGCTTCC AACGACTACT ACCAGAGCGA TATGGACTGG ATGGATCTCG ACGCCCCCAT CGATCCCACC ATCGGCCCCT ACGAGACCTA TAACGACGAG ATCTTCGGCT ACAAAGCCAG CTACGAGGCC TATATCACCG TCCGCGATGA TGCCGAGACG AAGAAGCTCA GCTCCTTCTC TGCGCACCTT CAGGAAATCG AGAACAATCT CCCGCTCGAT CCGAAGTATC GCAACCCGAA GCTCGGCGCC GCCGCTCCCA TCCGCGTAGT CAATGAAGTC TTCGCCGCCG GCGATGGCGA CCACGGCGTC CAGACCGCCG CCTACAACCT GCCCAACGAC GATCGCGTCG TCGCGCAGAA GGGCTCCAAG CGCGTGATGC TCAAGAACGT GCAGGCCGCC AAGTTCAACA GCGTGCTCAT TCCGATCTCG AAACAAGTCC TGAAAGCCGA TGCCCAGCAG TACGTGGACT TCGACCTGTT CTTCACCCAC ATCCTCACCC ACGAGCTCTG TCATGGCCTC GGCCCCCACG AAATCACGGT GAACGGCAAA AAGACCAACC CGCGTATCGA GATCAAAGAG CTTTACAGCG CGCTCGAAGA AGCCAAGGCT GACGTCACCG GCCTCTTTGC GCTGCAGTAC ATGCTCGACC ACGCAAAGGA CATGGGTCTC GACTCGACAT TGAAAATCGA CAACGATTCC GAAAAGAAGC TCTACACCAC CTATCTCGCG TCGTCGTTCC GTACCCTCCG CTTCGGAACG CATGAAGCCC ACGGCAAAGG CATGGCCGTC CAGGTCAGCT ACCTGATGAA GCGTGGCGCG TTCGTCGCGA ACCCCGACGG CACCTTCTCC GTGGACTACA AGAAGATCAA AGACGCCGTC CGCGACCTCG ACAAAATCAT GCTGACGCTG GAGGCCGAAG GCGACTACGC CGGCACGAAA AAACTTCTCG ACGACTACGG CACCGTCCCC GCTGAAATGC AAAAAGCCAT CGCTAAGATG AGCAGCGTGC CGGTGGACAT CGAGCCGCTA TACGTGACGG CCAAAGCACT AACAAAATAG
|
Protein sequence | MKIVLISALL LSSSAWSFAQ NVPAPGAPDD LKFQSPKFAK GSKAAKMQKS DSTPKSHIPD AAELRQMGAR FAPTELKIDP SSLSPGDQKA LAKLVEAAKV INDVFLTQYW KGNHALWTKL QADTTELGRE RARYFWINKS PWSALDGLTA FLPDVPAKKL PGANFYPEDM TREEFEAWVK TLPEKDQESA KGFFTVIQRG ADKKLTIVPF SEAYKADLTR CASLLKEAAD LTDNASLKKF LNSRADAFAS NDYYQSDMDW MDLDAPIDPT IGPYETYNDE IFGYKASYEA YITVRDDAET KKLSSFSAHL QEIENNLPLD PKYRNPKLGA AAPIRVVNEV FAAGDGDHGV QTAAYNLPND DRVVAQKGSK RVMLKNVQAA KFNSVLIPIS KQVLKADAQQ YVDFDLFFTH ILTHELCHGL GPHEITVNGK KTNPRIEIKE LYSALEEAKA DVTGLFALQY MLDHAKDMGL DSTLKIDNDS EKKLYTTYLA SSFRTLRFGT HEAHGKGMAV QVSYLMKRGA FVANPDGTFS VDYKKIKDAV RDLDKIMLTL EAEGDYAGTK KLLDDYGTVP AEMQKAIAKM SSVPVDIEPL YVTAKALTK
|
| |