Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_0390 |
Symbol | |
ID | 4069212 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 447960 |
End bp | 449555 |
Gene Length | 1596 bp |
Protein Length | 531 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637982393 |
Product | type I phosphodiesterase/nucleotide pyrophosphatase |
Protein accession | YP_589469 |
Protein GI | 94967421 |
COG category | [R] General function prediction only |
COG ID | [COG1524] Uncharacterized proteins of the AP superfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.19098 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.642333 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACGCC TGGCGCTTCT GCTCCTCGTC CTCTCCCAGC TTTCTTTCGC CTCCGCCTAC GACGCCCATC CCAAGCTCGT AGTCGTCATC GTCATCGACC AGTTCCGCGG CGACTACCTC CAGCGCTATC ACAACGAGTT CGGCGAAGGA GGTTTCCGTC TCTTCACCGA CCACGGCGCC TACTTTTCCG ACTGCTATTA CGACTACGCC ACCCTTGTCA CCGGCCCCGG CCACGCGACG ATCGGCACCG GCTCCTACAC CATCGGCCAC GGCATCATGG CCAATGAGTG GTTCGACCCG CAGATCAACG AGCGCGTCAC CAGCGTCTCC GACGAAGCCA CACATATCGT CGGCGTTGAA GGCGGCCAAG GCTCCTCGCC CCACAACCTC CTCACCGACA CCTTTGGCGA CGAACTCCGC ATGGCCACCC AGGGCCGCTC TCGCGTCTTC GGCATCTCGA TGAAAGATCG CGCCGCGATC CTCCCCACCG GCCACAGCGC CAACGCCGCC TACTGGCTCG ACGGCAAATC CGGCGCGTGG ATCACCTCCG ACTACTACAT GAAGGCGCTC CCCCCATGGG TTGAAGCCGT CAATCACTCC GATGAAGCCA AAAAGTTCCT CAACCGCGAC TGGAAAGACG CCGCCGGCAA AGTGATGGGT AACACCAACC CGCGTAACGA CGAGGACGGC CAGCCCGAAG ACTATTTTGA AATCGTCGGA AGCACGCCCT TCGCCAATGA CCTAGAACTC GACTTCGCGC GCTCACTCAT CACCAACGAA AAACTCGGCA CCCGCGCAAC CACCGATCTG CTCGTCATCA GCCTCTCCGA AAACGACATC CTCGGCCACG CCGTAGGCCC CGACTCACCG ATCCTCCACG CCTCCATCGT TGAACTCGAT CGCCAACTCG CCGGCTTCTT CCAGTTTCTC GATAAGCAAT TCGGCATGAA TAACGTCTGG CTCGCCCTCT CCGCCGATCA CGGCGTCGCC CCCGTCCCGC GCGAAGTCCA GACTCTCCAC ATGCCCGCCA GTGAAATGGA CACCAAGCAG TTCACCGAAA AGCTCAATGA GGAAATCGCT AAGACCACCG GCAAGCCCGG CAAATATCTC CGTTCTGCCG GCCTCCCAAT GATCTCGCTC GATCCAGCCT CCTGGAGCGA CACCAAGGAA GCCGACGCCG AACAAATTGT CGGCGAAGCC GCTGTCCGCA CCGGTGCACT CGCCTACTTC ACCAAGTCCG ACCTCGCCAA AGGCCGCGTC CCCGAGACAC CCATGGGCCA CAAGTTCGCC AACACCTATT CGCCCTACGG CGGCTGGTGG GTCATGGTCC AACCGCGCCC CTTCACCATC CCCAAAGAAG ACGGCACCAC CCACTTCTCC CCCTACAGCT ACGACGCCCA CGTACCGCTC GCCTTCTACG GCGTACCGTT TGCGCCCGGC GTCTATCGCG GCCACAGCGA ACCAATCGAC CTAGCCGTCA CTCTCTCCTC TTTGCTCGGC ACCAACAAAC CCGCCGCCGC AACCGGACGC GTACTGACCG AAGCCCTCAA GCCCCCACCG AATCCGCCCG CAGGAGAAAA GCATCTCGTA AAATAA
|
Protein sequence | MKRLALLLLV LSQLSFASAY DAHPKLVVVI VIDQFRGDYL QRYHNEFGEG GFRLFTDHGA YFSDCYYDYA TLVTGPGHAT IGTGSYTIGH GIMANEWFDP QINERVTSVS DEATHIVGVE GGQGSSPHNL LTDTFGDELR MATQGRSRVF GISMKDRAAI LPTGHSANAA YWLDGKSGAW ITSDYYMKAL PPWVEAVNHS DEAKKFLNRD WKDAAGKVMG NTNPRNDEDG QPEDYFEIVG STPFANDLEL DFARSLITNE KLGTRATTDL LVISLSENDI LGHAVGPDSP ILHASIVELD RQLAGFFQFL DKQFGMNNVW LALSADHGVA PVPREVQTLH MPASEMDTKQ FTEKLNEEIA KTTGKPGKYL RSAGLPMISL DPASWSDTKE ADAEQIVGEA AVRTGALAYF TKSDLAKGRV PETPMGHKFA NTYSPYGGWW VMVQPRPFTI PKEDGTTHFS PYSYDAHVPL AFYGVPFAPG VYRGHSEPID LAVTLSSLLG TNKPAAATGR VLTEALKPPP NPPAGEKHLV K
|
| |