Gene Acid345_0390 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_0390 
Symbol 
ID4069212 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp447960 
End bp449555 
Gene Length1596 bp 
Protein Length531 aa 
Translation table11 
GC content61% 
IMG OID637982393 
Producttype I phosphodiesterase/nucleotide pyrophosphatase 
Protein accessionYP_589469 
Protein GI94967421 
COG category[R] General function prediction only 
COG ID[COG1524] Uncharacterized proteins of the AP superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.19098 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.642333 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACGCC TGGCGCTTCT GCTCCTCGTC CTCTCCCAGC TTTCTTTCGC CTCCGCCTAC 
GACGCCCATC CCAAGCTCGT AGTCGTCATC GTCATCGACC AGTTCCGCGG CGACTACCTC
CAGCGCTATC ACAACGAGTT CGGCGAAGGA GGTTTCCGTC TCTTCACCGA CCACGGCGCC
TACTTTTCCG ACTGCTATTA CGACTACGCC ACCCTTGTCA CCGGCCCCGG CCACGCGACG
ATCGGCACCG GCTCCTACAC CATCGGCCAC GGCATCATGG CCAATGAGTG GTTCGACCCG
CAGATCAACG AGCGCGTCAC CAGCGTCTCC GACGAAGCCA CACATATCGT CGGCGTTGAA
GGCGGCCAAG GCTCCTCGCC CCACAACCTC CTCACCGACA CCTTTGGCGA CGAACTCCGC
ATGGCCACCC AGGGCCGCTC TCGCGTCTTC GGCATCTCGA TGAAAGATCG CGCCGCGATC
CTCCCCACCG GCCACAGCGC CAACGCCGCC TACTGGCTCG ACGGCAAATC CGGCGCGTGG
ATCACCTCCG ACTACTACAT GAAGGCGCTC CCCCCATGGG TTGAAGCCGT CAATCACTCC
GATGAAGCCA AAAAGTTCCT CAACCGCGAC TGGAAAGACG CCGCCGGCAA AGTGATGGGT
AACACCAACC CGCGTAACGA CGAGGACGGC CAGCCCGAAG ACTATTTTGA AATCGTCGGA
AGCACGCCCT TCGCCAATGA CCTAGAACTC GACTTCGCGC GCTCACTCAT CACCAACGAA
AAACTCGGCA CCCGCGCAAC CACCGATCTG CTCGTCATCA GCCTCTCCGA AAACGACATC
CTCGGCCACG CCGTAGGCCC CGACTCACCG ATCCTCCACG CCTCCATCGT TGAACTCGAT
CGCCAACTCG CCGGCTTCTT CCAGTTTCTC GATAAGCAAT TCGGCATGAA TAACGTCTGG
CTCGCCCTCT CCGCCGATCA CGGCGTCGCC CCCGTCCCGC GCGAAGTCCA GACTCTCCAC
ATGCCCGCCA GTGAAATGGA CACCAAGCAG TTCACCGAAA AGCTCAATGA GGAAATCGCT
AAGACCACCG GCAAGCCCGG CAAATATCTC CGTTCTGCCG GCCTCCCAAT GATCTCGCTC
GATCCAGCCT CCTGGAGCGA CACCAAGGAA GCCGACGCCG AACAAATTGT CGGCGAAGCC
GCTGTCCGCA CCGGTGCACT CGCCTACTTC ACCAAGTCCG ACCTCGCCAA AGGCCGCGTC
CCCGAGACAC CCATGGGCCA CAAGTTCGCC AACACCTATT CGCCCTACGG CGGCTGGTGG
GTCATGGTCC AACCGCGCCC CTTCACCATC CCCAAAGAAG ACGGCACCAC CCACTTCTCC
CCCTACAGCT ACGACGCCCA CGTACCGCTC GCCTTCTACG GCGTACCGTT TGCGCCCGGC
GTCTATCGCG GCCACAGCGA ACCAATCGAC CTAGCCGTCA CTCTCTCCTC TTTGCTCGGC
ACCAACAAAC CCGCCGCCGC AACCGGACGC GTACTGACCG AAGCCCTCAA GCCCCCACCG
AATCCGCCCG CAGGAGAAAA GCATCTCGTA AAATAA
 
Protein sequence
MKRLALLLLV LSQLSFASAY DAHPKLVVVI VIDQFRGDYL QRYHNEFGEG GFRLFTDHGA 
YFSDCYYDYA TLVTGPGHAT IGTGSYTIGH GIMANEWFDP QINERVTSVS DEATHIVGVE
GGQGSSPHNL LTDTFGDELR MATQGRSRVF GISMKDRAAI LPTGHSANAA YWLDGKSGAW
ITSDYYMKAL PPWVEAVNHS DEAKKFLNRD WKDAAGKVMG NTNPRNDEDG QPEDYFEIVG
STPFANDLEL DFARSLITNE KLGTRATTDL LVISLSENDI LGHAVGPDSP ILHASIVELD
RQLAGFFQFL DKQFGMNNVW LALSADHGVA PVPREVQTLH MPASEMDTKQ FTEKLNEEIA
KTTGKPGKYL RSAGLPMISL DPASWSDTKE ADAEQIVGEA AVRTGALAYF TKSDLAKGRV
PETPMGHKFA NTYSPYGGWW VMVQPRPFTI PKEDGTTHFS PYSYDAHVPL AFYGVPFAPG
VYRGHSEPID LAVTLSSLLG TNKPAAATGR VLTEALKPPP NPPAGEKHLV K