Gene Acid345_4226 details


Gene Information

Locus tag: Acid345_4226
Symbol:
ID: 4073152
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 5008735
End bp: 5010534
Gene Length: 1800 bp
Protein Length: 599 aa
Translation table: 11
GC content: 58%
IMG OID: 637986257
Product: MutT/nudix family protein
Protein accession: YP_593300
Protein GI: 94971252
COG category:
COG ID:
TIGRFAM ID:
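The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch illustrates. It assumes 1-based inclusive coordinates and a coding sequence that ends in a single stop codon; the function names `span_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS, assuming the final codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values taken from the record above.
gene_len = span_length(5008735, 5010534)
print(gene_len)                  # 1800
print(protein_length(gene_len))  # 599
```

Both results agree with the Gene Length and Protein Length fields listed in the record.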


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.500357
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.363608
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAATTG TCTTGATCTC CGCTCTTTTG CTTAGTTCTT CCGCCTGGTC CTTCGCACAA 
AACGTTCCCG CCCCCGGCGC TCCGGATGAT CTCAAATTCC AGTCCCCCAA GTTCGCCAAG
GGTTCTAAGG CCGCAAAGAT GCAGAAGTCT GACTCGACTC CCAAGAGCCA CATCCCCGAC
GCCGCGGAAC TCCGCCAGAT GGGTGCGCGC TTCGCCCCCA CAGAACTAAA GATCGATCCG
TCCTCGCTGT CACCCGGCGA CCAGAAAGCC CTCGCCAAGC TCGTTGAAGC CGCGAAGGTC
ATTAACGATG TCTTCCTCAC GCAATACTGG AAGGGAAACC ACGCGCTCTG GACCAAACTT
CAGGCCGACA CCACCGAACT CGGCCGCGAA CGTGCCCGCT ATTTCTGGAT CAACAAGAGT
CCGTGGTCGG CGCTCGACGG TTTAACGGCA TTCCTTCCCG ACGTTCCCGC GAAGAAACTC
CCCGGCGCGA ACTTCTACCC CGAAGACATG ACCCGCGAGG AATTTGAAGC GTGGGTGAAG
ACGCTGCCCG AAAAAGATCA GGAATCCGCA AAAGGCTTCT TCACTGTCAT CCAACGTGGC
GCGGACAAAA AGTTGACCAT TGTTCCCTTC AGCGAAGCCT ACAAAGCCGA CCTCACCCGC
TGCGCTTCCC TCTTGAAAGA AGCTGCCGAT CTCACCGACA ACGCCTCCCT CAAGAAATTC
CTCAACTCGC GCGCCGACGC GTTCGCTTCC AACGACTACT ACCAGAGCGA TATGGACTGG
ATGGATCTCG ACGCCCCCAT CGATCCCACC ATCGGCCCCT ACGAGACCTA TAACGACGAG
ATCTTCGGCT ACAAAGCCAG CTACGAGGCC TATATCACCG TCCGCGATGA TGCCGAGACG
AAGAAGCTCA GCTCCTTCTC TGCGCACCTT CAGGAAATCG AGAACAATCT CCCGCTCGAT
CCGAAGTATC GCAACCCGAA GCTCGGCGCC GCCGCTCCCA TCCGCGTAGT CAATGAAGTC
TTCGCCGCCG GCGATGGCGA CCACGGCGTC CAGACCGCCG CCTACAACCT GCCCAACGAC
GATCGCGTCG TCGCGCAGAA GGGCTCCAAG CGCGTGATGC TCAAGAACGT GCAGGCCGCC
AAGTTCAACA GCGTGCTCAT TCCGATCTCG AAACAAGTCC TGAAAGCCGA TGCCCAGCAG
TACGTGGACT TCGACCTGTT CTTCACCCAC ATCCTCACCC ACGAGCTCTG TCATGGCCTC
GGCCCCCACG AAATCACGGT GAACGGCAAA AAGACCAACC CGCGTATCGA GATCAAAGAG
CTTTACAGCG CGCTCGAAGA AGCCAAGGCT GACGTCACCG GCCTCTTTGC GCTGCAGTAC
ATGCTCGACC ACGCAAAGGA CATGGGTCTC GACTCGACAT TGAAAATCGA CAACGATTCC
GAAAAGAAGC TCTACACCAC CTATCTCGCG TCGTCGTTCC GTACCCTCCG CTTCGGAACG
CATGAAGCCC ACGGCAAAGG CATGGCCGTC CAGGTCAGCT ACCTGATGAA GCGTGGCGCG
TTCGTCGCGA ACCCCGACGG CACCTTCTCC GTGGACTACA AGAAGATCAA AGACGCCGTC
CGCGACCTCG ACAAAATCAT GCTGACGCTG GAGGCCGAAG GCGACTACGC CGGCACGAAA
AAACTTCTCG ACGACTACGG CACCGTCCCC GCTGAAATGC AAAAAGCCAT CGCTAAGATG
AGCAGCGTGC CGGTGGACAT CGAGCCGCTA TACGTGACGG CCAAAGCACT AACAAAATAG
 
Protein sequence
MKIVLISALL LSSSAWSFAQ NVPAPGAPDD LKFQSPKFAK GSKAAKMQKS DSTPKSHIPD 
AAELRQMGAR FAPTELKIDP SSLSPGDQKA LAKLVEAAKV INDVFLTQYW KGNHALWTKL
QADTTELGRE RARYFWINKS PWSALDGLTA FLPDVPAKKL PGANFYPEDM TREEFEAWVK
TLPEKDQESA KGFFTVIQRG ADKKLTIVPF SEAYKADLTR CASLLKEAAD LTDNASLKKF
LNSRADAFAS NDYYQSDMDW MDLDAPIDPT IGPYETYNDE IFGYKASYEA YITVRDDAET
KKLSSFSAHL QEIENNLPLD PKYRNPKLGA AAPIRVVNEV FAAGDGDHGV QTAAYNLPND
DRVVAQKGSK RVMLKNVQAA KFNSVLIPIS KQVLKADAQQ YVDFDLFFTH ILTHELCHGL
GPHEITVNGK KTNPRIEIKE LYSALEEAKA DVTGLFALQY MLDHAKDMGL DSTLKIDNDS
EKKLYTTYLA SSFRTLRFGT HEAHGKGMAV QVSYLMKRGA FVANPDGTFS VDYKKIKDAV
RDLDKIMLTL EAEGDYAGTK KLLDDYGTVP AEMQKAIAKM SSVPVDIEPL YVTAKALTK
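As an illustration of how the gene and protein sequences above relate, the sketch below strips the 10-character grouping, computes GC content, and translates codons up to the first stop. It is a minimal example rather than the database's own method: the helper names are hypothetical, and it uses the standard codon-to-amino-acid assignments, which translation table 11 shares (table 11 differs in its permitted start codons, which this sketch does not model).

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks used for display grouping; uppercase the result."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence from the record above.
first_line = "ATGAAAATTG TCTTGATCTC CGCTCTTTTG CTTAGTTCTT CCGCCTGGTC CTTCGCACAA"
print(translate(first_line))  # MKIVLISALLLSSSAWSFAQ
```

The printed residues match the first 20 residues of the protein sequence listed above. Passing the full gene sequence to `gc_content` gives a fraction that can be compared with the GC content field.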