Gene Acid345_0497 details


Gene Information

Locus tag: Acid345_0497
Symbol:
ID: 4069396
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 611909
End bp: 613273
Gene Length: 1365 bp
Protein Length: 454 aa
Translation table: 11
GC content: 60%
IMG OID: 637982501
Product: metal dependent phosphohydrolase
Protein accession: YP_589576
Protein GI: 94967528
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0617] tRNA nucleotidyltransferase/poly(A) polymerase
TIGRFAM ID: [TIGR00277] uncharacterized domain HDIG; [TIGR01942] poly(A) polymerase
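
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the gene length includes one stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming its length includes one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


length_bp = gene_length(611909, 613273)
print(length_bp)                  # 1365
print(protein_length(length_bp))  # 454
```

Under these assumptions the computed values match the Gene Length and Protein Length fields of the record.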


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGCCCA AGGCAGCCGC CATCGAAATC GTAAAAACGC TCCGCGGCCA CGGCCACCAG 
GCCTATCTCG TTGGCGGTTG CGTGCGCGAT CTCCTGCTCG GCCGCGATCC CGCCGACTAC
GACGTTGCCA CTGACGCCAT CCCCGACCAG GTGATGAAGA TCTTTCCGAA GACATGGGCC
GTCGGCGCGC AGTTCGGCGT AGTCCTGGTT CCGGTGCCTG ACGAACCCGA ATGCGATCCG
GAAGCAACGC ATCCGCAAAG CCATCATCAT GGCTGCATCG AAGTCGCGAC GTTCCGCAGT
GACGGCGTCT ACTCCGATGG CCGCCATCCC GATGAAGTCC GCTTCTCGCA AGACCCCCGC
GAAGACGTCC AGCGCCGCGA CTTCACCATC AACGGCATGC TGCTCGATCC CCTGAACCAC
GACGAAGTTC TCGACTTCGT CGGCGGTCGC GCCGACCTCG CAGCCAAAAT CATTCGCACC
ATTGGTGATC CGCGCCAGCG TTTCGAAGAA GACAAACTCC GCATGCTCCG CGCCGTACGC
TTCGCCGCTC GCTTCGAATA TGAAATCGCG CCCGAAACCA TGGCGGTGAT TCAACAACTC
TCGCCGCAGA TCAGCGTCGT GAGCCACGAA CGCGTGCGCG ATGAACTCAC GAAAATGCTC
ACAGAAGGCC ACGCCCGCCG CGCCTTTGAG TTGCTCGATG AATCCGGCCT TCTCATCCAA
GTTCTTCCAG AGATCGCGAA GATGAAAGGC GTCCAGCAGC CGCCGCAGTT TCATCCTGAG
GGCGATGTTT GGATTCACAC CCTGATGCTG CTCGAGCAAT TGCCACCGAA CTGTTCGCCG
ACCCTGGCCT GGGGTGCGTT GCTTCACGAC GTCGGCAAGC CCCCAACATT CCGCCGCGCG
CCCGATCGCA TCCGCTTCGA TGGCCATGTG GATGTCGGCG TACGCATGAC TGAAGTGCTC
GGCAAGCAAA TGCGTTTCTC CAACGAGCAA CTCGATCAGG TAAGCGCGCT CGTCGAGAAC
CACATGAAGT TCGCAGAAGT GACGCACATG CGCGAGTCCA CCCTCAAGCG TTTCATGCGC
CTGCCGAAGT TCGACGAGCA CATGGAACTC CACAAGATGG ACTGCCTGGC CAGCCATCGC
AACCTCGGGT CTTACGATTT TCTGAAGCAG AAGATGGAAG AGACTCAGCC CAAAGAAATT
CGGCCAGAGC CATTAGTGAC CGGCGCCGAC CTCATCGCAC TCGGCCACAG GCCGGGGCCG
CTCTTCAAAG AGATTTTGTC ATCCGTGGAA GATGGACAAC TGGAAGGCAG ACTGCAATCG
CGGGAACAGG CGATGGCGTT CGTGCAGGCG GAGTTCCCGC TCTGA
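
The gene sequence is laid out in space-separated blocks of 10 bases, so it needs to be joined before any computation such as the GC content reported in the record. A minimal Python sketch of that step follows; the helper names are hypothetical, and the demo uses only the first two blocks of the sequence rather than the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two 10-base blocks of the gene sequence, used as a short demo.
demo = clean_sequence("ATGCAGCCCA AGGCAGCCGC")
print(len(demo))         # 20
print(gc_percent(demo))  # 70.0
```

Passing the complete gene sequence through `clean_sequence` and `gc_percent` would be expected to give a value near the GC content field, although that is not verified here.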
 
Protein sequence
MQPKAAAIEI VKTLRGHGHQ AYLVGGCVRD LLLGRDPADY DVATDAIPDQ VMKIFPKTWA 
VGAQFGVVLV PVPDEPECDP EATHPQSHHH GCIEVATFRS DGVYSDGRHP DEVRFSQDPR
EDVQRRDFTI NGMLLDPLNH DEVLDFVGGR ADLAAKIIRT IGDPRQRFEE DKLRMLRAVR
FAARFEYEIA PETMAVIQQL SPQISVVSHE RVRDELTKML TEGHARRAFE LLDESGLLIQ
VLPEIAKMKG VQQPPQFHPE GDVWIHTLML LEQLPPNCSP TLAWGALLHD VGKPPTFRRA
PDRIRFDGHV DVGVRMTEVL GKQMRFSNEQ LDQVSALVEN HMKFAEVTHM RESTLKRFMR
LPKFDEHMEL HKMDCLASHR NLGSYDFLKQ KMEETQPKEI RPEPLVTGAD LIALGHRPGP
LFKEILSSVE DGQLEGRLQS REQAMAFVQA EFPL