Gene Acid345_4618 details

Gene Information

Locus tag: Acid345_4618
Symbol:
ID: 4070775
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 5470932
End bp: 5472461
Gene Length: 1530 bp
Protein Length: 509 aa
Translation table: 11
GC content: 51%
IMG OID: 637986658
Product: hypothetical protein
Protein accession: YP_593692
Protein GI: 94971644
COG category: [L] Replication, recombination and repair
COG ID: [COG0323] DNA mismatch repair enzyme (predicted ATPase)
TIGRFAM ID:
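
The coordinate and length fields above are consistent with one another, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming the start and end positions are 1-based and inclusive and that the gene length includes a terminal stop codon; the function name `feature_lengths` is hypothetical and not part of any database API.

```python
def feature_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumptions (not stated in the record itself): coordinates are
    1-based and inclusive, and the last codon is a stop codon that
    does not contribute an amino acid.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


# Values taken from the record: Start bp 5470932, End bp 5472461
print(feature_lengths(5470932, 5472461))  # (1530, 509)
```

The result matches the listed Gene Length (1530 bp) and Protein Length (509 aa).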


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0329503
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.488499
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGGTATT CCCACCCGAA CGACCCCAAT GCGGGTCGCC TGATTGAATC TCTCCGTCAC 
CTTGGCTATG GAAACTATGA AGCGGTAGCC GACATCGTTG ACAACTCAAT TGATGCGGAT
GCTCAGAACA TCAACATCCG AGTACAGACT AAGTCAAATC AAATCATCAT TAGCATTGCC
GACGATGGGC GAGGTATGTC GAAATCCATC CTCGACCAAG CTATGCGCCT GGGATCGCTG
ACCGACCGCA ATGCCGAGTC GGATCTCGGC AAATTCGGCA TGGGGCTGGT GACAGCAAGC
CTTTCGATGG CAAAGAAGCT ACATGTCGTC TCACGTGGCG ACGATGGGTG CTGGTCGAGC
GCATGGGATG TCGACGAGAT CGTTGCGCAG AATGCGTTTC TCAAGCACTT TGAAGCTGCA
ACATCCGACG AGGAAGAACT TCTAGCTGAA GAGATCGGTA AGAAGAAAAC CGGAACGCTG
GTGCTGCTTT CAAAATGCGA CAACCTTGCC AATAAAAACA CCAGCTCATT TGCGTCTAAT
CTGAGATCGC ATCTTGGGCG CGTACATCGC TACTTCATCG GTGCTGGTAG AGTTGTGACC
GTGAATGGCG AGCCTGTGGA AGCGATCGAT CCACTTCAAC TCGCGGATCC AGACACGGAA
ACCGTGCTCG ATGATGTCAT CTCGGTAACA TTGACAGACG ACGGCGAGAA GAAGACTGAC
AACGTTAGGG TCAGAGTTGT GCTCATCCCG GAATCTCCCG TCACTGACCT CGATGTCGGC
AAGTCTCTCA AGGCTCAGGG TTTCTATGTA ATGCGCAATC AACGCGAGGT GATGAACGCG
GCCGCCCTCG GGTTCTTCAC CAAGCACAAC GATTTCAACC GAATGAGGGG TGAACTGTTT
TTCCCAGGCA CTCTGGACCG CCTTGTTGGA ATCGAGTTCA CGAAACGGCA GGTTGAATTC
GAACAGAGTC TTCAGGATCA ACTAAACAAC GTTCTGATAC CGGTCTGTCG AACAATCAAA
AGGCGCGAAG CAACCAAGAA GCGAGTTCAA AGCGGCGAAG CACAGTTGAA GTTGCACGCT
CAATCGATGA AGGTCATCGC GGAAAAAGAC AAACTTTTGA TCAAGCCGAA GGCCGTCATT
GAAAAGCGTT CATCACCGCG TAACGGCAGC GGTGTGCAAG TCGATGACGC TCTAGATACA
AATAAAGAAC GCAAAAACTT CAATCGTTCA CAGCTGGTTG AAACGAGGCT CAATTGCGTC
ATTCGAGAAG AAAGACTCGG GCCGAACGGC CAAATTTATG AATGCGAGAT GGAGGGAAGA
AAGCTCGTCA TTCGCTATAA CGTTGAGCAT CCCTTCTACC AACGGTTCGT GACCGACAAC
ATGGATGAAG CTCGCGCTGT CACTGCCACC GATTTTTTGA TTTACAGCAT GGCTTCGGCG
GAGTTGAAGT TTCTGGATGA AGGTGATCTG GAGGCTGTGA ATAACTTCAA GGCCGTGCTT
TCCGCTAACT TGCGAACGCT TCTGAACTAA
 
Protein sequence
MRYSHPNDPN AGRLIESLRH LGYGNYEAVA DIVDNSIDAD AQNINIRVQT KSNQIIISIA 
DDGRGMSKSI LDQAMRLGSL TDRNAESDLG KFGMGLVTAS LSMAKKLHVV SRGDDGCWSS
AWDVDEIVAQ NAFLKHFEAA TSDEEELLAE EIGKKKTGTL VLLSKCDNLA NKNTSSFASN
LRSHLGRVHR YFIGAGRVVT VNGEPVEAID PLQLADPDTE TVLDDVISVT LTDDGEKKTD
NVRVRVVLIP ESPVTDLDVG KSLKAQGFYV MRNQREVMNA AALGFFTKHN DFNRMRGELF
FPGTLDRLVG IEFTKRQVEF EQSLQDQLNN VLIPVCRTIK RREATKKRVQ SGEAQLKLHA
QSMKVIAEKD KLLIKPKAVI EKRSSPRNGS GVQVDDALDT NKERKNFNRS QLVETRLNCV
IREERLGPNG QIYECEMEGR KLVIRYNVEH PFYQRFVTDN MDEARAVTAT DFLIYSMASA
ELKFLDEGDL EAVNNFKAVL SANLRTLLN
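
The sequences above are laid out in space-separated blocks of 10 residues, 60 per line. A small sketch of how such a block could be collapsed into a plain string, and how a GC percentage could be computed from it, is shown below. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the method the database used to derive its reported GC content is not stated in the record.

```python
def clean_sequence(block_text):
    """Collapse a block-formatted sequence into one uppercase string.

    Removes all whitespace (spaces between 10-residue blocks and
    line breaks between 60-residue rows).
    """
    return "".join(block_text.split()).upper()


def gc_percent(seq):
    """Return the percentage of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Last line of the gene sequence as printed in the record
last_line = "TCCGCTAACT TGCGAACGCT TCTGAACTAA"
seq = clean_sequence(last_line)
print(len(seq))  # 30
print(round(gc_percent(seq), 1))
```

Applying `clean_sequence` to the full gene sequence and taking `len` of the result is a quick way to check it against the listed gene length.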