Gene Acid345_1898 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1898 
Symbol 
ID4073360 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2276280 
End bp2279531 
Gene Length3252 bp 
Protein Length1083 aa 
Translation table11 
GC content60% 
IMG OID637983908 
Productglycosyl hydrolase 
Protein accessionYP_590973 
Protein GI94968925 
COG category 
COG ID 
TIGRFAM ID 



Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000419875 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000213557 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
ATGAAAAAGC TCATTTGCGC AGTTTTCGCG TCTCTCATGT TCGTTGCCGC TGGCTATCCA 
CAGGAACCGA TTGCGGACGT GGAGCACGAA GATACCGAAG CCACCCCGCA GCAGCGTCAG
CAGTGGTTCT ACGGTCAGCG CGCCTATCCG TTCAAGGTCA CGCCCGCCGG CGCCCATCGC
CGCGCCTTTG GCGAAGCCAC GCAAATGCGC ATCGATGAAG AAGCGGTTCG CGCCAACGCT
CCTGCTGGTG GCAAAACCAA CACGCTGAAT CCCTGGACCA TGATTGGTCC CAAGCCGTCG
AACCAGGGCG GCTACGGCGT GACCTCGGGT CGCATCACCG CAGTTGCGGT GGACCAGACG
ACTTCCGGCG CGAGCACCGT GCTCTACATC GGCGGTGCGG AAGGCGGTAT CTGGAAGAGC
AGCGACAACG GCTCAACCTG GACCGCGCAG AGCGACAGCC AGCCTACGCT TGCTATCGGT
TCGATCGCGA TTGATCCAAA CAATCACAGC ATCATCTATG CCGGCACCGG CGAAGAGAAC
TTCAGTGGCG ACTCCTACTA CGGCGGCGGA GTGCTCAAGT CCACGAACGG TGGCTCGACC
TGGACAATGC TGGGCGCCTC CTATTTCGGA GGCCCGATCG GATCCGGTTC CTATTACGGC
GGCAGCTTCA TCGGCGCGAT CGCCGTCCAG CCAGGAGTTT CTTCGGGAAC GCCTATCGTT
CTCGCTGGCT CAGAATTCTC CAGCAACGCT AGTTCTGGTG TCTGGCGCTC CACGGATGGC
GGCACAACCT GGGCACGTGT ATTCCCGACG ACTGCACAGC TCTATTCGCA CGTGACGTCG
GTTGTTTGGG TCAGCAAGAC TAAGGCATAC GCCGCCGTCA GCAACGTCTT CGGTGCTTCT
AGTGTGCCAG TCGGCGTGTA CGTTTCAAGC GACAGTGGTG CTACATGGGG CCCCGCGAAT
GGCGTCTCTG GACAGGCATT GCCGGATGGC ACCACAACAG CAGGTCGCTT CACCCTCGCC
GTATCTCCTT CAACGCCGGC GACAATGTAT GTTTCCGTCA GCGATTACAA CACCAGCGGC
CTCTACGGAA TGTACTTCAC CACTGACAGC GGCGGACATT GGAATCCGCT TAAGTCACCT
CTGAATGCGG TTGGCACCAC CAACGATTTC TGCGGTCCGC AGTGTTGGTA CGACATGCCA
CTTGCCGTCC ACCCGACTCA TCCCGGCACT CTCTACGCTG GCGGAAATTT CAACTATGGC
GCGGGCAACG GCGGAGTTTA CGTCAGTCTC AACGCCACCA ATGGCGCTAC CGCGACCTGG
TCTACCCCGA ACCCTGGCAC CAACAGCGTG ACCATGCACC CGGATTTCCA CGCCTTCGCC
TTTTCTGCAG ACGGCAACAC ACTGTACATC GGTGAAGATG GCGGCCTCTG GCGCGGTACG
CCAACCAACA GTGCCACCAT GGCGTGGACC GATCTCAACA CTAATCTTGC GATCACCGAG
TTCTACCCCG GCCTCGCGAT CTACAAAGGC AGCAAAAACA CCGCGCTCAA CGGGACGCAG
GACAATGGCG CCCAACTCTA CACCGGCTCG CTGCAGTGGA CGGTCGTCAC CTGCGGTGAC
GGCGCTTGGG CGGCGATCGA TCCAACGACC GCCAATAATC TCTACGCCGG TTGCACCTCA
GCGAATTTTG AGGGTGTCAT TCGCTCGCTC GACGGGGGCG GCAGTTGGGC CAGCCTCGGC
ACGGGCATCA ACAATTTCGA GAATGTCGCG TTCATCCCAC CGATGATCAT GGATCCCAAA
AGCTCAACCA CCCTGTATTA CGGGACCGAT CACCTCTACA AGATGGTCAA CTCTTCGCAG
CCAAGCCCGT TCCCAACTTG GTCGTACGTA AATGCTTCCG CTCTCACAAG CGGCTATCTC
TCAACGATTG CCGTCAGCGC GGTGAACGGT GCGTACTTCT TCGTCGGTGA CAGTACTGGC
GCGGCGCAAT TCTCAATCAA TTCCGGCGCT AGTTGGACAG CTTTCACCGG ACTCCCAGGG
CGCTTTGTCA GCATGGTTCA AGCCGATCCG CACACTGCCA CCATCGCTTA CGTCACGGTC
TCCGGTTTCA GCGGCTTTAA CGGCGACACC AAGGGCCACG TCTTCAAATG CCTCACGACG
ACTAGCGCTT GCACCGACAT GAGCGGCAAT CTGCCGAACA CGCCCGCCAA CGACATCGTC
ATCGATCCCG ATCTTGCCAA CACCTTCTAT GTCGCGACCG ACGTCGGCGT CTTCACCAGC
ACCAACAACG GCACGACTTG GTCCACCTCA GGAACTGGCT TGCCGAACGT CGCGGTCGTC
GGACTGAAAC TGCACGAAAG CTCCCGCACA TTGCGCGCTG CTACCCATGG CCGCAGCACA
TGGGACCTCT CAGTGCCCAC CTCCACCGTA ACGCCTGCCG CCATGACCAG CCCGGCGAAC
GGCGCGACGA TGACCGGCGC GAGCGCTACC TTCAACTGGA GCGCAGGCAC GGGAGCGACG
CAGTACTCGC TCTATATCGG AAACACCTCG GGCGCGCATG ACATCGCTTT TGTCAGCACG
ACTTCGCTCT CGGCGACCGT CAACACGCTC CCAACCAACG GCGAGAAGTT CTTCGTCAGC
TTGTACTCGT ACATCGGAGG GAAGTGGTAC TACAACGCGT ACTCCTACTA CGCCTCCGGC
ACCGGCGCAG CCGCGACGAT GAGTACACCT ACGCCCGGCA CGAAGTTGAG CAGCGCCAGC
CAGACCTTCA CCTGGACCAA GGGCACCGGC ATCAACTCGT ACTCGCTCTA CATCGGTACG
AAAGCCGGCC TGCACGACAT CGATTTCCTG AACACGAGCA ACACGTCAGC CAGCTTCAGC
AATCTGCCGA CGAACGGCGG GACGTTCTAC GTCACGCTCT ACTCGCTGAA CGGGAAGACC
TGGCTCTCGC ACCCATACAC CTACGTCGCT TCCGGTTCGG GCACGGCCGC AACCATGTCC
ACCCCGACGC CCGGCAGCAC CTTGCCCGGC GCCAGTGTCA CCTTCAATTG GACAACTGGT
TCGGGCGTGA CATCGTACTC GCTGTACATT GGTACGACGG CGGGCGCGCA CAACCTCGAC
TTCATCAACA CCACCTCAAC TTCTGCCAGT GTCACGAATC TTCCAACCAA CGGATCCACC
GTGTACGTCA CCCTGTACTC GTTGATCGGC GGGGTGTGGC ACTCCAACGC CTACACCTAC
AAAGCGCAGT AG
 
Protein sequence
MKKLICAVFA SLMFVAAGYP QEPIADVEHE DTEATPQQRQ QWFYGQRAYP FKVTPAGAHR 
RAFGEATQMR IDEEAVRANA PAGGKTNTLN PWTMIGPKPS NQGGYGVTSG RITAVAVDQT
TSGASTVLYI GGAEGGIWKS SDNGSTWTAQ SDSQPTLAIG SIAIDPNNHS IIYAGTGEEN
FSGDSYYGGG VLKSTNGGST WTMLGASYFG GPIGSGSYYG GSFIGAIAVQ PGVSSGTPIV
LAGSEFSSNA SSGVWRSTDG GTTWARVFPT TAQLYSHVTS VVWVSKTKAY AAVSNVFGAS
SVPVGVYVSS DSGATWGPAN GVSGQALPDG TTTAGRFTLA VSPSTPATMY VSVSDYNTSG
LYGMYFTTDS GGHWNPLKSP LNAVGTTNDF CGPQCWYDMP LAVHPTHPGT LYAGGNFNYG
AGNGGVYVSL NATNGATATW STPNPGTNSV TMHPDFHAFA FSADGNTLYI GEDGGLWRGT
PTNSATMAWT DLNTNLAITE FYPGLAIYKG SKNTALNGTQ DNGAQLYTGS LQWTVVTCGD
GAWAAIDPTT ANNLYAGCTS ANFEGVIRSL DGGGSWASLG TGINNFENVA FIPPMIMDPK
SSTTLYYGTD HLYKMVNSSQ PSPFPTWSYV NASALTSGYL STIAVSAVNG AYFFVGDSTG
AAQFSINSGA SWTAFTGLPG RFVSMVQADP HTATIAYVTV SGFSGFNGDT KGHVFKCLTT
TSACTDMSGN LPNTPANDIV IDPDLANTFY VATDVGVFTS TNNGTTWSTS GTGLPNVAVV
GLKLHESSRT LRAATHGRST WDLSVPTSTV TPAAMTSPAN GATMTGASAT FNWSAGTGAT
QYSLYIGNTS GAHDIAFVST TSLSATVNTL PTNGEKFFVS LYSYIGGKWY YNAYSYYASG
TGAAATMSTP TPGTKLSSAS QTFTWTKGTG INSYSLYIGT KAGLHDIDFL NTSNTSASFS
NLPTNGGTFY VTLYSLNGKT WLSHPYTYVA SGSGTAATMS TPTPGSTLPG ASVTFNWTTG
SGVTSYSLYI GTTAGAHNLD FINTTSTSAS VTNLPTNGST VYVTLYSLIG GVWHSNAYTY
KAQ