Gene Acid345_3122 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3122 
Symbol 
ID4070236 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp3712315 
End bp3715758 
Gene Length3444 bp 
Protein Length1147 aa 
Translation table11 
GC content59% 
IMG OID637985141 
Productglycosyl hydrolases 38-like 
Protein accessionYP_592197 
Protein GI94970149 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0383] Alpha-mannosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCCGCA TCTGTTCTTT CGTTCTCGCT CTCGTAATCT GTTCGATCTT CTCCGGCATA 
CTGTTCGCGC AAACCCCAGC CCCGAAGCCG GTTACGAAGG CCAAAGCTGA CCTCAGCAAG
CCGTCGGTCT ACGTAGTCGG TTACGCGCAC CTCGATACCG AGTGGCGATG GGAGTATCCG
CTGGTGATTC GCGAATATCT CTCGAAGACA ATGCGCAATA ACTTCGCGCT CTTCGAGAAA
TATCCGGACT ACATCTTCAA TTTCAGTGGC GCAAACCGTT ATCGGCTGAT GAAGGAGTAT
TACCCCGAGG ATTACAAGCG CCTGCAACAC TACGTCGCGG CGGGAAGGTG GTTCCCTGCC
GGCTCTTCCA TGGAAGAAAG CGATGTGAAC TCGCCGTCTG CGGAGTCCAT CTTCCGGCAG
GTGCTCTACG GAAACGAATT CTTTCGACGC GATTTCGGAA AAGCGAGTTC GGAATACATG
CTGCCGGATT GCTTCGGCTT TCCTGCTTCG TTGCCCAGCA TCCTTGCGCA CGCCGGCGTG
AAGGGTTTCT CCACGCAGAA GCTGACGTGG GGATCGTCGG CCGATGCCGG CGGATGGGAT
TCGCCCGAGC GCACGCCGAT GGGTACGCCG TTCAACGTTG GCCTGTGGGA AGGACCCGAC
GGCAAGAGCG TGATCGCCGC TTTTAATCCA GGAAGCTACG CCGCTGATCT CTCCACTGAC
CTAACGAAAC CTCTACCTGA AGGCACACGC TCGACGCGCG GCAAGACCGA AGAAGCGCAT
CAACTCTGGC TGTTTCAGGA TGACTGGGCG CGGCGCGTGG CGCGCAATGG CGAAGTCACC
GGACTCTTTA CCGACTACCA CTACTTCGGC ACCGGCGACA TTGGCGGCAG TGCGACGGAG
CGCTCGGTGC AACTGCTCGA AGCCATGCTC CACAAAAGGC CGTTCGCGCT TCCCGCCTAC
ACCGGCCAGG AACAGTGGAC AACGTTCGGT AAAGAAGCGC TGGTTGGCGA CGGCCCGGTG
CGCGTGATTT CTGCAACCGC CGACCAGATG TTCCTCGACA TCGGCAATAA CACCGCGAAG
CTGCCACGCT ATAAGGGCGA ACTCGAACTC ACCAACCACT CCGCGGGATC GCTCACCTCT
GAGGCTTACC AGAAGCGCTG GAACCGCAAG AACGAACTGC TCGCTGATGC GGCGGAGAAG
GCTTCCGTCG CGGCGGCGTG GCTGGGTGGG CGCGTGTATC CGCAGAAGCG CCTGAACGAT
GCCTGGACAC TGGTGATGGG AGGCCAGTTC CACGACATCA TGGCCGGCAC CGCAACGCCG
CAGTCGTACA ACTATTCATG GAACGATGAT GTGATCGCGA TGAACCAGTT CGCCGGCGTG
ATGCAGGATG CCGTCGGCAC CGTCGCGGCG GCGATGGATA CACGCGGTGA TGGCATACCA
ATCGTTGTGT ATAACCCGCT GAATGTGTCA CGCGAGGATC TTGTGGAAGC AACAACTCTG
CTTGGCGATG GCGACGCCCG CGTGATTGGC CCGGATGGAA ACGAAGTGCC TTCGCAGCGC
GACGGAAACA AAGTAGTGTT CGCAGCAAAG GTCCCGTCTG TGGGCTTCGC TGTTTATCAC
GTGCAGAGCG GCGGAAATTC GCTTTCAGAA TTGAAAGTCA CGGAGTCCTC GCTTGAGAAC
GCGCGCTATC GCGTGCAGAT CGATGCGAAT GGCGATGTCA GCAGCATCTT CGACAAGAAG
CTCAATCGCG AACTGCTCTC GGCTCCGGCG CGACTTGCGT TCCTGACGGA GAATCCCGCG
CAGTGGCCGG CGTGGAATAT GGATTACGAA GATCAAATGC GGCCGCCGCG AGCATATGTC
TCTGGTCCGG CAAAAGTGCG GATCGCAGAA CGTGGACCGG CGCGAGTGGC GCTGGAGATC
GAGCGCGAGG CCGAAGGTTC GAAGTTCGTG CAGACGATTC GGCTTTCCGC AGGCGATGCC
GGCAATCGCG TGGAGTTCAC CAACGAGATC GATTGGCAAA CCAAGGAATC GGCGCTGAAA
GCTGTCTTCC CGCTGGCGTT CAGCAATGAG AACGCAACGT ACAACTGGGA CGTCGGGACG
ATCGAGCGTC CGACGAACAC GCCGAAGAAG TTCGAAGTAC CGTCGCATCA GTGGTTCGAT
CTCACCGACA GGAGCGGCAA CGGTGGCGTG ACGATCCTCT CCGATTGCAA GTACGGGTCC
GACAAGCCTG ACGATCACAC GCTGCGCTTG ACTCTCATCT ATTCGCCCGG GCTCGGCGGC
AAGCAGAGCG ACTACGCGGA CCAGACGACG CAGGATTTTG GCCACCACCA GATCGTCTAT
GGGCTCTCGG CACATGACGG AGACTGGCGG AAGGCGCGCA CCGACTGGCA AGGCTATCGG
CTGAATCAGC CGCTGATTGC GTTCGTCGTG CCGCAGCATG AAGGCGCGCT GGGGAAGCAA
TTCTCGCTGG TCTCAGTGGA CAACCCGTCC ATCCGCGTTT TGGCTTTGAA GAAGGCCGAG
AACTCCGGGG ACATCGTGGT GCGGTTGGTT GAAACCGATG GCCGCGACAC GAAGAACGTG
CACACGAAAT TTGCTTCTGC GATCTCCTCC GCGAGAGAAC TCAACGGCCA GGAGCAACCG
CTGGGTGCCG TGAGTGTCAG CAATGGTTCC CTGGAGACTT CCTTCACGCC GTACCAGCTA
CACACGATTG CGGTGAAACT CGCGCCTTCA ACTGCGCACG TTGCGCCCGC GAAATGGCAA
GCGGTCGCGC TGAATTACGA CACCTCGGTC GCGAGCTTCG AGGGCAAGCC CGCCGAAGGC
TGCATGGATT GCTCGTGGAA CGAGCCTGCC GCAGATGGCC AGGGGCACGC GTATCCAGCC
GAGATGTTGC CGGCGTCGAT CGCGTTCCAG GGCGTGGAAT TCCGCATCGC GCCGAGTGGC
AAGAATGATG CAGTCATCGC GCGTGGACAA GCCATCTCCC TGCCTGTGGG CGATTTCACG
CGCGCCTACG TGCTCGCATC GGCAATCGGC GACCAAAGCG CCAAGTTCAG AGGCGTCATG
CAAGCTTTCA AGATTCCTGA CTGGACTGGC TTCGTCGGCC AATGGGACAA CCGCAAGTGG
AACATTCGCA AAGAAACCGT TCCGGCGAAA GGCAACGATC CGGAGTACGT GCGCACCGTG
ATGGACTTCA CCGGCAAAAT TACGCCGGGC TTTATCAAGC GCGCCGATAT CGCGTGGTAC
GCATCGCATC GCCACGATAC GAATGGCAGC AACGAACCCT ATTCGTATTC GTACCTGTTC
GCGATCCCTA TTGACTTCCC ACCAGGCACA GAAACACTCA CTCTGCCCAA CAACGACAAA
GTCCGAATAC TCGCTATCAC CGTGACCAGC GACCATGCGG CCGCGCGACC GGTGCAGCCG
CTCTACGACA CGCTTGAACA CTAG
 
Protein sequence
MRRICSFVLA LVICSIFSGI LFAQTPAPKP VTKAKADLSK PSVYVVGYAH LDTEWRWEYP 
LVIREYLSKT MRNNFALFEK YPDYIFNFSG ANRYRLMKEY YPEDYKRLQH YVAAGRWFPA
GSSMEESDVN SPSAESIFRQ VLYGNEFFRR DFGKASSEYM LPDCFGFPAS LPSILAHAGV
KGFSTQKLTW GSSADAGGWD SPERTPMGTP FNVGLWEGPD GKSVIAAFNP GSYAADLSTD
LTKPLPEGTR STRGKTEEAH QLWLFQDDWA RRVARNGEVT GLFTDYHYFG TGDIGGSATE
RSVQLLEAML HKRPFALPAY TGQEQWTTFG KEALVGDGPV RVISATADQM FLDIGNNTAK
LPRYKGELEL TNHSAGSLTS EAYQKRWNRK NELLADAAEK ASVAAAWLGG RVYPQKRLND
AWTLVMGGQF HDIMAGTATP QSYNYSWNDD VIAMNQFAGV MQDAVGTVAA AMDTRGDGIP
IVVYNPLNVS REDLVEATTL LGDGDARVIG PDGNEVPSQR DGNKVVFAAK VPSVGFAVYH
VQSGGNSLSE LKVTESSLEN ARYRVQIDAN GDVSSIFDKK LNRELLSAPA RLAFLTENPA
QWPAWNMDYE DQMRPPRAYV SGPAKVRIAE RGPARVALEI EREAEGSKFV QTIRLSAGDA
GNRVEFTNEI DWQTKESALK AVFPLAFSNE NATYNWDVGT IERPTNTPKK FEVPSHQWFD
LTDRSGNGGV TILSDCKYGS DKPDDHTLRL TLIYSPGLGG KQSDYADQTT QDFGHHQIVY
GLSAHDGDWR KARTDWQGYR LNQPLIAFVV PQHEGALGKQ FSLVSVDNPS IRVLALKKAE
NSGDIVVRLV ETDGRDTKNV HTKFASAISS ARELNGQEQP LGAVSVSNGS LETSFTPYQL
HTIAVKLAPS TAHVAPAKWQ AVALNYDTSV ASFEGKPAEG CMDCSWNEPA ADGQGHAYPA
EMLPASIAFQ GVEFRIAPSG KNDAVIARGQ AISLPVGDFT RAYVLASAIG DQSAKFRGVM
QAFKIPDWTG FVGQWDNRKW NIRKETVPAK GNDPEYVRTV MDFTGKITPG FIKRADIAWY
ASHRHDTNGS NEPYSYSYLF AIPIDFPPGT ETLTLPNNDK VRILAITVTS DHAAARPVQP
LYDTLEH